Export ClickHouse to AWS S3 in JSON
Fast, parallel data export with zero intermediate storage
.\FastBCP.exe `
--connectiontype "clickhouse" `
--server "host.domain | host.domain,port | host.domain,port/service" `
--database "tpch" `
--trusted `
--sourceschema "tpch10" `
--sourcetable "orders" `
--query "SELECT * FROM tpch10.orders" `
--directory "s3://rootbucket/fastbcpexport/raw/{sourcedatabase}/{sourceschema}" `
--fileoutput "{sourcetable}.json" `
--decimalseparator "." `
--dateformat "yyyy-MM-dd HH:mm:ss" `
--encoding "UTF-8" `
--method "Ntile" `
--distributekeycolumn "o_orderkey" `
--paralleldegree -2 `
--merge false `
--runid "runidfromcaller"
Source - ClickHouse
ClickHouse is an ultra-fast column-oriented analytics database. FastBCP leverages ClickHouse's architecture for high-performance exports.
Features:
- Optimized for analytical queries
- Native columnar format support
- Exceptional performance on large volumes
Parallel Method - Ntile
Divides data into N equal partitions based on a numeric column.
Requirement: a numeric distribution column, set with --distributekeycolumn (o_orderkey in the command above)
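To make the idea concrete, here is a minimal Python sketch of NTILE-style bucketing: sort the distribution key, then cut the ordered values into N groups whose sizes differ by at most one row. It illustrates the partitioning concept only and is not FastBCP's implementation; the function name ntile_buckets and the sample key values are made up for this example.

```python
def ntile_buckets(keys, n):
    """Split keys into n buckets of near-equal size, ordered by key value."""
    ordered = sorted(keys)
    base, extra = divmod(len(ordered), n)
    buckets = []
    start = 0
    for i in range(n):
        # The first `extra` buckets take one additional row,
        # which mirrors how SQL NTILE distributes a remainder.
        size = base + (1 if i < extra else 0)
        buckets.append(ordered[start:start + size])
        start += size
    return buckets


# Hypothetical o_orderkey values, split across 4 workers.
order_keys = [7, 1, 32, 3, 5, 2, 33, 4, 6, 34]
for number, bucket in enumerate(ntile_buckets(order_keys, 4), start=1):
    print(number, bucket)
```

Because the cut is made on row counts rather than on value ranges, gaps or skew in the key values do not unbalance the buckets, which is why a numeric, sortable key is enough.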
Available parallel methods with ClickHouse:
Output Format - JSON (JavaScript Object Notation)
JSON is the most widely used data exchange format on the web. FastBCP exports data to JSON Lines (NDJSON) format, with one record per line for efficient streaming.
Features:
- JSON Lines (NDJSON) format
- One record per line for streaming
- UTF-8 encoding
- Each line is a complete JSON value that any standard JSON parser can read
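Since a JSON Lines file holds one JSON value per line rather than a single JSON document, a consumer parses it line by line. The following Python sketch shows that pattern with the standard library only. The helper name read_json_lines and the two sample records are illustrative assumptions; the column names follow the TPC-H orders table used in the command above.

```python
import io
import json


def read_json_lines(stream):
    """Yield one parsed record per non-empty line of a JSON Lines stream."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


# Made-up sample standing in for an exported orders.json file.
sample = io.StringIO(
    '{"o_orderkey": 1, "o_orderstatus": "O"}\n'
    '{"o_orderkey": 2, "o_orderstatus": "F"}\n'
)
records = list(read_json_lines(sample))
print(len(records), records[0]["o_orderkey"])
```

For a real export, replace the StringIO sample with a file opened using encoding="utf-8"; the generator keeps only one line in memory at a time, so file size is not a constraint.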
Destination - Amazon S3
Amazon S3 is the industry-leading object storage service. FastBCP uploads exported files directly to S3 buckets with parallel multipart uploads for optimal performance.
Storage Type: Cloud Object Storage
Features:
- Parallel multipart upload
- Server-side encryption support
- S3 bucket and prefix configuration
- IAM role and credential support
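The bucket and prefix come from the --directory value in the command above, which contains {sourcedatabase} and {sourceschema} placeholders, while --fileoutput uses {sourcetable}. The Python sketch below shows how such a template could resolve to a bucket and object key. It assumes the placeholders are filled from --database, --sourceschema, and --sourcetable; that mapping and the helper name expand_target are assumptions for illustration, and the object names FastBCP actually writes (particularly with --merge false) may differ.

```python
from urllib.parse import urlparse


def expand_target(directory, fileoutput, values):
    """Fill {placeholders}, then split the s3:// URL into (bucket, key)."""
    url = directory.format(**values).rstrip("/") + "/" + fileoutput.format(**values)
    parsed = urlparse(url)
    # For s3://bucket/path, the bucket is the network location
    # and the object key is the path without its leading slash.
    return parsed.netloc, parsed.path.lstrip("/")


bucket, key = expand_target(
    "s3://rootbucket/fastbcpexport/raw/{sourcedatabase}/{sourceschema}",
    "{sourcetable}.json",
    # Assumed mapping from the command's --database, --sourceschema, --sourcetable.
    {"sourcedatabase": "tpch", "sourceschema": "tpch10", "sourcetable": "orders"},
)
print(bucket, key)
```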