Export ClickHouse to GCS in JSON

    Fast, parallel data export with zero intermediate storage

    Terminal
    .\FastBCP.exe `
    --connectiontype "clickhouse" `
    --server "host.domain | host.domain,port | host.domain,port/service" `
    --database "tpch" `
    --trusted `
    --sourceschema "tpch10" `
    --sourcetable "orders" `
    --query "SELECT * FROM tpch10.orders" `
    --directory "gs://rootbucket/fastbcpexport/raw/{sourcedatabase}/{sourceschema}" `
    --fileoutput "{sourcetable}.json" `
    --decimalseparator "." `
    --dateformat "yyyy-MM-dd HH:mm:ss" `
    --encoding "UTF-8" `
    --method "Ntile" `
    --distributekeycolumn "o_orderkey" `
    --paralleldegree -2 `
    --merge false `
    --runid "runidfromcaller"
    Get FastBCP

    Source - ClickHouse

    ClickHouse is an ultra-fast column-oriented analytics database. FastBCP leverages ClickHouse's architecture for high-performance exports.

    Features:

    • Optimized for analytical queries
    • Native columnar format support
    • Exceptional performance on large volumes

    Parallel Method - Ntile

    Divides data into N equal partitions based on a numeric column.

    Requirement: Requires a numeric distribution column

    Available parallel methods with ClickHouse:

    Output Format - JSON (JavaScript Object Notation)

    JSON is the most widely used data exchange format on the web. FastBCP exports data to JSON Lines (NDJSON) format, with one record per line for efficient streaming.

    Features:

    • JSON Lines (NDJSON) format
    • One record per line for streaming
    • UTF-8 encoding
    • Compatible with all JSON parsers

    Destination - Google Cloud Storage

    Google Cloud Storage provides unified object storage. FastBCP uploads files to GCS buckets with parallel chunked uploads for fast transfers.

    Storage Type:

    Cloud Object Storage

    Features:

    • Parallel chunked upload
    • Service account authentication
    • Bucket and prefix configuration
    • Multi-regional storage classes