Export ClickHouse to Azure Data Lake Gen2 in CSV
Fast, parallel data export with zero intermediate storage
.\FastBCP.exe `
--connectiontype "clickhouse" `
--server "host.domain | host.domain,port | host.domain,port/service" `
--database "tpch" `
--trusted `
--sourceschema "tpch10" `
--sourcetable "orders" `
--query "SELECT * FROM tpch10.orders" `
--directory "abfss://storageaccount.dfs.core.windows.net/fastbcpexport/raw/{sourcedatabase}/{sourceschema}" `
--fileoutput "{sourcetable}.csv" `
--decimalseparator "." `
--delimiter "|" `
--dateformat "yyyy-MM-dd HH:mm:ss" `
--encoding "UTF-8" `
--method "Ntile" `
--distributekeycolumn "o_orderkey" `
--paralleldegree -2 `
--merge false `
--runid "runidfromcaller"

Source - ClickHouse
ClickHouse is an ultra-fast column-oriented analytics database. FastBCP leverages ClickHouse's architecture for high-performance exports.
Features:
- Optimized for analytical queries
- Native columnar format support
- Exceptional performance on large volumes
Parallel Method - Ntile
Divides data into N equal partitions based on a numeric column.
Requirement: a numeric distribution column, passed with --distributekeycolumn (o_orderkey in the command above).
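To make the partitioning idea concrete, here is a minimal Python sketch of an NTILE-style split: sorted key values are cut into N contiguous buckets with near-equal row counts, and each bucket becomes a key range that one worker could export. This illustrates the general technique, not FastBCP's internal implementation; the function name `ntile_ranges`, the sample keys, and the generated WHERE clauses are assumptions made for the example.

```python
def ntile_ranges(sorted_keys, n):
    """Split sorted numeric keys into n contiguous buckets of near-equal size.

    Returns inclusive (low_key, high_key) bounds, one per non-empty bucket.
    Like SQL NTILE(n), the first (len % n) buckets receive one extra row.
    Assumes keys are unique (as o_orderkey is in TPC-H), so ranges do not overlap.
    """
    total = len(sorted_keys)
    base, extra = divmod(total, n)
    ranges = []
    start = 0
    for i in range(n):
        size = base + (1 if i < extra else 0)
        if size == 0:
            continue  # fewer rows than buckets: skip empty buckets
        end = start + size
        ranges.append((sorted_keys[start], sorted_keys[end - 1]))
        start = end
    return ranges


# Hypothetical o_orderkey values 1..10 split across 4 workers.
keys = list(range(1, 11))
ranges = ntile_ranges(keys, 4)
for low, high in ranges:
    # Each range could drive one parallel query (illustrative only).
    print(f"SELECT * FROM tpch10.orders WHERE o_orderkey BETWEEN {low} AND {high}")
```

Because the split is by row count rather than by key value, skewed key distributions still yield buckets of similar size.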
Output Format - CSV (Comma-Separated Values)
CSV is the universal standard for tabular data exchange. FastBCP exports data to CSV format with configurable delimiters, encoding, and date formats for maximum compatibility.
Features:
- Configurable delimiters and separators
- Multiple encoding support (UTF-8, ASCII, etc.)
- Custom date and decimal formats
- Header row support
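On the consuming side, the settings used in the command above ("|" delimiter, "." decimal separator, "yyyy-MM-dd HH:mm:ss" dates, UTF-8) can be read back with Python's standard library. The sketch below is only an illustration: the three-column layout and sample values are hypothetical (the real orders table has more columns), and it assumes the file has no header row.

```python
import csv
import io
from datetime import datetime
from decimal import Decimal

# Python strptime equivalent of the "yyyy-MM-dd HH:mm:ss" pattern.
DATE_FORMAT = "%Y-%m-%d %H:%M:%S"


def parse_rows(text, delimiter="|"):
    """Parse delimited text into (int key, Decimal amount, datetime) tuples."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    rows = []
    for order_key, total_price, order_date in reader:
        rows.append((
            int(order_key),
            Decimal(total_price),  # "." is the decimal separator
            datetime.strptime(order_date, DATE_FORMAT),
        ))
    return rows


# Hypothetical sample rows; not actual FastBCP output.
sample = "1|173665.47|1996-01-02 00:00:00\n2|46929.18|1996-12-01 00:00:00\n"
rows = parse_rows(sample)
for row in rows:
    print(row)
```

For a real file, open it with `encoding="utf-8"` and `newline=""` and pass the file object to `csv.reader` instead of the in-memory string.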
Destination - Azure Data Lake Gen2
Azure Data Lake Storage Gen2 combines Azure Blob Storage with a hierarchical namespace for big data analytics. FastBCP optimizes uploads for ADLS Gen2.
Storage Type: Cloud Data Lake
Features:
- Hierarchical namespace support
- Optimized for analytics workloads
- POSIX-like access control lists (ACLs)
- Native integration with Azure services
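The --directory and --fileoutput values in the command use placeholders such as {sourcedatabase}, {sourceschema}, and {sourcetable}. As a rough sketch, assuming these are plain string substitutions from the other arguments, the target path would resolve as shown below. The `expand` helper is hypothetical and written only for this illustration; how FastBCP names per-partition files when --merge is false is not covered here.

```python
def expand(template, values):
    """Replace each {name} placeholder in template with its value."""
    out = template
    for name, value in values.items():
        out = out.replace("{" + name + "}", value)
    return out


# Values taken from the --database, --sourceschema and --sourcetable arguments.
values = {
    "sourcedatabase": "tpch",
    "sourceschema": "tpch10",
    "sourcetable": "orders",
}

directory = expand(
    "abfss://storageaccount.dfs.core.windows.net/fastbcpexport/raw/"
    "{sourcedatabase}/{sourceschema}",
    values,
)
filename = expand("{sourcetable}.csv", values)
print(directory + "/" + filename)
```

Keeping database and schema in the path gives each source its own folder in the lake, so repeated exports of different tables do not collide.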