Export Teradata to AZ Datalake Gen2 in Parquet
Fast, parallel data export with zero intermediate storage
Terminal
.\FastBCP.exe `
--connectiontype "teradata" `
--server "host.domain | host.domain,port | host.domain,port/service" `
--database "tpch" `
--trusted `
--sourceschema "tpch10" `
--sourcetable "orders" `
--query "SELECT * FROM tpch10.orders" `
--directory "abfss://storageaccount.dfs.core.windows.net/fastbcpexport/raw/{sourcedatabase}/{sourceschema}" `
--fileoutput "{sourcetable}.parquet" `
--method "Ntile" `
--distributekeycolumn "o_orderkey" `
--paralleldegree -2 `
--merge false `
--runid "runidfromcaller"Source - Teradata
Teradata is an enterprise data warehouse platform. FastBCP ensures optimized exports for Teradata environments.
Features:
- •Native Teradata protocol support
- •Optimized for massively parallel architectures
- •Compatible with on-premise and cloud environments
Parallel Method - Ntile
Divides data into N equal partitions based on a numeric column.
Requirement: Requires a numeric distribution column
Available parallel methods with Teradata:
Output Format - Apache Parquet
Parquet is a columnar file format optimized for analytical processing. FastBCP exports to Parquet with compression and type preservation, ideal for data lake architectures.
Features:
- •Columnar storage for efficient analytics
- •Integrated compression (Snappy, Gzip)
- •Full data type preservation
- •Optimized for big data tools (Spark, Hadoop)
Destination - Azure Data Lake Gen2
Azure Data Lake Storage Gen2 combines Azure Blob Storage with hierarchical namespace for big data analytics. FastBCP optimizes uploads for ADLS Gen2.
Storage Type:
Cloud Data Lake
Features:
- •Hierarchical namespace support
- •Optimized for analytics workloads
- •POSIX-compliant access control
- •Native integration with Azure services