Export PostgreSQL to Azure Data Lake Gen2 in BSON
Fast, parallel data export with zero intermediate storage
Terminal
.\FastBCP.exe `
--connectiontype "pgsql" `
--server "host.domain | host.domain,port | host.domain,port/service" `
--database "tpch" `
--trusted `
--sourceschema "tpch10" `
--sourcetable "orders" `
--query "SELECT * FROM tpch10.orders" `
--directory "abfss://storageaccount.dfs.core.windows.net/fastbcpexport/raw/{sourcedatabase}/{sourceschema}" `
--fileoutput "{sourcetable}.bson" `
--method "Ntile" `
--distributekeycolumn "o_orderkey" `
--paralleldegree -2 `
--merge false `
--runid "runidfromcaller"
Source - PostgreSQL
PostgreSQL is a powerful and robust open-source relational database. FastBCP optimizes exports from PostgreSQL using its native connector for excellent performance.
Features:
- Direct streaming read from database
- Full support for PostgreSQL data types
- Secure SSL connections
Parallel Method - Ntile
Divides the data into N partitions of roughly equal size based on a numeric column.
Requirement: a numeric distribution column (in the command above, o_orderkey, passed with --distributekeycolumn).
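To make the partitioning idea concrete, here is a minimal Python sketch of how SQL NTILE(n) semantics split an ordered key list into n buckets, with the first buckets taking one extra row when the count does not divide evenly. The function name ntile_ranges, the sample key values, and the generated range queries are hypothetical illustrations; how FastBCP actually builds its per-partition reads is not documented here and is an assumption.

```python
def ntile_ranges(sorted_keys, n):
    """Split sorted keys into n buckets as SQL NTILE(n) would and
    return the (min_key, max_key) bounds of each non-empty bucket."""
    if n < 1:
        raise ValueError("n must be at least 1")
    base, extra = divmod(len(sorted_keys), n)
    ranges = []
    start = 0
    for bucket in range(n):
        # The first `extra` buckets receive one additional row.
        size = base + (1 if bucket < extra else 0)
        if size == 0:
            break  # fewer rows than buckets: remaining buckets are empty
        chunk = sorted_keys[start:start + size]
        ranges.append((chunk[0], chunk[-1]))
        start += size
    return ranges


# Hypothetical o_orderkey values (sparse on purpose).
keys = [1, 2, 3, 4, 5, 6, 7, 32, 33, 34]

# Assumption: each partition could be read with a key-range query like this.
for low, high in ntile_ranges(keys, 4):
    print(f"SELECT * FROM tpch10.orders WHERE o_orderkey BETWEEN {low} AND {high}")
```

Because the buckets are sized by row count rather than by key range, sparse or skewed key values still yield partitions with similar numbers of rows.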
Available parallel methods with PostgreSQL:
Output Format - BSON (Binary JSON)
BSON is a binary-encoded serialization format, primarily used by MongoDB. FastBCP exports to BSON format for efficient integration with MongoDB ecosystems.
Features:
- Binary encoding for compact storage
- MongoDB native format
- Data type preservation
- Efficient for document databases
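As an illustration of what "binary encoding" and "data type preservation" mean in practice, the following Python sketch hand-encodes a single row as a BSON document following the public BSON specification (little-endian length prefix, typed elements, trailing null byte). It covers only three value types. The helper names, the o_orderstatus and o_totalprice columns, and the sample values are assumptions for illustration; how FastBCP maps PostgreSQL types to BSON types is not shown here.

```python
import struct


def encode_element(name, value):
    """Encode one key/value pair as a BSON element: type byte, cstring name, payload."""
    key = name.encode("utf-8") + b"\x00"
    if isinstance(value, float):
        # 0x01: 64-bit IEEE 754 double
        return b"\x01" + key + struct.pack("<d", value)
    if isinstance(value, str):
        # 0x02: UTF-8 string, length-prefixed (length includes the trailing null)
        data = value.encode("utf-8") + b"\x00"
        return b"\x02" + key + struct.pack("<i", len(data)) + data
    if isinstance(value, int):
        # 0x10: int32 when the value fits, otherwise 0x12: int64
        if -2**31 <= value < 2**31:
            return b"\x10" + key + struct.pack("<i", value)
        return b"\x12" + key + struct.pack("<q", value)
    raise TypeError(f"unsupported type for this sketch: {type(value).__name__}")


def encode_document(doc):
    """Encode a flat dict as a BSON document."""
    body = b"".join(encode_element(k, v) for k, v in doc.items())
    # Total length counts the 4-byte length prefix and the trailing 0x00.
    return struct.pack("<i", len(body) + 5) + body + b"\x00"


# Hypothetical row from the orders table.
row = {"o_orderkey": 1, "o_orderstatus": "O", "o_totalprice": 173665.47}
print(encode_document(row).hex())
```

Each value carries its own type tag, which is why integers, strings, and doubles survive the export without being flattened to text.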
Destination - Azure Data Lake Gen2
Azure Data Lake Storage Gen2 combines Azure Blob Storage with a hierarchical namespace for big data analytics. FastBCP optimizes uploads for ADLS Gen2.
Storage Type: Cloud Data Lake
Features:
- Hierarchical namespace support
- Optimized for analytics workloads
- POSIX-compliant access control
- Native integration with Azure services
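The --directory and --fileoutput values in the command above contain {sourcedatabase}, {sourceschema}, and {sourcetable} placeholders. The Python sketch below shows one plausible way such placeholders resolve to a destination path, using the values from the same command. The function expand_placeholders is hypothetical, and the plain textual substitution is an assumption: FastBCP performs the expansion itself, and the final file names may differ, for example when several partition files are written.

```python
def expand_placeholders(template, values):
    """Replace each {name} token in the template with its value."""
    for name, value in values.items():
        template = template.replace("{" + name + "}", value)
    return template


# Values taken from the --database, --sourceschema and --sourcetable options.
values = {
    "sourcedatabase": "tpch",
    "sourceschema": "tpch10",
    "sourcetable": "orders",
}

directory = expand_placeholders(
    "abfss://storageaccount.dfs.core.windows.net/fastbcpexport/raw/{sourcedatabase}/{sourceschema}",
    values,
)
filename = expand_placeholders("{sourcetable}.bson", values)

print(directory + "/" + filename)
```

Keeping the database and schema in the directory path gives each source table a predictable location in the lake's hierarchical namespace.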