Export PostgreSQL to OneLake in Parquet
Fast, parallel data export with zero intermediate storage
Terminal
.\FastBCP.exe `
--connectiontype "pgsql" `
--server "host.domain | host.domain,port | host.domain,port/service" `
--database "tpch" `
--trusted `
--sourceschema "tpch10" `
--sourcetable "orders" `
--query "SELECT * FROM tpch10.orders" `
--directory "onelake://workspace/lakehouse/fastbcpexports/raw/{sourcedatabase}/{sourceschema}" `
--fileoutput "{sourcetable}.parquet" `
--method "Ntile" `
--distributekeycolumn "o_orderkey" `
--paralleldegree -2 `
--merge false `
--runid "runidfromcaller"Source - PostgreSQL
PostgreSQL is a powerful and robust open-source relational database. FastBCP optimizes exports from PostgreSQL using its native connector for excellent performance.
Features:
- •Direct streaming read from database
- •Full support for PostgreSQL data types
- •Secure SSL connections
Parallel Method - Ntile
Divides data into N equal partitions based on a numeric column.
Requirement: Requires a numeric distribution column
Available parallel methods with PostgreSQL:
Output Format - Apache Parquet
Parquet is a columnar file format optimized for analytical processing. FastBCP exports to Parquet with compression and type preservation, ideal for data lake architectures.
Features:
- •Columnar storage for efficient analytics
- •Integrated compression (Snappy, Gzip)
- •Full data type preservation
- •Optimized for big data tools (Spark, Hadoop)
Destination - Microsoft OneLake
OneLake is Microsoft Fabric's unified data lake. FastBCP exports data directly to OneLake workspaces, enabling seamless integration with Fabric analytics services.
Storage Type:
Unified Data Lake
Features:
- •Direct integration with Microsoft Fabric
- •Workspace and lakehouse support
- •Built on Delta Lake format
- •Unified analytics and governance