Export SQL Server to AZ Datalake Gen2 in BSON

    Fast, parallel data export with zero intermediate storage

    Terminal
    .\FastBCP.exe `
    --connectiontype "mssql" `
    --server "host.domain | host.domain,port | host.domain,port/service" `
    --database "tpch" `
    --trusted `
    --sourceschema "tpch10" `
    --sourcetable "orders" `
    --query "SELECT * FROM tpch10.orders" `
    --directory "abfss://storageaccount.dfs.core.windows.net/fastbcpexport/raw/{sourcedatabase}/{sourceschema}" `
    --fileoutput "{sourcetable}.bson" `
    --method "Ntile" `
    --distributekeycolumn "o_orderkey" `
    --paralleldegree -2 `
    --merge false `
    --runid "runidfromcaller"
    Get FastBCP

    Source - SQL Server

    Microsoft SQL Server is a leading enterprise data platform. FastBCP uses advanced techniques to extract SQL Server data with maximum efficiency.

    Features:

    • Native SQL Server driver
    • Support for SQL Server-specific data types
    • Optimized for Windows and Linux environments

    Parallel Method - Ntile

    Divides data into N equal partitions based on a numeric column.

    Requirement: Requires a numeric distribution column

    Available parallel methods with SQL Server:

    Output Format - BSON (Binary JSON)

    BSON is a binary-encoded serialization format, primarily used by MongoDB. FastBCP exports to BSON format for efficient integration with MongoDB ecosystems.

    Features:

    • Binary encoding for compact storage
    • MongoDB native format
    • Data type preservation
    • Efficient for document databases

    Destination - Azure Data Lake Gen2

    Azure Data Lake Storage Gen2 combines Azure Blob Storage with hierarchical namespace for big data analytics. FastBCP optimizes uploads for ADLS Gen2.

    Storage Type:

    Cloud Data Lake

    Features:

    • Hierarchical namespace support
    • Optimized for analytics workloads
    • POSIX-compliant access control
    • Native integration with Azure services