Export ODBC to AZ Datalake Gen2 in JSON

    Fast, parallel data export with zero intermediate storage

    Terminal
    .\FastBCP.exe `
    --connectiontype "odbc" `
    --dsn "DSN_4_DATASOURCE" `
    --database "tpch" `
    --trusted `
    --sourceschema "tpch10" `
    --sourcetable "orders" `
    --query "SELECT * FROM tpch10.orders" `
    --directory "abfss://storageaccount.dfs.core.windows.net/fastbcpexport/raw/{sourcedatabase}/{sourceschema}" `
    --fileoutput "{sourcetable}.json" `
    --decimalseparator "." `
    --dateformat "yyyy-MM-dd HH:mm:ss" `
    --encoding "UTF-8" `
    --method "Ntile" `
    --distributekeycolumn "o_orderkey" `
    --paralleldegree -2 `
    --merge false `
    --runid "runidfromcaller"
    Get FastBCP

    Source - ODBC

    ODBC (Open Database Connectivity) allows connection to virtually any database. FastBCP supports ODBC connections for maximum compatibility.

    Features:

    • Compatible with any database supporting ODBC
    • Maximum flexibility
    • Standard ODBC driver support

    Parallel Method - Ntile

    Divides data into N equal partitions based on a numeric column.

    Requirement: Requires a numeric distribution column

    Available parallel methods with ODBC:

    Output Format - JSON (JavaScript Object Notation)

    JSON is the most widely used data exchange format on the web. FastBCP exports data to JSON Lines (NDJSON) format, with one record per line for efficient streaming.

    Features:

    • JSON Lines (NDJSON) format
    • One record per line for streaming
    • UTF-8 encoding
    • Compatible with all JSON parsers

    Destination - Azure Data Lake Gen2

    Azure Data Lake Storage Gen2 combines Azure Blob Storage with hierarchical namespace for big data analytics. FastBCP optimizes uploads for ADLS Gen2.

    Storage Type:

    Cloud Data Lake

    Features:

    • Hierarchical namespace support
    • Optimized for analytics workloads
    • POSIX-compliant access control
    • Native integration with Azure services