Export MariaDB to AWS S3 in Parquet
Fast, parallel data export with zero intermediate storage
Terminal
.\FastBCP.exe `
--connectiontype "mysql" `
--server "host.domain | host.domain,port | host.domain,port/service" `
--database "tpch" `
--trusted `
--sourceschema "tpch10" `
--sourcetable "orders" `
--query "SELECT * FROM tpch10.orders" `
--directory "s3://rootbucket/fastbcpexport/raw/{sourcedatabase}/{sourceschema}" `
--fileoutput "{sourcetable}.parquet" `
--method "Ntile" `
--distributekeycolumn "o_orderkey" `
--paralleldegree -2 `
--merge false `
--runid "runidfromcaller"Source - MariaDB
MariaDB is a MySQL fork offering advanced features and better performance. FastBCP ensures full compatibility with MariaDB.
Features:
- •Full MySQL compatibility
- •Support for advanced MariaDB features
- •Optimized performance
Parallel Method - Ntile
Divides data into N equal partitions based on a numeric column.
Requirement: Requires a numeric distribution column
Available parallel methods with MariaDB:
Output Format - Apache Parquet
Parquet is a columnar file format optimized for analytical processing. FastBCP exports to Parquet with compression and type preservation, ideal for data lake architectures.
Features:
- •Columnar storage for efficient analytics
- •Integrated compression (Snappy, Gzip)
- •Full data type preservation
- •Optimized for big data tools (Spark, Hadoop)
Destination - Amazon S3
Amazon S3 is the industry-leading object storage service. FastBCP uploads exported files directly to S3 buckets with parallel multipart uploads for optimal performance.
Storage Type:
Cloud Object Storage
Features:
- •Parallel multipart upload
- •Server-side encryption support
- •S3 bucket and prefix configuration
- •IAM role and credential support