Import XLSX Files to MySQL

    Fast, parallel file import using DuckDBStream

    .\FastTransfer.exe `
      --sourceconnectiontype "duckdbstream" `
      --sourceserver ":memory:" `
      --sourceuser "your-username" `
      --sourcepassword "your-password" `
      --query "SELECT * FROM read_xlsx('D:\path\to\files\*.xlsx', filename=true)" `
      --targetconnectiontype "mysqlbulk" `
      --targetserver "your-server" `
      --targetuser "your-username" `
      --targetpassword "your-password" `
      --targetdatabase "your-database" `
      --targetschema "your-schema" `
      --targettable "your-table" `
      --method "DataDriven" `
      --distributekeycolumn "filename" `
      --datadrivenquery "SELECT file FROM glob('D:\path\to\files\*.xlsx')" `
      --degree -2 `
      --loadmode "Truncate" `
      --mapmethod "Name"

    Source - Excel (XLSX)

    The Excel XLSX format is ubiquitous in enterprise environments. FastTransfer can directly read Excel files without prior conversion.

    Features:

    • Direct reading through DuckDB's read_xlsx() function, with no Excel installation required
    • Support for multiple sheets
    • Automatic header detection
    • Data type preservation

    Processing - DuckDBStream with DataDriven

    DuckDB is a fast in-process analytical database. Through the DuckDBStream connection type, FastTransfer runs DuckDB queries directly against the source files, which lets it read multiple file formats without a separate conversion step.

    Parallel Method: DataDriven (Files)

    For file sources, FastTransfer uses the filename as the distribution key, so that multiple files are processed in parallel.

    • Concurrent processing of multiple files
    • Ideal for batch imports
    • Automatic horizontal scaling
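
The idea behind this method can be sketched in a few lines of Python. This is a conceptual illustration only, not FastTransfer's implementation: the `load_file` function, the sample file names, and the use of a thread pool are all assumptions made for the example. Each distinct value of the distribution key (here, a file name) becomes one unit of work, and a fixed number of workers process those units concurrently.

```python
from concurrent.futures import ThreadPoolExecutor


def load_file(path):
    """Hypothetical stand-in for reading one file and loading its rows."""
    # A real loader would read the workbook and write to the target here.
    return (path, "done")


def run_data_driven(keys, degree):
    """Process one unit of work per distinct key, at most `degree` at a time."""
    distinct = sorted(set(keys))  # one task per distinct distribution-key value
    with ThreadPoolExecutor(max_workers=degree) as pool:
        # map() returns results in input order even though tasks run concurrently
        return list(pool.map(load_file, distinct))


files = ["b.xlsx", "a.xlsx", "c.xlsx", "a.xlsx"]  # hypothetical key values
results = run_data_driven(files, degree=2)
```

Because the work is split per file, this kind of scheme helps most when there are many files of similar size; a single very large file would still be handled by one worker.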

    Destination - MySQL

    FastTransfer leverages MySQL's bulk insert API for optimized loading. Data is inserted in batches to maximize throughput.

    Loading method:

    Bulk Insert API

    Advantages:

    • Optimized bulk insert
    • Intelligent batching
    • Support for indexes and constraints
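
To make the batching idea concrete, here is a minimal Python sketch of grouping rows into multi-row INSERT statements. It illustrates the general technique only and is not FastTransfer's internal loader or a MySQL API: the helper names `batches` and `build_insert`, the table and column names, and the batch size are all hypothetical. The `%s` placeholders follow the style used by common Python MySQL drivers.

```python
from itertools import islice


def batches(rows, size):
    """Yield successive lists of at most `size` rows from any iterable."""
    it = iter(rows)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def build_insert(table, columns, chunk):
    """Build one parameterized multi-row INSERT for a batch of rows."""
    # Identifiers are used as-is; a real loader would validate or quote them.
    row_placeholder = "(" + ", ".join(["%s"] * len(columns)) + ")"
    sql = "INSERT INTO {} ({}) VALUES {}".format(
        table, ", ".join(columns), ", ".join([row_placeholder] * len(chunk))
    )
    params = [value for row in chunk for value in row]  # flatten row values
    return sql, params


rows = [(1, "a"), (2, "b"), (3, "c")]  # hypothetical sample data
statements = [build_insert("your_table", ["id", "name"], c) for c in batches(rows, 2)]
```

Sending one statement per batch rather than one per row reduces the number of round trips to the server, which is the main reason batching improves throughput.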