Tool under construction. Launching March 20, 2026. Code will be open sourced under AGPL license. Current version is for testing purposes only.
Modufile

PDF to Excel

Extract tabular data from your PDF into spreadsheets

Private: files are processed in your browser and never sent to a server.

Works best with PDFs containing clear tabular data. Complex layouts may require manual adjustment after conversion.

Output details

  • 📊 Standard .xlsx format
  • 📋 Column detection from text positions
  • 📄 One sheet per document

About PDF to Excel

Modufile's PDF to Excel tool extracts tabular data from PDFs and generates .xlsx spreadsheet files. It uses MuPDF to extract text blocks together with their position coordinates, then detects columns by clustering the blocks' X coordinates to reconstruct the original table structure. The resulting spreadsheet is built with SheetJS (the xlsx library). The tool works best on PDFs whose tables are clearly structured and have consistent column alignment, such as financial statements, invoices, reports, and data exports. The entire process runs in your browser, so your files are never sent to a server.
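
To illustrate the idea of column detection by X-coordinate clustering, here is a minimal sketch. It is not Modufile's actual implementation: the TextBlock shape, the function names detectColumns and buildTable, and both tolerance values are hypothetical, and it assumes Y coordinates increase down the page. It uses a simple gap-based clustering, which is only one of several ways to cluster coordinates.

```typescript
// Hypothetical shape of a positioned text fragment (not MuPDF's API).
interface TextBlock {
  text: string;
  x: number; // left edge of the fragment, in page units
  y: number; // vertical position, assumed to increase down the page
}

// Gap-based clustering: sort the X coordinates and start a new column
// whenever the gap to the previous coordinate exceeds `tolerance`.
// Returns one anchor per column (the mean X of its cluster).
function detectColumns(blocks: TextBlock[], tolerance = 10): number[] {
  const xs = blocks.map((b) => b.x).sort((a, b) => a - b);
  const columns: number[] = [];
  let clusterStart = 0;
  for (let i = 1; i <= xs.length; i++) {
    if (i === xs.length || xs[i] - xs[i - 1] > tolerance) {
      const cluster = xs.slice(clusterStart, i);
      const mean = cluster.reduce((sum, v) => sum + v, 0) / cluster.length;
      columns.push(mean);
      clusterStart = i;
    }
  }
  return columns;
}

// Group blocks into rows by Y, then place each block in the nearest column.
// The result is an array of arrays, one inner array per table row.
function buildTable(
  blocks: TextBlock[],
  tolerance = 10,
  rowTolerance = 3
): string[][] {
  const columns = detectColumns(blocks, tolerance);
  const sorted = [...blocks].sort((a, b) => a.y - b.y || a.x - b.x);
  const table: string[][] = [];
  let rowY = Number.NEGATIVE_INFINITY;

  for (const block of sorted) {
    // Start a new row when the block sits clearly below the current row.
    if (block.y - rowY > rowTolerance) {
      table.push(new Array<string>(columns.length).fill(""));
      rowY = block.y;
    }
    // Find the column anchor closest to this block's X coordinate.
    let best = 0;
    for (let c = 1; c < columns.length; c++) {
      if (Math.abs(columns[c] - block.x) < Math.abs(columns[best] - block.x)) {
        best = c;
      }
    }
    const row = table[table.length - 1];
    // Fragments that land in the same cell are joined with a space.
    row[best] = row[best] ? `${row[best]} ${block.text}` : block.text;
  }
  return table;
}
```

The array-of-arrays result has the shape that SheetJS accepts in XLSX.utils.aoa_to_sheet, so a sketch like this could feed a worksheet directly. Fixed tolerances are the weak point: tables with ragged or right-aligned columns may need per-document tuning, which is consistent with the note that complex layouts may require manual adjustment.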

Tech Stack

mupdf: Extracts text blocks with precise position coordinates from each PDF page
xlsx (SheetJS): Generates Excel spreadsheet files (.xlsx) from extracted tabular data

Frequently Asked Questions