Global Araç
Pdf Table Extractor
How this works: uses pdf.js to extract text + position from each character box, then clusters items into rows by Y-coordinate and cells by X-gap. Works well on simple single-page tables. Limitations:scanned PDFs (image-only) won’t work — need OCR first. Tables with merged cells or unusual structure may need manual cleanup. All processing in your browser — file never uploaded.
Extract tables from any text-based PDF page into CSV. Uses pdf.js to read positioned text boxes, clusters them into rows by Y-coordinate, splits cells by X-gap. Works well on simple tables; complex layouts may need cleanup.
All processing happens in your browser. The PDF is never uploaded.
Nasıl Kullanılır
- Upload your PDF.
- Pick a page number.
- Download the extracted table as CSV.