Guide

How to Extract Table Data from Messy Copied Text

Copying a table from a PDF, email, or webpage often strips out the column structure, leaving you with a flat block of text. The Table Extractor tool helps recover the rows and columns so you can export the data as a usable CSV.

How table data gets mangled

Most PDF readers and web browsers copy table content as plain text. The rows are usually preserved as separate lines, but the column separators depend entirely on how the source document was formatted:

PDFs often use multiple spaces to align columns visually
Spreadsheets and CSV files typically use tabs or commas
HTML tables may lose all structure and paste as a single block
Pipe-delimited formats are common in database exports

Choosing the right delimiter

Delimiter	Best for
Auto detect	Try this first — works in most cases
Tab	Spreadsheet copy-paste, TSV files
Multiple spaces	PDF column layouts, fixed-width reports
Comma	CSV files, some web table copies
Semicolon	European CSV exports, some database formats
Pipe \|	Database exports, Markdown tables

Step-by-step

Copy the table text

Select the table in your PDF, email, or document and copy it with Ctrl+C.

Paste into the Table Extractor

Open the Table Extractor and paste the text. Click "Load example" to see a sample first.

Try Auto detect

Click Extract table with Auto detect selected. Check the preview — if the columns look right, you're done.

Switch delimiter if needed

If the columns are wrong, try each delimiter option and re-run until the preview looks correct.

Export

Click Download CSV to save a .csv file, or Copy CSV to paste directly into a spreadsheet.

When it won't work well

HTML tables that collapse into a single line when copied — the row structure is lost
PDFs with merged cells — those cannot be recovered after copy-paste
Tables where column widths vary wildly with the spaces delimiter
Multi-level or nested headers — the first row is always treated as the header row

Open Table Extractor →

Related tools

Text Case Fixer

Fix ALL CAPS, weird capitalization, broken spacing, and pasted text mess.

Image Text Cleanup

Turn messy OCR or copied image text into clean usable content.