Guide

How to Extract Table Data from Messy Copied Text

Copying a table from a PDF, email, or webpage often strips out the column structure, leaving you with a flat block of text. The Table Extractor tool helps recover the rows and columns so you can export the data as a usable CSV.

How table data gets mangled

Most PDF readers and web browsers copy table content as plain text. The rows are usually preserved as separate lines, but the column separators depend entirely on how the source document was formatted:

  • PDFs often use multiple spaces to align columns visually
  • Spreadsheets and CSV files typically use tabs or commas
  • HTML tables may lose all structure and paste as a single block
  • Pipe-delimited formats are common in database exports

Choosing the right delimiter

DelimiterBest for
Auto detectTry this first — works in most cases
TabSpreadsheet copy-paste, TSV files
Multiple spacesPDF column layouts, fixed-width reports
CommaCSV files, some web table copies
SemicolonEuropean CSV exports, some database formats
Pipe |Database exports, Markdown tables

Step-by-step

1
Copy the table text
Select the table in your PDF, email, or document and copy it with Ctrl+C.
2
Paste into the Table Extractor
Open the Table Extractor and paste the text. Click "Load example" to see a sample first.
3
Try Auto detect
Click Extract table with Auto detect selected. Check the preview — if the columns look right, you're done.
4
Switch delimiter if needed
If the columns are wrong, try each delimiter option and re-run until the preview looks correct.
5
Export
Click Download CSV to save a .csv file, or Copy CSV to paste directly into a spreadsheet.

When it won't work well

  • HTML tables that collapse into a single line when copied — the row structure is lost
  • PDFs with merged cells — those cannot be recovered after copy-paste
  • Tables where column widths vary wildly with the spaces delimiter
  • Multi-level or nested headers — the first row is always treated as the header row

Related tools