Question 1

How accurate is heading detection?

Accepted Answer

Heading detection works best on documents with a clear typographic hierarchy (academic papers, books, reports). The tool clusters text by font size and treats the cluster with the largest font size as H1, the next largest as H2, and so on. Documents that mix formats or use font sizes inconsistently may need manual cleanup.
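To make the clustering idea concrete, here is a minimal sketch in TypeScript. It is not the tool's actual code: the `TextLine` shape, the 0.5 pt tolerance, the cap of three heading levels, and the rule that the body size is whichever size covers the most characters are all assumptions made for illustration.

```typescript
// Hypothetical input shape: one entry per extracted line of text.
interface TextLine {
  text: string;
  fontSize: number; // in points
}

// Group distinct font sizes, largest first, so that neighbouring sizes
// within `tolerance` points of each other share a cluster.
function clusterFontSizes(sizes: number[], tolerance = 0.5): number[][] {
  const sorted = Array.from(new Set(sizes)).sort((a, b) => b - a);
  const clusters: number[][] = [];
  let current: number[] = [];
  let prev = Infinity;
  for (const size of sorted) {
    if (current.length > 0 && prev - size > tolerance) {
      clusters.push(current);
      current = [];
    }
    current.push(size);
    prev = size;
  }
  if (current.length > 0) clusters.push(current);
  return clusters;
}

// Assumption: the font size that covers the most characters is body text,
// and only clusters strictly larger than it can become headings.
function toMarkdown(lines: TextLine[], maxLevel = 3): string {
  const charCount = new Map<number, number>();
  for (const line of lines) {
    charCount.set(line.fontSize, (charCount.get(line.fontSize) ?? 0) + line.text.length);
  }
  let bodySize = 0;
  let bestCount = -1;
  charCount.forEach((count, size) => {
    if (count > bestCount) {
      bestCount = count;
      bodySize = size;
    }
  });

  // Largest cluster becomes H1, the next H2, and so on up to maxLevel.
  const levelOf = new Map<number, number>();
  clusterFontSizes(lines.map((l) => l.fontSize))
    .filter((cluster) => Math.min(...cluster) > bodySize)
    .slice(0, maxLevel)
    .forEach((cluster, i) => cluster.forEach((size) => levelOf.set(size, i + 1)));

  return lines
    .map((line) => {
      const level = levelOf.get(line.fontSize);
      return level ? `${"#".repeat(level)} ${line.text}` : line.text;
    })
    .join("\n\n");
}
```

A document that uses several sizes interchangeably for body text would confuse this sketch in the same way the answer describes: the clusters stop mapping cleanly to heading levels, which is where manual cleanup comes in.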

Question 2

Does it preserve images / tables?

Accepted Answer

Neither is fully preserved. Images are replaced with [image] placeholders. Tables are output as tab-separated text, because Markdown table structure is hard to infer reliably from a PDF's layout. If tables matter to you, run the file through a dedicated PDF table extractor instead.
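As a rough illustration of that output, the sketch below renders already-classified content blocks. The `Block` type and both function names are hypothetical, and it assumes an earlier step has already decided which regions are text, images, or tables, which is the hard part in practice.

```typescript
// Hypothetical block types; the tool's internal representation is not documented here.
type Block =
  | { kind: "text"; text: string }
  | { kind: "image" }
  | { kind: "table"; rows: string[][] };

function renderBlock(block: Block): string {
  switch (block.kind) {
    case "text":
      return block.text;
    case "image":
      // The image data is dropped; only a placeholder remains.
      return "[image]";
    case "table":
      // One line per row, cells separated by tabs. Whitespace inside a cell
      // is collapsed so a stray tab or newline cannot shift the columns.
      return block.rows
        .map((row) => row.map((cell) => cell.replace(/\s+/g, " ").trim()).join("\t"))
        .join("\n");
  }
}

function renderBlocks(blocks: Block[]): string {
  return blocks.map(renderBlock).join("\n\n");
}
```

Tab-separated rows paste cleanly into a spreadsheet, which makes them a workable fallback when the table structure cannot be rebuilt as Markdown.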

Question 3

Is the file uploaded?

Accepted Answer

No. pdf.js parses the document entirely in your browser.

PDF to Markdown

How heading detection works

What's preserved

What's not

Privacy