PDF Parsing

Intelligently convert PDF to editable Markdown, with table handling and multi-lang translation

Upload PDF File

Drag & Drop files here, or

PDF only, Max 300MB, Max 800 pages

Settings

Points Deduction Info

  • PDF Parsing2 pts/pg
  • PDF Parsing+MD Translation3 pts/pg

How PDF to Markdown conversion works

KolmoPDF uses a Visual Language Model (VLM) to analyze PDF pages and output structured Markdown that preserves the semantic content of the document.

What the parser handles

  • Multi-column layouts: Pages with two or more columns are read in the correct logical order, not left-to-right across the physical page.
  • Mathematical formulas: Inline and display math is output as LaTeX within the Markdown, for example $E = mc^2$ or$$\int_0^\infty e^{-x^2} dx$$.
  • Tables: Tables can be output as Markdown tables or preserved as images. The experimental cross-page table merge feature detects and merges tables that span page breaks.
  • Code blocks: Source code and technical notation is wrapped in fenced code blocks with language hints where detectable.
  • Translation: Enable translation to convert document content into Chinese, Japanese, Korean, French, German, Spanish, or Russian during parsing. Adds 1 credit/page.

File limits

PDF files only. Maximum file size: 300 MB. Maximum page count: 800 pages.