Supported formats

One pipeline for every file type

Documents, spreadsheets, slides, images, audio, video, and whole websites are all converted to clean, structured Markdown that your AI can use directly.

Documents

PDF · DOCX · TXT · Markdown · HTML

Layout-aware PDF parsing reconstructs columns, tables, and reading order. Scanned PDFs are read with OCR or AI vision, so even image-only documents become real text.

Slides & spreadsheets

PPTX · XLSX

Slide decks become structured outlines; spreadsheets become Markdown tables that keep rows and columns aligned, so the relationships in your data survive.
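As a rough illustration of the spreadsheet case, here is a minimal sketch of turning rows of cells into a Markdown table. It is not the product's implementation: the function name `rows_to_markdown` and the choice to treat the first row as the header are assumptions made for the example.

```python
def rows_to_markdown(rows):
    """Render rows (first row treated as the header) as a Markdown table.

    Hypothetical helper for illustration only.
    """
    if not rows:
        return ""
    # Pad every row to the widest one so columns stay aligned.
    width = max(len(r) for r in rows)

    def cell(value):
        # Empty cells stay empty; pipes and newlines would break the table.
        text = "" if value is None else str(value)
        return text.replace("|", "\\|").replace("\n", " ")

    def line(row):
        padded = list(row) + [None] * (width - len(row))
        return "| " + " | ".join(cell(v) for v in padded) + " |"

    header, *body = rows
    separator = "| " + " | ".join("---" for _ in range(width)) + " |"
    return "\n".join([line(header), separator, *[line(r) for r in body]])


if __name__ == "__main__":
    print(rows_to_markdown([["Region", "Q1"], ["EMEA", 120], ["APAC"]]))
```

Padding short rows and escaping pipe characters is what keeps each value in its original column, which is the property the paragraph above describes.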

Images

PNG · JPG · JPEG · TIFF · BMP

OCR pulls text out of screenshots and scans, while AI vision transcribes and describes charts and diagrams — so the most information-dense parts of a page aren't lost.

Audio & video

MP3 · WAV · M4A · MP4 · MOV · WEBM

Speech is transcribed to timestamped Markdown, with speaker diarization for multi-person recordings — turn meetings, calls, and lectures into searchable text.
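To make "timestamped Markdown" concrete, here is a small sketch of how diarized transcript segments could be rendered. The `Segment` type, its field names, and the `[HH:MM:SS] Speaker:` layout are assumptions for illustration; the actual output format may differ.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    # Hypothetical shape of one diarized transcript segment.
    start: float   # offset from the start of the recording, in seconds
    speaker: str   # label assigned by diarization, e.g. "Speaker 1"
    text: str


def fmt_ts(seconds):
    """Format a second offset as HH:MM:SS (fractions are truncated)."""
    total = int(seconds)
    hours, rem = divmod(total, 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def transcript_to_markdown(segments):
    """Render segments in time order, one Markdown paragraph per segment."""
    lines = []
    for seg in sorted(segments, key=lambda s: s.start):
        lines.append(f"**[{fmt_ts(seg.start)}] {seg.speaker}:** {seg.text.strip()}")
    return "\n\n".join(lines)


if __name__ == "__main__":
    print(transcript_to_markdown([
        Segment(65.2, "Speaker 2", "Sounds good to me."),
        Segment(3.0, "Speaker 1", "Shall we start? "),
    ]))
```

Keeping the timestamp and speaker label at the start of each paragraph makes the transcript easy to search and to split into chunks later.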

Websites

Any URL · whole-site crawl

Convert a single page, or crawl an entire same-origin site into one clean document — documentation, knowledge bases, and articles, ready for your LLM.
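"Same-origin" here can be read in the usual web sense: same scheme, host, and port as the starting URL. The sketch below shows one way a crawler might filter discovered links under that assumption, using only Python's standard library. The helper names `origin` and `in_scope_links` are hypothetical, and the product's actual crawl rules are not specified here.

```python
from urllib.parse import urljoin, urldefrag, urlsplit

# Default ports, so https://host and https://host:443 compare as equal.
DEFAULT_PORTS = {"http": 80, "https": 443}


def origin(url):
    """Return the (scheme, host, port) triple that defines a URL's origin."""
    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    host = (parts.hostname or "").lower()
    port = parts.port or DEFAULT_PORTS.get(scheme)
    return (scheme, host, port)


def in_scope_links(page_url, hrefs):
    """Resolve hrefs against page_url and keep only same-origin pages."""
    base = origin(page_url)
    seen, result = set(), []
    for href in hrefs:
        # Resolve relative links, then drop #fragments so one page is
        # not queued twice under different anchors.
        absolute, _fragment = urldefrag(urljoin(page_url, href))
        if origin(absolute) == base and absolute not in seen:
            seen.add(absolute)
            result.append(absolute)
    return result


if __name__ == "__main__":
    print(in_scope_links(
        "https://docs.example.com/guide/",
        ["intro", "/api#auth", "https://example.com/", "mailto:a@b.c"],
    ))
```

Under this definition a subdomain such as `blog.example.com` counts as a different origin from `docs.example.com`, so a crawl stays within the site it started on.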

Convert anything to AI-ready Markdown

PDFs, Office docs, images, audio, video, and whole websites: clean Markdown and RAG-ready exports for your LLM, in seconds.