PDF to Speech — Convert PDF to MP3

en-US

Delbert

Style

speed:1.0

pitch:0

Volume:100%

File Format

Format:

Bitrate:

Sample Rate:

Channels:

Pause Control

Pause for paragraphs:

Pause for sentences:

These settings control the duration of silence between text blocks for natural sounding speech.

Background music

Current track No files selected

Volume: 100%

Loop Repeat background endlessly

Open the editor above, click File in the toolbar to upload your PDF, and get a natural-sounding MP3 in seconds — research papers, ebooks, long-form articles, business reports. SpeechGen reads any text-based PDF aloud in 146 languages using the same engine that powers our 5,000+ built-in voices. No software to install, no sign-up for the first 3,000 characters.

How to convert PDF to MP3 — 3 steps

Browser-based, no download. Short documents convert in seconds, full books in a couple of minutes.

Upload your PDF

In the editor above, click the File button on the toolbar and pick your PDF. The engine reads text-based PDFs (the kind exported from Word, LaTeX, InDesign, or any browser).

Up to 50 MB per file
Text-based PDFs (run scanned files through OCR first)
Cyrillic, CJK, Arabic, Hebrew layouts handled

Pick voice and language

Choose from 5,000+ voices across 146 languages. Adjust speed and pitch, or pick a specific accent. Preview before you commit.

146 languages for narration
Adjustable pace, pitch, emphasis
Same library as the main TTS editor

Download MP3

Audio is ready in under a minute for shorter documents, a few minutes for full books. Stream it in your account or download the MP3.

MP3, WAV, OGG output
Stored in your account for replay
Downloadable for offline use

What people convert PDFs into

Four real workflows we see every day. Tap a card to listen — same engine, your file plugs straight into the editor above.

Research paper to audio for commute listening

By Aoede HD

Research papers & theses

12-page IEEE papers, dissertation drafts, lecture notes from arXiv — listen on the commute instead of skimming on a screen. Multi-column layouts and footnotes are flattened automatically before narration.

By Charon HD

Ebooks & novel chapters

Full-book PDFs in any language — German memoirs, Spanish thrillers, English literary fiction. Narrator voice stays consistent for hundreds of pages, no quality drop in chapter twelve.

By Lapetus HD

Reports, memos & white papers

Quarterly reports, market research, board memos — turn a 40-page deck into a 25-minute MP3 to listen to on the train. Lapetus delivers a clean corporate read without sounding robotic.

By Achernar HD

Articles & long-form journalism

Magazine essays, Substack longreads, NYT investigations exported to PDF — turn a 30-minute read into a podcast you can listen to while cooking. Achernar HD has the warm magazine-narrator timbre.

Pro tools for full-length books: use the <cut> tag to split a 300-page novel into chapter-by-chapter MP3s in one synthesis, the <dialog> tag to give each character a different voice across dialogue passages, and <break> tags for precise dramatic pauses between scenes. Each tag has its own quick guide.

PDF-specific handling

Three things this tool does better than copy-pasting raw text into a generic TTS engine.

Multi-column & structured layouts

Two-column research papers, bulleted lists, headings and captions, footnotes — text reflow is structure-aware. Reading order matches the page, not random column-jumping. Header / footer / page numbers are filtered out so the narrator doesn't say "page seventeen" every minute.

Long-file performance

A 30-page paper finishes in under a minute. 200-page books complete in 3–5 minutes. No manual chunking, no chapter splitting — upload once, get a single MP3 (or split into chapter tracks via TOC bookmarks if your PDF has them).

Multilingual within one PDF

Documents that mix two or three languages — research papers with English abstracts and Spanish bodies, bilingual contracts, immigration forms — get language-detected and narrated in the right voice for each section. No splitting required.

Frequently Asked Questions

How do I convert a PDF to audio?

Click the File button in the editor toolbar at the top of this page, pick your PDF, choose a voice and language, click Convert. MP3 lands in your account in 30 seconds for short documents and a few minutes for full books. Nothing to install.

Does it work with scanned PDFs?

No — the engine reads text-based PDFs only (the kind exported from Word, LaTeX, InDesign, or any browser). For image-based PDFs (scanned books, faxed reports, photos of documents), run them through any free OCR tool first — Adobe Acrobat, ABBYY FineReader, or even Google Drive's built-in OCR — to convert the pixels into a text PDF. Then upload here as usual.

Does it skip headers, footers, and page numbers?

Yes. Repeating headers, footers, and standalone page numbers are filtered out so the narrator doesn't read "page seventeen" every minute. Chapter titles and section headings are kept and read aloud at a natural pace.

How are tables, captions, and footnotes handled?

Tables are flattened row-by-row, with column headers read once before each row. Figure / chart captions are read in line where they appear. Footnotes are skipped from the main flow and read at the end of each chapter so they don't break sentence rhythm.

Can I convert password-protected or encrypted PDFs?

No — DRM-protected and password-locked PDFs are rejected on upload for legal and security reasons. Remove the password first (any PDF tool can do this if you have the password), then upload. We can't bypass DRM.

How long does a 100-page PDF take, and what about a 500-page book?

100 pages convert in about 2 minutes (≈3 hours of MP3 audio at normal speed). 500-page books are over the 50 MB upload limit — split into 2–3 parts using any PDF tool, convert each, then concat the MP3s if you want one file.

Can I split a book into chapter-by-chapter MP3s, or give characters different voices?

Yes — both are built in. Wrap chapter breaks in the <cut> tag and one synthesis returns a separate MP3 per chapter. For dialogue between characters, the <dialog> tag voices each speaker with a different actor in a single audio file. Combine both for a full multi-voice audiobook.

Ready to listen to your PDF?

Click File in the editor at the top of this page. First 3,000 characters free — about 5 pages of audio, no card required. After that $5+.

Convert PDF to MP3