Upload audio or video
Drag & drop MP3, WAV, MP4 — or paste a YouTube link. Files up to 1 GB and 3 hours.
Transcribe audio to text and convert to SRT/VTT subtitles in one upload — speaker-labelled, timestamped, ready for any video editor.
Upload, let the AI work, then tune the export to your workflow.
Drag & drop MP3, WAV, MP4 — or paste a YouTube link. Files up to 1 GB and 3 hours.
Our model converts speech to text with 95–98% accuracy, timestamps every line, and labels speakers.
Tune paragraph rhythm and which timestamps appear, then ship as TXT for writers, DOCX for review, or SRT/VTT subtitles for any video player.
If you're transcribing a confidential interview, medical session, legal recording, or internal meeting, here's exactly how we handle the file. No marketing flourish.
Every audio file you upload travels encrypted, and so does the resulting transcript when you fetch it back. SSL/TLS end-to-end.
Your audio file and its transcript are wiped from our storage 3 days after upload. Need to keep a copy? Download it within that window or set a reminder.
Your audio recordings stay yours. They don't enter any training pipeline. The model that transcribes them is pre-trained and frozen — your file is processed once and forgotten.
EU users have the standard rights — copy, deletion, portability — exercisable from your account or via support. Each upload sits at a private URL keyed to the account that created it.
Your content stays private, encrypted, and entirely under your control.
Audio, video, and YouTube on the way in — every common transcript format on the way out.
MP3WAVOGGOPUSAACM4AFLACAMRAIFF3GPWEBM
MP4MOVMKVWMVAVIWEBM
YouTubeYouTube Shorts
TXTDOCXPDFSRTVTTCSVClipboard
Drop audio in — get back a clean transcript and a subtitle file segmented to caption-friendly line lengths. The 38-second sample below converts to SRT and VTT (highlighted) as well as TXT, DOCX, PDF, and CSV. Same shape of output your file will produce.
Speaker 1 · 00:01
What got you started in tech journalism?
Speaker 2 · 00:05
Honestly, by accident. I was covering city hall…
Tech-journalism interview
Sample transcript · 0:38
Speaker 1 · 00:01
What got you started in tech journalism?
Effectively a built-in subtitle generator: SRT and VTT files are pre-segmented to caption-friendly line lengths (≤ 42 chars per line) — drop straight into Premiere, DaVinci, Final Cut, CapCut, or YouTube Studio. Or tune all output controls first.
Most transcription tools dump a single wall of text. Ours splits the transcript by speaker, by pause length, and by paragraph rhythm — tuneable to your downstream tool.
Auto-detect or fix every paragraph at 1, 2, 3, 4, or 8 lines. Useful when you're pasting into a doc that has its own preferred rhythm.
Auto1 line2 lines3 lines4 lines8 linesThe AI starts a new paragraph after a pause. Adjust the pause length to taste — shorter for fast speech, longer for measured monologue.
500 ms700 ms (default)1500 mscustomPer paragraph for skim review, per phrase for legal-cite work, both for full audit trail, or off for clean publishable prose.
ParagraphsPhrasesBothOffAuto-labelled Speaker 1 / Speaker 2. Rename in the editor to match the panelists, hosts, or interview subjects you uploaded.
Speaker namesMerge by speakerHideOne toggle collapses the transcript to publishable prose — ready for a writer, an LLM summarizer, or to paste into a CMS draft.
Plain text modeSkip the file step. Paste the configured transcript straight into Notion, Google Docs, or your CMS — already in the right shape.
Copy to clipboardThese are the languages where the model delivers consistently strong results. Auto-detect picks the right one; mixed-language clips work too.
If your audio is in a less common language, run a 60-second sample on the free tier first.
One transcript engine, every workflow that needs words from sound.
Transcribe audio to text from interviews and field recordings — speaker-labelled output and a transcript generator built for fast quote-pulling.
Turn lectures and seminars into study notes. Add timestamps and skim instead of re-listening.
An audio to text converter that doubles as a show-note generator: feed in an MP3, get blog repurposes, episode summaries, and chapter cues.
Use the built-in subtitle generator to produce SRT and VTT files for YouTube, TikTok, and any video player.
Transcribe depositions, hearings, and meetings with timestamps for line-cite review.
Drop a meeting recording, get an action-item transcript ready to paste into your doc tool.
Test transcription quality on your own audio. No credit card. Top up only when you need more minutes.
The questions we hear most from new users — answered straight.