Drop any audio file, get frame-accurate SRT captions for Premiere Pro — or clean plain text. Free, private, and blazing fast.
Professional-grade transcription with the flexibility video editors actually need.
Whisper returns per-word timestamps so your captions sync to the exact millisecond — no more manually adjusting timing.
Frame accurateAutomatic language detection or manually specify the language. Supports English, Spanish, French, Arabic, Japanese, and dozens more.
Auto-detectEverything runs in your browser. Your audio goes directly to OpenAI's API — no intermediate servers, no stored files, no tracking.
Zero storageExport industry-standard SRT files that import perfectly into Premiere Pro, DaVinci Resolve, Final Cut Pro, and any NLE.
SRT standardNo software to install. No account required. Just bring your OpenAI API key.
Paste your OpenAI API key — it stays in your browser, never touches our servers. Whisper costs ~$0.006 per minute of audio.
Upload any audio or video file up to 25MB. MP3, M4A, WAV, WebM, OGG — all supported. Pick your caption style and language.
Whisper transcribes and we generate your SRT instantly. Copy or download and drop it straight into your video editor.
Choose the caption style that fits your video — from punchy single words to full sentences.
One caption per word — perfect for that viral, high-energy caption style seen on reels and shorts.
Group 2–5 words per caption for a natural reading rhythm without breaking the sentence flow.
Wrap at a max character count — ideal for broadcast standards or platforms with caption width limits.
One caption per Whisper segment — great for narrative content and interviews where full sentences read better.
Your audio never leaves your browser session.
Drop your audio file here, or click to browse
MP3 · M4A · WAV · OGG · WebM · FLAC · up to 25MB