Question 1

Is my audio or video uploaded to transcribe it?

Accepted Answer

No. The tool runs OpenAI's Whisper model compiled to WebAssembly right in your browser. Your file is decoded and transcribed on your own device — nothing is uploaded and no API key is needed.

Question 2

Can I get subtitles / captions?

Accepted Answer

Yes. Along with plain text, you can download timed captions as .srt or .vtt, ready to drop into YouTube, a video editor, or PandaStudio.

Question 3

Does it work for languages other than English?

Accepted Answer

Yes — pick the Multilingual model before choosing your file. The English model is faster; the multilingual one handles many languages.

Question 4

Why is the first run slow?

Accepted Answer

The Whisper model weights download once (tens of MB) and are cached by your browser. After that, transcription speed depends on your device and the length of the file.

Question 5

How accurate is it, and what about long files?

Accepted Answer

It uses the tiny/base Whisper models for speed in-browser, which are good for most clips. For long recordings, higher accuracy, speaker labels, and editing the transcript directly against the video, use the PandaStudio desktop app.

Free Audio & Video Transcriber

How it works

Pick a model & file

Whisper runs on-device

Copy or export captions

Need the full studio, not just a quick fix?

Frequently asked questions