Subtitles, decoded on your machine.

A native macOS workshop for turning audio and video into SRT, VTT, TXT, ASS and JSON subtitles — fast, batch-friendly, with the engine you pick. No SaaS round-trips. No usage caps.

Download for macOS
Buy Lifetime · $19

02 / 07 — SCREENSHOT

The workshop, on your machine.

03 / 07 — PILLARS

Three certainties.

P · 01

On-device by default.

Audio leaves your Mac only if you point UtterPad at a cloud model. The default engines run entirely on your Mac. No network, no telemetry.

P · 02

Pro-tool surface.

Batch queue with per-row model, language, output-format and folder overrides. Drag-reorderable, drag-resizable columns. Phased progress bars that reset between extract and transcribe.
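For the curious, per-row overrides can be pictured as a set of queue-wide defaults plus optional values on each row. The sketch below is illustrative only and is not UtterPad's code: `JobSettings`, `QueueRow` and `resolve` are hypothetical names, and the model identifiers are placeholders.

```python
from dataclasses import dataclass, replace
from typing import Optional, Tuple


@dataclass(frozen=True)
class JobSettings:
    """Queue-wide defaults (hypothetical shape, for illustration)."""
    model: str
    language: str
    output_formats: Tuple[str, ...]
    output_folder: Optional[str]  # None means "beside the source file"


@dataclass(frozen=True)
class QueueRow:
    """One file in the batch queue; None means 'use the default'."""
    source_path: str
    model: Optional[str] = None
    language: Optional[str] = None
    output_formats: Optional[Tuple[str, ...]] = None
    output_folder: Optional[str] = None


def resolve(row: QueueRow, defaults: JobSettings) -> JobSettings:
    """Return the effective settings for a row: defaults plus its overrides."""
    overrides = {}
    for name in ("model", "language", "output_formats", "output_folder"):
        value = getattr(row, name)
        if value is not None:
            overrides[name] = value
    return replace(defaults, **overrides)


defaults = JobSettings("whisper-small", "auto", ("srt", "vtt"), None)
row = QueueRow("interview.mp4", language="de", output_folder="/tmp/subs")
effective = resolve(row, defaults)
print(effective.model, effective.language, effective.output_folder)
```

Rows that set nothing simply inherit every default, which keeps a large batch cheap to configure.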

P · 03

Every transcript, every format.

Each completed job auto-exports .srt, .vtt, .txt, .ass and .json — beside the source or to a folder you pick per file.

04 / 07 — PIPELINE

Four phases. Each one resets.

step 01

Extract

extracting audio · 92%

step 02

Transcribe

transcribing · 47%

step 03

Live segments

transcribing · 71%

step 04

Export

done · 100%
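One way to picture a progress bar that resets per phase is a small state object that drops back to 0% whenever a new phase begins. This is a sketch under assumptions, not the app's implementation: the class, its method names and the "exporting" label are hypothetical, while the other status strings are taken from the steps above.

```python
from dataclasses import dataclass

# Status text per phase; "exporting" is an assumed label for illustration.
PHASE_LABELS = {
    "extract": "extracting audio",
    "transcribe": "transcribing",
    "export": "exporting",
}


@dataclass
class PhasedProgress:
    phase: str = "extract"
    fraction: float = 0.0
    finished: bool = False

    def begin(self, phase):
        """Enter a new phase; the bar starts again from zero."""
        if phase not in PHASE_LABELS:
            raise ValueError(f"unknown phase: {phase}")
        self.phase = phase
        self.fraction = 0.0

    def advance(self, fraction):
        """Record progress within the current phase, clamped to 0..1."""
        self.fraction = min(1.0, max(0.0, fraction))

    def finish(self):
        self.finished = True
        self.fraction = 1.0

    def label(self):
        if self.finished:
            return "done · 100%"
        return f"{PHASE_LABELS[self.phase]} · {round(self.fraction * 100)}%"


progress = PhasedProgress()
progress.advance(0.92)
print(progress.label())
progress.begin("transcribe")
print(progress.label())
```

Because each phase owns its own 0 to 100 range, a slow extraction never squeezes the transcription phase into the last few percent of the bar.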

05 / 07 — ENGINE MATRIX

Pick the right tool for the recording.

Local · on-device

Whisper (Tiny · Base · Small · Medium · Large v3 · Turbo): Six tiers from Tiny to Large v3 Turbo. Downloads on demand, runs entirely on-device.
Apple Speech (on-device · macOS 14+): Already on your Mac. No download, no network, no setup. Handles long files cleanly.
Parakeet (V2, en · V3, en + 25 EU): Lightning fast on Apple Silicon. Realtime-capable.

Cloud · bring your own key

Groq (Whisper Large v3 Turbo): ~10× realtime; lowest-latency cloud option.
ElevenLabs (Scribe v1 · v2): Word-level timestamps, multilingual.
Deepgram (Nova-3 · Nova-3 Medical): Speaker diarization out of the box.
Mistral (Voxtral): Verbose JSON with segment timing.
Google Gemini (2.5 / 3.1 · Pro · Flash): Audio understanding via generateContent.
Custom endpoint (OpenAI-compatible): Bring any provider. Your API key stays in the Mac's Keychain and is sent only to that provider.
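"OpenAI-compatible" generally means the provider accepts the same multipart upload as OpenAI's `/audio/transcriptions` route. The sketch below builds such a request with the Python standard library without sending it. The function name, the example base URL and the model name are placeholders, and nothing here is taken from UtterPad itself; in the app, the key would come from the Keychain rather than a parameter.

```python
import urllib.request
import uuid


def build_transcription_request(base_url, api_key, model, filename,
                                audio_bytes, response_format="verbose_json"):
    """Build (but do not send) a POST to an OpenAI-style transcription route."""
    boundary = uuid.uuid4().hex
    parts = []

    # Plain text form fields.
    for name, value in (("model", model), ("response_format", response_format)):
        field = (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f"{value}\r\n"
        )
        parts.append(field.encode("utf-8"))

    # The audio file itself.
    file_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    )
    parts.append(file_header.encode("utf-8"))
    parts.append(audio_bytes)
    parts.append(b"\r\n")
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))

    return urllib.request.Request(
        url=base_url.rstrip("/") + "/audio/transcriptions",
        data=b"".join(parts),
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )


# Placeholder values; the request is only constructed, never sent.
request = build_transcription_request(
    "https://example.invalid/v1", "sk-placeholder", "whisper-1",
    "clip.wav", b"RIFF....",
)
print(request.get_method(), request.full_url)
```

Passing the request to `urllib.request.urlopen` would perform the upload; whether a given provider honours `response_format` values such as `verbose_json` depends on that provider.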

06 / 07 — PRIVACY

What leaves your machine?

audio: Stays on disk for local engines. Only uploaded if you explicitly pick a cloud model.
api keys: Stored in the macOS Keychain. Never written to disk in plaintext, never sent anywhere except the matching provider.
telemetry: None. No analytics, no crash reporters, no opt-in dialogs. The app does not phone home.
microphone: UtterPad does not request microphone access. It processes files you point it at.

12+ models

5 export formats

transcription engines

14+ supported a/v containers

0 network requests by default

07 / 07 — DOWNLOAD

Get your subtitles ready.

Free for 30 days. $19 lifetime if you stick around.

Download UtterPad
Buy Lifetime · $19