Models & Providers

OpenWhispr supports multiple speech-to-text engines and AI providers. Run everything locally for privacy, use cloud APIs for speed, or mix both.

🖥️Local Transcription Models

Run speech-to-text entirely on your machine. No internet required, no data leaves your device.

OpenAI Whisper

via whisper.cpp · 99+ languages · GGML format

ModelSizeSpeedQuality
Tiny
75 MBFastestBasic
BaseRecommended
142 MBFastGood
Small
466 MBMediumBetter
Medium
1.5 GBSlowHigh
Large v3
3 GBSlowestBest
Turbo
1.6 GBFastGood

NVIDIA Parakeet

via sherpa-onnx · 25 languages · ONNX INT8

Parakeet TDT 0.6B v3RecommendedState-of-the-art English

State-of-the-art English accuracy with 25-language multilingual support. INT8 quantized for efficient local inference.

680 MB Fast 25 languages 100% local

Parakeet languages: English, Bulgarian, Croatian, Czech, Danish, Dutch, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, Ukrainian.

☁️Cloud Transcription Providers

Use your own API keys for cloud-powered transcription. Bring-your-own-key keeps you in control.

OpenAI

GPT-4o Mini Transcribe

Fast, accurate transcription

Very Fast
GPT-4o Transcribe

Most accurate OpenAI transcription

Standard
Whisper

Original Whisper API endpoint

Standard
Get API key

Groq

Fastest
Whisper Large v3 Turbo

216x real-time speed — the fastest cloud transcription available

Ultra-Fast
Get API key

Mistral

Voxtral Mini

Multilingual transcription from Mistral AI

Fast
Get API key

Custom Endpoint

Any OpenAI-compatible API

Connect Ollama, self-hosted Whisper servers, LocalAI, or any service with an OpenAI-compatible /audio/transcriptions endpoint.

OpenWhispr Cloud

Zero-config transcription and AI processing. No API keys needed — just sign in and go.

Managed transcription

We route to the fastest provider automatically

AI text cleanup included

Formatting, punctuation, and filler removal

Free tier available

Upgrade to Pro for unlimited usage

🧠AI Text Processing

After transcription, OpenWhispr can clean up your text — fixing grammar, removing filler words, and formatting output. Choose from cloud or local AI models.

Cloud Providers (BYOK)

OpenAI

GPT-5.2, GPT-5 Mini, GPT-5 Nano, GPT-4.1, GPT-4.1 Mini, GPT-4.1 Nano

Anthropic

Claude Opus 4.5, Claude Sonnet 4.5, Claude Haiku 4.5

Google

Gemini 3 Pro, Gemini 3 Flash, Gemini 2.5 Flash Lite

Groq

Qwen3 32B, GPT-OSS 120B, GPT-OSS 20B, LLaMA 3.3 70B, LLaMA 3.1 8B, Mixtral 8x7B

Local Models (via llama.cpp)

Qwen

Qwen3 8B

0.6B, 1.7B, 4B, 8B, 32B variants

Mistral

Mistral 7B Instruct

Q4 and Q5 quantizations

Meta LLaMA

LLaMA 3.2 3B

1B, 3B, 8B variants

OpenAI OSS

GPT-OSS 20B

Open-source flagship

Local reasoning models run via llama.cpp with GGUF quantization. All processing stays on your machine.

⚙️How It Fits Together

1. You speak

Hold your hotkey and talk naturally into any app

2. Model transcribes

Whisper, Parakeet, or a cloud provider converts speech to text

3. AI cleans up

Optional AI processing fixes grammar, removes filler, and formats your text

Try every model for free

Download OpenWhispr and choose the models that work best for you. Local models are completely free. Cloud providers use your own API keys.