Changelog

Every release, improvement, and bug fix — pulled directly from our GitHub releases.

v1.7.2May 20, 2026

1.7.2

# OpenWhispr 1.7.2 A small patch on top of 1.7.1 — zero unnecessary Keychain prompts on first launch, cloud transcription working again on Electron's net.fetch, and the Note Formatting selector now actually controls model routing.

Features

  • Note Formatting selector now routes Generate Notes. The Note Formatting tab's model / provider / mode selectors were a no-op end-to-end in 1.7.1 — every Generate Notes call silently fell through to the Cleanup model. Now wired through the same per-scope plumbing, with a Cleanup fallback so untouched-settings users keep their current behavior.
  • Wave Terminal paste. Wave Terminal is now in the Linux terminal allowlist, so auto-paste routes through Ctrl+Shift+V instead of the default Ctrl+V.

Bug Fixes

  • No more spurious macOS Keychain prompts on first launch. First-launch Keychain prompts drop from ~3 to 0. The secret-crypto backend no longer eagerly probes itself before any window appears — it defers Keychain access until you actually save your first secret. The safeStorage key backup is also only written when a new master key is generated, not on every launch.
  • Cloud transcription works again. The 1.7.0 migration from https.request to Electron's net.fetch carried over a manual Content-Length header, which Electron rejects as a forbidden Fetch header — failing with net::ERR_INVALID_ARGUMENT before any bytes hit the wire. Fix covers all five upload paths: cloud transcribe, chunked cloud transcribe, retry, file upload, and BYOK whisper-compatible.
  • Notes view state stability. Fixed a stale-ref issue where switching between notes could lose unsaved enhanced-content edits.
View on GitHub →
v1.7.1May 20, 2026

1.7.1

Features

  • macOS mouse-button hotkeys for dictation
  • Google Calendar: new "sync primary calendar only" toggle to skip shared ones
  • Make automatic note titling optional in Settings
  • Smarter local transcription that ignores silence and breathing, fully configurable for local models

Bug Fixes

  • Cloud meeting transcription restored after a recent OpenAI Realtime API change
  • Local Whisper transcription restored on Windows
  • Music auto-pause and resume works again on the latest macOS
  • Self-hosted models now use their own URLs and API keys instead of falling back to cleanup settings
  • Custom dictionary now reaches meeting note cleanup
  • Language preference preserved on retry, in meetings, and in the dictation preview
  • Custom prompts sync across windows instantly without restart
  • Transcription sync no longer crashes on bad cloud data
  • Idle local AI models release GPU memory after a timeout
  • Works correctly behind corporate proxies for model downloads, GPU detection, and calendar sync
  • Linux: Pop!OS COSMIC support, GNOME Wayland terminal auto-paste, Konsole paste fix, and the dictation overlay no longer steals focus on Sway/i3/Hyprland
View on GitHub →
v1.7.0May 4, 2026

1.7.0

Breaking for Mac users: please delete your old OpenWhispr first. 1.7.0 uses a new app ID — we had to migrate Apple Developer accounts, so you'll be asked to re-grant permissions on first launch. Notes, settings, API keys, and downloaded models carry over automatically.

Summary

  • More sign-in options — new: Sign in with Microsoft, Sign in with Apple on macOS. Auth runs on our own infra now (migrated to Better Auth); self-hosters can point at their own server.
  • Sessions in your OS keychain — survive crashes and Electron restarts. No more random sign-outs.
  • API keys encrypted at rest — all 12 BYOK + enterprise creds moved from .env to the OS keychain. Silent one-time migration on first launch.
  • Background meeting recording — navigate to other notes, open Settings, switch tabs; the audio pipeline keeps going. New floating pill shows live mic levels and clicks back to the recording note.
  • Per-note diarization preferences persist across stop/resume.
  • Cleaner mic capture with a new acoustic gate + better echo cancellation. Music pause/resume on Windows works again.
  • Per-scope LLM setup — pick different providers/models for cleanup, agent, formatting, and chat. You can now have text cleanup toggled ON but your dictation agent toggled OFF.
  • NVIDIA Parakeet parakeet-unified-en-0.6b — new English-only model, 5.91% avg WER, ~631 MB.
  • openwhispr CLI talks directly to the desktop app via a local HTTP bridge (127.0.0.1, bearer-token); falls back to cloud when the app is closed.
  • Proxy-aware fetches + OS CA trust — corporate TLS interception works.
  • Auto-learn corrections now work for Cyrillic, CJK, Arabic, Devanagari.
  • A bunch of other small bug fixes and performance improvements.

Detailed changelog

  • build(mac): migrate Apple signing to Gizmo Labs Inc. by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/639
  • fix(qdrant): set cwd to STORAGE_DIR so packaged app doesn't crash on startup by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/640
  • feat(sync): cross-device delete propagation by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/646
  • feat(meeting): interop cloud streaming providers + AEC/VAD improvements by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/656
  • Re-grant permissions modal for 1.6.11 upgraders by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/659
  • fix(chat): persist first user message when creating a new conversation by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/662
  • fix(tls): trust OS CA store for Node-side TLS; keep realtime WS on OpenAI only by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/657
  • Fix adding non-ASCII languages to dictionary by @wake0up0ne0 in https://github.com/OpenWhispr/openwhispr/pull/666
  • Fix dictionary correction tracking for TextPattern-based controls (alongside existing ValuePattern support) by @wake0up0ne0 in https://github.com/OpenWhispr/openwhispr/pull/665
  • Bundle ID migration: com.herotools.openwispr → com.gizmolabs.openwhispr by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/669
  • feat(cli): local HTTP bridge for unified CLI by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/676
  • fix(notes): update folder selection when moving notes between folders by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/678
  • refactor: split dictation cleanup from dictation agent + per-scope LLM config by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/677
  • refactor(auth): switch desktop to Better Auth + add Microsoft sign-in by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/686
  • fix(network): proxy-aware fetches + actionable connectivity errors by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/687
  • chore(release): 1.7.0 confidence cleanup by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/689
  • ci(release): rename VITE_NEON_AUTH_URL → VITE_AUTH_URL by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/690
  • fix(auth): initiate desktop OAuth from browser, not renderer by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/692
  • feat(auth): Sign in with Apple on macOS by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/691
  • fix(onnx): cap embedding segments and isolate inference in utility process by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/693
  • fix(sidecars): reap children on quit and on next launch (#683) by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/694
  • docs(changelog): plain-English 1.7.0 + lockfile refresh by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/695
  • fix(llama): raise Vulkan startup timeout, stop server before re-download by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/698
  • fix(media): fall back to media key when GSMTC fails on Windows by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/697
  • docs(readme): acknowledge Hugging Face as model hub by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/703
  • feat(transcribe): generate clientTranscriptionId for cloud sync dedup by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/702
  • feat(security): encrypt API keys at rest via safeStorage by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/629
  • fix(media): use AsTask bridge for WinRT async calls in GSMTC scripts by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/706
  • fix(media): resume playback on no-audio-detected event by @kdenney in https://github.com/OpenWhispr/openwhispr/pull/701
  • feat(auth): switch desktop to Authorization: Bearer + safeStorage by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/704
  • fix(ci): compile macos-media-remote binary in release and build workflows by @kdenney in https://github.com/OpenWhispr/openwhispr/pull/700
  • feat(reasoning): self-hosted OpenAI-compatible parity + thinking-mode toggle by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/708
  • feat(parakeet): add parakeet-unified-en-0.6b model by @milanleonard in https://github.com/OpenWhispr/openwhispr/pull/713
  • feat(meeting): background recording with floating pill and responsive layout by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/709
  • fix(runtime): unblock model downloads, surface ONNX worker errors, dedupe hotkey re-register by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/716
  • fix(linux): resolve symlink in launcher wrapper by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/717

New Contributors

  • @wake0up0ne0 made their first contribution in https://github.com/OpenWhispr/openwhispr/pull/666
  • @kdenney made their first contribution in https://github.com/OpenWhispr/openwhispr/pull/701
  • @milanleonard made their first contribution in https://github.com/OpenWhispr/openwhispr/pull/713
View on GitHub →
v1.6.10April 20, 2026

1.6.10

Features

  • Speaker diarization controls — per-meeting toggle, "N others in call" stepper, auto-label on 1-on-1s, fallback to "You/Others" when labeling is off
  • Integrations hub — new top-level view with MCP connectivity for Claude, ChatGPT, and Cursor; API keys moved here
  • Enterprise connections - connect to your organisations instance via bedrock

Bug Fixes & Improvements

  • Settings reorganization — "AI Models" split into Speech-to-Text and Language Models with clearer sub-tabs
  • Agent hotkey moved under Hotkeys
  • Meeting transcription tuned to reduce GPT-4o hallucinations during silence
  • Model cache opens at the correct folder so downloaded models are actually visible
  • Sync — cleared transcripts and deleted folders no longer reappear after restart
  • Echo leak detector now runs pre-AEC so it can actually see system audio bleed
View on GitHub →
v1.6.9April 16, 2026

1.6.9

New features

Transcript Export — Save transcripts to disk as TXT, SRT, or JSON directly from the meeting view. Cloud Sync — Notes, folders, conversations, and transcriptions now sync bidirectionally across devices. API Keys — Create and manage API keys from Settings to integrate OpenWhispr with your workflows programmatically. Linux Push-to-Talk — Native push-to-talk via evdev with guided permission setup. Auto-Paste Toggle — New setting to disable automatic pasting after dictation. Agent Folders — The agent can now list, create, and match folders semantically when organizing notes.

Improvements

  • Upgraded to Electron 41 and Node 24
  • Redesigned provider tabs as compact pill buttons
  • Local model descriptions replaced with clickable spec links
  • llama.cpp detection now probes /v1/models and prefers /chat/completions

Fixes

  • Windows hotkey no longer false-triggers after Win+L lock/unlock
  • Fixed stuck recording when push-to-talk key-up is missed on Windows
  • Relaxed speech gate thresholds with no-audio toast for failed transcriptions
  • Retry button now shown on failed transcriptions
  • Notes sync when updated externally
  • Tuned speaker diarization to prevent excessive speaker creation
  • TLS/certificate errors now surfaced in model download UI
  • Floating icon position persists across restarts
  • Fixed notification window sizing and dismiss circle visibility
View on GitHub →
v1.6.8April 15, 2026

1.6.8

New features

  • Meeting recording with on-device speaker diarization — capture mic and system audio together, and every segment is automatically labeled with who said it. Diarization runs 100% locally: voices never leave your device. System audio capture now works on Windows and Linux (Chromium loopback and the native audio portal), not just macOS.
  • Local live transcription preview — see your words appear in real time as you speak, streamed from a local Whisper or Parakeet model before you release the hotkey. Fully on-device, no cloud round trip.
  • Self-hosted support — first-class option to point OpenWhispr at your own LAN server for both transcription and agent inference. One click to swap between OpenAI, Anthropic, Gemini, a local GGUF, or your self-hosted endpoint.
  • Speaker to contact linking — voice profiles bind to calendar attendees, so recurring participants get named automatically after the first tag.
  • Improved echo cancellation — WebRTC AEC now runs as a native sidecar, paired with a custom cross-correlation detector and text-level dedupe to cleanly handle external speakers, system-audio capture, and low-echo-quality mics.
  • Gemma 4 models — Gemma 4 31B and Gemma 4 26B MoE added to the local model registry.
  • Smart transcription error handling — when a transcription fails, inline CTAs let you retry, switch provider, or open logs.

Linux

  • Hyprland global hotkeys via hyprctl, plus correct active-window detection for terminal paste.
  • GNOME shortcut conflict detection, active-key display, and proper fallback when the requested shortcut is taken.

Fixes

  • Windows: installer DPI scaling corrected, GPU compositing re-enabled.
  • Local Whisper: stricter speech gate cuts hallucinated output on silent audio.
  • Diarization: session-scoped results, disk spooling for long meetings, reliable spinner, correct speaker label propagation.
  • Notes: transcripts scope to their originating note, view auto-switches when a meeting recording starts.
  • UI: fixed 420px window width that grows only with text, consistent across phases; preview overlay redesigned.
  • Settings: inference mode auto-switches when you select a new provider.
  • i18n: speaker labels, cloud settings, integrations, and calendar strings tightened across locales.
  • Stability: 22 React Compiler lint issues resolved, tighter effect dependencies, cleaner hook usage across 15 files.
View on GitHub →
meeting-aec-helper-v1.0.0April 14, 2026

Meeting AEC Helper v1.0.0

Prebuilt WebRTC Audio Processing sidecar for meeting mic echo cancellation.
Binaries built from native/meeting-aec-helper/ against pinned webrtc-apm + abseil-cpp commits.
View on GitHub →
v1.6.7April 3, 2026

1.6.7

AI Chat & Semantic Search — Ask questions about your notes with the new embedded chat panel. Conversations sync to the cloud, and a local semantic search engine finds notes by meaning, not just keywords. Meeting Improvements — Calendar attendees automatically appear on meeting notes, auto-detection works for browser meetings, and echo cancellation cleans up mic input. Save Notes as Files — Export your notes to local Markdown files that mirror your folder structure. Responsive Settings — Settings dialog adapts gracefully to smaller windows. Bug Fixes — Resolved issues with meeting transcription, folder switching, clipboard pasting, and participant saving. Improved Linux Wayland support and Windows build signing. Huge thanks to @xAlcahest @DamianPala and others for their contributions to platform stability and improvements!
View on GitHub →
v1.6.6March 23, 2026

1.6.6

Rich Text Notes

  • Notes now use a rich text editor with Obsidian-style live preview — Markdown syntax hides as you type, giving you a clean writing experience

Meeting Transcription Upgrades

  • Dual-channel transcription — mic and system audio are captured separately with speaker-labeled chat bubbles so you can see who said what
  • Timestamped segments — meeting transcripts now include timestamps in chronological order
  • Smarter meeting notes — AI-generated summaries are now speaker-aware for more accurate meeting recaps

New Local Models

  • Mistral Nemo 12B and Gemma 3 12B are now available for on-device AI processing

macOS Audio Improvements

  • Native system audio capture via CoreAudio Tap — no more "screen recording" permission prompt on macOS 14.2+
  • macOS 15+ now shows the correct system audio consent dialog instead of the legacy screen recording one

Linux

  • KDE Wayland support — native global shortcuts now work on KDE Plasma via D-Bus, joining GNOME and Hyprland support
  • Fixed KDE Plasma overlay window and hotkey behavior
  • Fixed clipboard paste reliability on KDE Wayland

Simplified Permissions

  • Permission prompts consolidated to a single "Grant Access" button
  • Permissions are now re-checked against the OS each time you open settings, so the UI always reflects your actual state

Bug Fixes

  • Fixed Gemini agent streaming routing to the wrong endpoint
  • Fixed Windows mic volume being permanently altered during dictation
  • Fixed mono transcription failure on Linux
  • Fixed Bluetooth audio issues during meetings
  • Fixed meeting detection notifications firing when you're already in a meeting
  • Fixed held modifier keys not releasing before paste on Windows
  • Fixed paused media being unpaused during dictation
  • Fixed Google OAuth users skipping onboarding
  • Fixed AI cleanup prompt refusing to transcribe command-like speech
  • Fixed agent hotkey conflicts not showing a warning
View on GitHub →
v1.6.4March 16, 2026

1.6.4

Meeting Mode

  • Meeting mode hotkey — Assign a dedicated hotkey to instantly snap the panel into meeting mode and start a new meeting note
  • Smarter meeting detection — Reduced false positives; background apps like FaceTime no longer trigger notifications unless mic activity is detected
  • Meeting detection toggle — Disable meeting detection entirely from Settings

Auto-Update

  • Update notification — A slide-in notification appears when a new version is available, with an "Update Now" button

Multi-Monitor

  • Cursor-aware positioning — The floating icon now appears on whichever monitor your cursor is on

New Models

  • GPT-5.4 — Added as the new flagship OpenAI model
  • Qwen 3.5 — New local models added; removed sub-1B models

Bug Fixes

  • Fixed paused media resuming unexpectedly on Windows when no active playback sessions exist
  • Fixed Windows hotkey listener state corruption when switching hotkeys
  • Fixed silence detection rejecting valid speech (lowered threshold)
  • Fixed Windows paste not working in Windows Terminal (scan codes now included)
  • Fixed API keys saved in Settings being overridden by shell environment variables
  • Fixed local LLM models being deleted during Windows app updates
  • Fixed hotkey tooltip display for multi-key combos on macOS
  • Fixed RPM install conflicts with other Electron apps on Linux
  • Added Tailscale VPN (CGNAT range) support for self-hosted setups

Other

  • Panel start position now persists across app restarts
  • Cross-window settings sync (hotkey changes apply instantly)
  • Account deletion flow with cloud data cleanup
  • Agent mode window renamed to "Agent Chat"
View on GitHub →
v1.6.3March 14, 2026

1.6.3

Clearer Permissions

  • "Screen Recording" → "System Audio" — all permission prompts now clearly state we capture other participants' audio, not your screen
  • Electron 39 — eliminates the purple "screen recording" indicator, the "Your screen is being observed" lock screen message, and the misleading permission prompt on macOS 14.2+

Better Soft Voice Recognition

  • Auto Gain Control now enabled for dictation, automatically boosting quiet speech
  • Lower VAD sensitivity so soft-spoken audio is no longer missed
  • Less clipping at the start of speech — increased padding so quiet beginnings aren't cut off

Linux

  • Hyprland Wayland support — native global shortcuts using hyprctl keybindings + D-Bus

Bug Fixes

  • Fixed wl-copy failing silently on Wayland due to a too-short timeout
  • Fixed media staying paused after recording silence with "Pause media on dictation" enabled
View on GitHub →
v1.6.2March 11, 2026

1.6.2

System Audio Capture

Record system audio alongside your microphone in Notes — capture meeting audio, lectures, and more. Automatically enabled when screen recording permission is granted (macOS).

Smarter Meeting Detection

Meeting detection now uses native OS events instead of polling, cutting background CPU usage to near-zero.

Bug Fixes

  • Fixed hotkey reliability on Windows 11 (modifier-only shortcuts)
  • Fixed macOS Globe key failing silently on fresh installs — now prompts for accessibility permission
  • Fixed initial audio being dropped during realtime streaming
  • Fixed custom dictionary errors on large dictionaries
  • Fixed Parakeet model extraction on Windows 10
View on GitHub →
v1.6.1March 9, 2026

1.6.1

  • Even faster transcription on OpenWhispr Pro + Websockets for gpt4o
  • Bug fixes
View on GitHub →
v1.6.0March 7, 2026

1.6.0

  • Agent mode! Separate hotkey to spin up a chat, now you don't have to open a claude/chatgpt window. Check it out in settings.
  • Link your Google calendar and record meetings without a bot
  • Search your transcripts and notes
  • Brought back the ability to clear all transcripts
  • Choose voice recorder start position
  • Toggle to minimize the dashboard on startup
  • Toggle to pause music etc. when dictating
  • A bunch of other bug fixes and performance improvements
Massive thank you to @xAlcahest for all the major contributions to linux stability and other improvements!
View on GitHub →
v1.5.5March 2, 2026

1.5.5

  • Added Google Gemma local LLMs
  • Bug fixes and performance improvements
View on GitHub →
v1.5.4February 26, 2026

1.5.4

  • Word edit detection + auto add to dictionary
View on GitHub →
windows-text-monitor-v1.0.0February 26, 2026

Windows Text Monitor v1.0.0

Prebuilt Windows text monitor binary for auto-learn correction monitoring.
This binary uses Windows UI Automation to detect text field changes after paste.
View on GitHub →
v1.5.1February 24, 2026

1.5.1

  • CUDA support for whisper-server (local whisper models)
  • GPU support for llama.cpp (Local LLMs)
  • Other small bug fixes and performance improvements
View on GitHub →
v1.5.0February 23, 2026

1.5.0

  • Create and manage notes
  • Upload audio files and automatically transcribe them into notes
  • AI actions in notes
  • Referral scheme (give 1 month get 1 month)
  • Improved dark mode
View on GitHub →
v1.4.11February 13, 2026

1.4.11

  • Japanese as an interface language
  • Paste improvements on Windows
View on GitHub →
v1.4.10February 13, 2026

1.4.10

  • Better support for Simplified vs Traditional Chinese on OpenWhispr Pro
  • Reliability improvements for OpenWhispr Pro
View on GitHub →
v1.4.9February 13, 2026

1.4.9

  • Better auto-paste support for Linux users
  • Better language support for OpenWhispr Pro
View on GitHub →
windows-fast-paste-v1.0.0February 13, 2026

Windows Fast Paste v1.0.0

Prebuilt Windows fast-paste binary for clipboard paste operations.
Uses Win32 SendInput API with automatic terminal detection (Ctrl+V for normal apps, Ctrl+Shift+V for terminals).
View on GitHub →
v1.4.8February 12, 2026

1.4.8

  • Internationlization ( EN, PT, IT, ZT, DE, ES, RU, ZH)
  • Bug fixes & performance improvements
View on GitHub →
v1.4.5February 10, 2026

1.4.5

  • Faster startup — Lazy-loaded heavy components and deferred non-critical init so you get to dictating quicker
  • Redesigned toasts — Dark HUD-style notifications that actually look like they belong here
  • New loading screen — Branded splash with a proper spinner instead of staring at nothing
  • Dictation sound effects — Optional start/stop audio cues (toggle in settings) with warm, soft tones
  • Auto-hide floating icon — The dictation icon can now get out of your way
  • Smoother scrolling — Thinner, floating scrollbar thumbs that fade in when needed
  • Dark mode fixes — Dropdowns and toasts now respect your theme properly
  • Auth-aware routing — If your session expires, you'll see a sign-in screen instead of a void
  • Globe key persistence — Your hotkey config actually sticks now
  • Discord support — Find us in the support menu
View on GitHub →