Changelog

Every release, improvement, and bug fix — pulled directly from our GitHub releases.

v1.7.5July 11, 2026

1.7.5

# OpenWhispr 1.7.5 New models and providers, multiple hotkeys per action, and around a dozen fixes across meetings, notes, transcription, and every platform.

New

Latest cloud models: OpenAI GPT-5.6 (Sol/Terra/Luna) and Anthropic Claude Fable 5 and Sonnet 5 — plus Claude Fable 5 for AWS Bedrock.

OpenRouter is now a built-in provider, with a searchable, grouped picker for its 300+ models.

Corti can run AI cleanup and reasoning, not just transcription. For healthcare use, that text stays on Corti (EU) or the HIPAA-compliant OpenWhispr Cloud — never a third-party model.

Tinfoil confidential transcription now covers uploaded audio, not just live dictation.

Multiple hotkeys per action — bind more than one key to dictation, agent, voice agent, or meeting.

Enterprise / AWS Bedrock: region-aware model IDs, a live model catalog, and Agent Mode (tool-calling) support.

Fixed

Meetings: remote speakers no longer collapse into one; long calls reconnect past the 60-minute limit without losing audio; fewer words dropped during cross-talk.

Notes: enhancement no longer reuses another note's transcript or renames notes you've already titled.

Cloud transcription: long recordings retry failed chunks instead of failing or leaving gaps.

macOS: dictation into Claude Desktop and claude.ai no longer breaks after the first paste.

Windows: fixed a false "No audio detected", the firewall prompt for local Parakeet, and local transcription failing on some builds.

Linux: fixed Wayland global-hotkey crashes on Node 24.

Plus several smaller fixes — full list in the changelog.

Thanks

Thanks to @xAlcahest, @danielmccannsayles, @IdrisGit, and @Alexey-Sachko for their work on this release — and a welcome to @Alexey-Sachko on their first contribution!

View on GitHub →

v1.7.4July 7, 2026

1.7.4

# OpenWhispr 1.7.4 A feature-and-reliability release: two new privacy-first ways to transcribe and reason, native Windows meeting audio, sync that keeps working in the background, and a broad sweep of fixes across dictation, clipboard, updates, and Linux — plus security hardening. Nothing here changes your existing setup; it all just works better.

Highlights

Tinfoil — confidential cloud (BYOK). A new provider that runs both transcription *and* AI cleanup/agent inside attested secure enclaves, so your audio and text stay private even when processed in the cloud. (#875, #944)

Azure AI Foundry / Azure OpenAI speech-to-text. Point the custom transcription provider at your own Azure deployment — OpenWhispr now speaks Azure's endpoint format and auth. (#997)

Native Windows meeting audio. Meetings now capture sound from every app on every output device instead of only your default speakers, via a native Windows helper. Older Windows automatically falls back to the previous method. (#960)

Enterprise SSO at sign-in, right from onboarding. (#1034)

Sync

Runs in the background, even from the tray — dictations upload and other devices' changes pull down on window focus, on network reconnect, and every 5 minutes, even if you never open the main window. (#1070, #1072)

Snippets now sync across your devices, like your dictionary already does. (#1037)

No more duplicate folders when you sign in on a new device. (#1086)

Dictation & notes

Cleaner cleanup — your transcript gets tidied instead of occasionally getting "answered," and dictations only go to your voice agent when you actually address it by name. (#1073)

Turkish snippets — triggers containing İ / ı now match correctly. (#1050)

API keys no longer vanish when you type one and click away — a papercut that mostly hit Corti/BYOK onboarding. (#1039)

Clipboard, audio & media

Rich clipboard formats survive auto-paste, clipboard restore is faster, and pasting reliably targets the right app on macOS. (#1020, #1038, #1000)

Mic falls back to your default device when a saved one goes stale, and empty recordings no longer crash transcription. (#978, #891)

Paused music/video resumes the instant you stop recording (not after transcription) and won't get stuck paused on a quick tap. (#1030, #1061)

Platform & reliability

Updates: fixed a crash on the first "Install & Restart" click after an update — macOS installs no longer stall. (#1012)

Local transcription: GPU model reloads after sleep, multi-GPU machines pick the right card, the voice-activity model now ships in packaged builds, realtime streaming is hardened, and llama.cpp was updated. (#1032, #1018, #1000, #1044, #995)

Hotkeys: modifier-only hotkeys (dictation, voice agent, agent, meeting) now work simultaneously on Windows/Linux. (#1001)

Linux: installs on openSUSE and launches on hardened kernels that restrict user namespaces. (#1014, #1042)

Self-hosted: transcription endpoints can now choose their own model. (#1043)

Quieter startup — suppressed benign Qdrant warnings. (#1083)

Security

Cleared all critical/high dependency alerts and restored lockfile integrity with a CI guard (SOC 2 secure-code hardening). (#1051, #1069)

Contributors

Thanks to everyone who shipped 1.7.4 — with a warm welcome to our first-time contributors (*): @xAlcahest · @Chadpiha · @IdrisGit · @danielmccannsayles* · @zachdotai* · @bikramjitk* · @SyntaxSawdust* · @UmutEmreOnder* · @greatcoat* · @chirag127*

View on GitHub →

windows-system-audio-helper-v1.0.0July 7, 2026

Windows System Audio Helper v1.0.0

Prebuilt Windows system audio helper binary for meeting transcription.

Captures system audio via WASAPI process loopback (exclude mode), hearing every application on every output device while excluding OpenWhispr's own audio. Requires Windows 10 2004 or later; older systems automatically fall back to Chromium display-media loopback.

View on GitHub →

v1.7.3June 24, 2026

1.7.3

✨ New

Snippets — say a shortcut like "cal link" and the full text gets pasted

Voice Agent hotkey — one key sends your speech straight to your AI assistant

Medical dictation via Corti for clinical-grade accuracy

Smarter setup that adapts to how *you'll* actually use OpenWhispr

Redesigned dictionary that now syncs across your devices

Audio upload gets its own settings and a cancel button

Discarded dictations are saved, so nothing gets lost

Notification controls to pick which alerts OpenWhispr can show

New AI models: Claude Opus 4.8, Gemini 3.5 Flash, Gemma 4, plus xAI transcription

View on GitHub →

v1.7.2May 20, 2026

1.7.2

# OpenWhispr 1.7.2 A small patch on top of 1.7.1 — zero unnecessary Keychain prompts on first launch, cloud transcription working again on Electron's net.fetch, and the Note Formatting selector now actually controls model routing.

Features

Note Formatting selector now routes Generate Notes. The Note Formatting tab's model / provider / mode selectors were a no-op end-to-end in 1.7.1 — every Generate Notes call silently fell through to the Cleanup model. Now wired through the same per-scope plumbing, with a Cleanup fallback so untouched-settings users keep their current behavior.

Wave Terminal paste. Wave Terminal is now in the Linux terminal allowlist, so auto-paste routes through Ctrl+Shift+V instead of the default Ctrl+V.

Bug Fixes

No more spurious macOS Keychain prompts on first launch. First-launch Keychain prompts drop from ~3 to 0. The secret-crypto backend no longer eagerly probes itself before any window appears — it defers Keychain access until you actually save your first secret. The safeStorage key backup is also only written when a new master key is generated, not on every launch.

Cloud transcription works again. The 1.7.0 migration from https.request to Electron's net.fetch carried over a manual Content-Length header, which Electron rejects as a forbidden Fetch header — failing with net::ERR_INVALID_ARGUMENT before any bytes hit the wire. Fix covers all five upload paths: cloud transcribe, chunked cloud transcribe, retry, file upload, and BYOK whisper-compatible.

Notes view state stability. Fixed a stale-ref issue where switching between notes could lose unsaved enhanced-content edits.

View on GitHub →

v1.7.1May 20, 2026

1.7.1

Features

macOS mouse-button hotkeys for dictation

Google Calendar: new "sync primary calendar only" toggle to skip shared ones

Make automatic note titling optional in Settings

Smarter local transcription that ignores silence and breathing, fully configurable for local models

Bug Fixes

Cloud meeting transcription restored after a recent OpenAI Realtime API change

Local Whisper transcription restored on Windows

Music auto-pause and resume works again on the latest macOS

Self-hosted models now use their own URLs and API keys instead of falling back to cleanup settings

Custom dictionary now reaches meeting note cleanup

Language preference preserved on retry, in meetings, and in the dictation preview

Custom prompts sync across windows instantly without restart

Transcription sync no longer crashes on bad cloud data

Idle local AI models release GPU memory after a timeout

Works correctly behind corporate proxies for model downloads, GPU detection, and calendar sync

Linux: Pop!OS COSMIC support, GNOME Wayland terminal auto-paste, Konsole paste fix, and the dictation overlay no longer steals focus on Sway/i3/Hyprland

View on GitHub →

v1.7.0May 4, 2026

1.7.0

Breaking for Mac users: please delete your old OpenWhispr first. 1.7.0 uses a new app ID — we had to migrate Apple Developer accounts, so you'll be asked to re-grant permissions on first launch. Notes, settings, API keys, and downloaded models carry over automatically.

Summary

More sign-in options — new: Sign in with Microsoft, Sign in with Apple on macOS. Auth runs on our own infra now (migrated to Better Auth); self-hosters can point at their own server.

Sessions in your OS keychain — survive crashes and Electron restarts. No more random sign-outs.

API keys encrypted at rest — all 12 BYOK + enterprise creds moved from .env to the OS keychain. Silent one-time migration on first launch.

Background meeting recording — navigate to other notes, open Settings, switch tabs; the audio pipeline keeps going. New floating pill shows live mic levels and clicks back to the recording note.

Per-note diarization preferences persist across stop/resume.

Cleaner mic capture with a new acoustic gate + better echo cancellation. Music pause/resume on Windows works again.

Per-scope LLM setup — pick different providers/models for cleanup, agent, formatting, and chat. You can now have text cleanup toggled ON but your dictation agent toggled OFF.

NVIDIA Parakeet parakeet-unified-en-0.6b — new English-only model, 5.91% avg WER, ~631 MB.

openwhispr CLI talks directly to the desktop app via a local HTTP bridge (127.0.0.1, bearer-token); falls back to cloud when the app is closed.

Proxy-aware fetches + OS CA trust — corporate TLS interception works.

Auto-learn corrections now work for Cyrillic, CJK, Arabic, Devanagari.

A bunch of other small bug fixes and performance improvements.

Detailed changelog

build(mac): migrate Apple signing to Gizmo Labs Inc. by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/639

fix(qdrant): set cwd to STORAGE_DIR so packaged app doesn't crash on startup by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/640

feat(sync): cross-device delete propagation by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/646

feat(meeting): interop cloud streaming providers + AEC/VAD improvements by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/656

Re-grant permissions modal for 1.6.11 upgraders by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/659

fix(chat): persist first user message when creating a new conversation by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/662

fix(tls): trust OS CA store for Node-side TLS; keep realtime WS on OpenAI only by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/657

Fix adding non-ASCII languages to dictionary by @wake0up0ne0 in https://github.com/OpenWhispr/openwhispr/pull/666

Fix dictionary correction tracking for TextPattern-based controls (alongside existing ValuePattern support) by @wake0up0ne0 in https://github.com/OpenWhispr/openwhispr/pull/665

Bundle ID migration: com.herotools.openwispr → com.gizmolabs.openwhispr by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/669

feat(cli): local HTTP bridge for unified CLI by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/676

fix(notes): update folder selection when moving notes between folders by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/678

refactor: split dictation cleanup from dictation agent + per-scope LLM config by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/677

refactor(auth): switch desktop to Better Auth + add Microsoft sign-in by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/686

fix(network): proxy-aware fetches + actionable connectivity errors by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/687

chore(release): 1.7.0 confidence cleanup by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/689

ci(release): rename VITE_NEON_AUTH_URL → VITE_AUTH_URL by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/690

fix(auth): initiate desktop OAuth from browser, not renderer by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/692

feat(auth): Sign in with Apple on macOS by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/691

fix(onnx): cap embedding segments and isolate inference in utility process by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/693

fix(sidecars): reap children on quit and on next launch (#683) by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/694

docs(changelog): plain-English 1.7.0 + lockfile refresh by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/695

fix(llama): raise Vulkan startup timeout, stop server before re-download by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/698

fix(media): fall back to media key when GSMTC fails on Windows by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/697

docs(readme): acknowledge Hugging Face as model hub by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/703

feat(transcribe): generate clientTranscriptionId for cloud sync dedup by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/702

feat(security): encrypt API keys at rest via safeStorage by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/629

fix(media): use AsTask bridge for WinRT async calls in GSMTC scripts by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/706

fix(media): resume playback on no-audio-detected event by @kdenney in https://github.com/OpenWhispr/openwhispr/pull/701

feat(auth): switch desktop to Authorization: Bearer + safeStorage by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/704

fix(ci): compile macos-media-remote binary in release and build workflows by @kdenney in https://github.com/OpenWhispr/openwhispr/pull/700

feat(reasoning): self-hosted OpenAI-compatible parity + thinking-mode toggle by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/708

feat(parakeet): add parakeet-unified-en-0.6b model by @milanleonard in https://github.com/OpenWhispr/openwhispr/pull/713

feat(meeting): background recording with floating pill and responsive layout by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/709

fix(runtime): unblock model downloads, surface ONNX worker errors, dedupe hotkey re-register by @gabrielste1n in https://github.com/OpenWhispr/openwhispr/pull/716

fix(linux): resolve symlink in launcher wrapper by @xAlcahest in https://github.com/OpenWhispr/openwhispr/pull/717

New Contributors

@wake0up0ne0 made their first contribution in https://github.com/OpenWhispr/openwhispr/pull/666

@kdenney made their first contribution in https://github.com/OpenWhispr/openwhispr/pull/701

@milanleonard made their first contribution in https://github.com/OpenWhispr/openwhispr/pull/713

View on GitHub →

v1.6.10April 20, 2026

1.6.10

Features

Speaker diarization controls — per-meeting toggle, "N others in call" stepper, auto-label on 1-on-1s, fallback to "You/Others" when labeling is off

Integrations hub — new top-level view with MCP connectivity for Claude, ChatGPT, and Cursor; API keys moved here

Enterprise connections - connect to your organisations instance via bedrock

Bug Fixes & Improvements

Settings reorganization — "AI Models" split into Speech-to-Text and Language Models with clearer sub-tabs

Agent hotkey moved under Hotkeys

Meeting transcription tuned to reduce GPT-4o hallucinations during silence

Model cache opens at the correct folder so downloaded models are actually visible

Sync — cleared transcripts and deleted folders no longer reappear after restart

Echo leak detector now runs pre-AEC so it can actually see system audio bleed

View on GitHub →

v1.6.9April 16, 2026

1.6.9

New features

Transcript Export — Save transcripts to disk as TXT, SRT, or JSON directly from the meeting view. Cloud Sync — Notes, folders, conversations, and transcriptions now sync bidirectionally across devices. API Keys — Create and manage API keys from Settings to integrate OpenWhispr with your workflows programmatically. Linux Push-to-Talk — Native push-to-talk via evdev with guided permission setup. Auto-Paste Toggle — New setting to disable automatic pasting after dictation. Agent Folders — The agent can now list, create, and match folders semantically when organizing notes.

Improvements

Upgraded to Electron 41 and Node 24

Redesigned provider tabs as compact pill buttons

Local model descriptions replaced with clickable spec links

llama.cpp detection now probes /v1/models and prefers /chat/completions

Fixes

Windows hotkey no longer false-triggers after Win+L lock/unlock

Fixed stuck recording when push-to-talk key-up is missed on Windows

Relaxed speech gate thresholds with no-audio toast for failed transcriptions

Retry button now shown on failed transcriptions

Notes sync when updated externally

Tuned speaker diarization to prevent excessive speaker creation

TLS/certificate errors now surfaced in model download UI

Floating icon position persists across restarts

Fixed notification window sizing and dismiss circle visibility

View on GitHub →

v1.6.8April 15, 2026

1.6.8

New features

Meeting recording with on-device speaker diarization — capture mic and system audio together, and every segment is automatically labeled with who said it. Diarization runs 100% locally: voices never leave your device. System audio capture now works on Windows and Linux (Chromium loopback and the native audio portal), not just macOS.

Local live transcription preview — see your words appear in real time as you speak, streamed from a local Whisper or Parakeet model before you release the hotkey. Fully on-device, no cloud round trip.

Self-hosted support — first-class option to point OpenWhispr at your own LAN server for both transcription and agent inference. One click to swap between OpenAI, Anthropic, Gemini, a local GGUF, or your self-hosted endpoint.

Speaker to contact linking — voice profiles bind to calendar attendees, so recurring participants get named automatically after the first tag.

Improved echo cancellation — WebRTC AEC now runs as a native sidecar, paired with a custom cross-correlation detector and text-level dedupe to cleanly handle external speakers, system-audio capture, and low-echo-quality mics.

Gemma 4 models — Gemma 4 31B and Gemma 4 26B MoE added to the local model registry.

Smart transcription error handling — when a transcription fails, inline CTAs let you retry, switch provider, or open logs.

Linux

Hyprland global hotkeys via hyprctl, plus correct active-window detection for terminal paste.

GNOME shortcut conflict detection, active-key display, and proper fallback when the requested shortcut is taken.

Fixes

Windows: installer DPI scaling corrected, GPU compositing re-enabled.

Local Whisper: stricter speech gate cuts hallucinated output on silent audio.

Diarization: session-scoped results, disk spooling for long meetings, reliable spinner, correct speaker label propagation.

Notes: transcripts scope to their originating note, view auto-switches when a meeting recording starts.

UI: fixed 420px window width that grows only with text, consistent across phases; preview overlay redesigned.

Settings: inference mode auto-switches when you select a new provider.

i18n: speaker labels, cloud settings, integrations, and calendar strings tightened across locales.

Stability: 22 React Compiler lint issues resolved, tighter effect dependencies, cleaner hook usage across 15 files.

View on GitHub →

meeting-aec-helper-v1.0.0April 14, 2026

Meeting AEC Helper v1.0.0

Prebuilt WebRTC Audio Processing sidecar for meeting mic echo cancellation.

Binaries built from native/meeting-aec-helper/ against pinned webrtc-apm + abseil-cpp commits.

View on GitHub →

v1.6.7April 3, 2026

1.6.7

AI Chat & Semantic Search — Ask questions about your notes with the new embedded chat panel. Conversations sync to the cloud, and a local semantic search engine finds notes by meaning, not just keywords. Meeting Improvements — Calendar attendees automatically appear on meeting notes, auto-detection works for browser meetings, and echo cancellation cleans up mic input. Save Notes as Files — Export your notes to local Markdown files that mirror your folder structure. Responsive Settings — Settings dialog adapts gracefully to smaller windows. Bug Fixes — Resolved issues with meeting transcription, folder switching, clipboard pasting, and participant saving. Improved Linux Wayland support and Windows build signing. Huge thanks to @xAlcahest @DamianPala and others for their contributions to platform stability and improvements!

View on GitHub →

v1.6.6March 23, 2026

1.6.6

Rich Text Notes

Notes now use a rich text editor with Obsidian-style live preview — Markdown syntax hides as you type, giving you a clean writing experience

Meeting Transcription Upgrades

Dual-channel transcription — mic and system audio are captured separately with speaker-labeled chat bubbles so you can see who said what

Timestamped segments — meeting transcripts now include timestamps in chronological order

Smarter meeting notes — AI-generated summaries are now speaker-aware for more accurate meeting recaps

New Local Models

Mistral Nemo 12B and Gemma 3 12B are now available for on-device AI processing

macOS Audio Improvements

Native system audio capture via CoreAudio Tap — no more "screen recording" permission prompt on macOS 14.2+

macOS 15+ now shows the correct system audio consent dialog instead of the legacy screen recording one

Linux

KDE Wayland support — native global shortcuts now work on KDE Plasma via D-Bus, joining GNOME and Hyprland support

Fixed KDE Plasma overlay window and hotkey behavior

Fixed clipboard paste reliability on KDE Wayland

Simplified Permissions

Permission prompts consolidated to a single "Grant Access" button

Permissions are now re-checked against the OS each time you open settings, so the UI always reflects your actual state

Bug Fixes

Fixed Gemini agent streaming routing to the wrong endpoint

Fixed Windows mic volume being permanently altered during dictation

Fixed mono transcription failure on Linux

Fixed Bluetooth audio issues during meetings

Fixed meeting detection notifications firing when you're already in a meeting

Fixed held modifier keys not releasing before paste on Windows

Fixed paused media being unpaused during dictation

Fixed Google OAuth users skipping onboarding

Fixed AI cleanup prompt refusing to transcribe command-like speech

Fixed agent hotkey conflicts not showing a warning

View on GitHub →

v1.6.5March 17, 2026

1.6.5

Meeting detection improvements

View on GitHub →

v1.6.4March 16, 2026

1.6.4

Meeting Mode

Meeting mode hotkey — Assign a dedicated hotkey to instantly snap the panel into meeting mode and start a new meeting note

Smarter meeting detection — Reduced false positives; background apps like FaceTime no longer trigger notifications unless mic activity is detected

Meeting detection toggle — Disable meeting detection entirely from Settings

Auto-Update

Update notification — A slide-in notification appears when a new version is available, with an "Update Now" button

Multi-Monitor

Cursor-aware positioning — The floating icon now appears on whichever monitor your cursor is on

New Models

GPT-5.4 — Added as the new flagship OpenAI model

Qwen 3.5 — New local models added; removed sub-1B models

Bug Fixes

Fixed paused media resuming unexpectedly on Windows when no active playback sessions exist

Fixed Windows hotkey listener state corruption when switching hotkeys

Fixed silence detection rejecting valid speech (lowered threshold)

Fixed Windows paste not working in Windows Terminal (scan codes now included)

Fixed API keys saved in Settings being overridden by shell environment variables

Fixed local LLM models being deleted during Windows app updates

Fixed hotkey tooltip display for multi-key combos on macOS

Fixed RPM install conflicts with other Electron apps on Linux

Added Tailscale VPN (CGNAT range) support for self-hosted setups

Other

Panel start position now persists across app restarts

Cross-window settings sync (hotkey changes apply instantly)

Account deletion flow with cloud data cleanup

Agent mode window renamed to "Agent Chat"

View on GitHub →

v1.6.3March 14, 2026

1.6.3

Clearer Permissions

"Screen Recording" → "System Audio" — all permission prompts now clearly state we capture other participants' audio, not your screen

Electron 39 — eliminates the purple "screen recording" indicator, the "Your screen is being observed" lock screen message, and the misleading permission prompt on macOS 14.2+

Better Soft Voice Recognition

Auto Gain Control now enabled for dictation, automatically boosting quiet speech

Lower VAD sensitivity so soft-spoken audio is no longer missed

Less clipping at the start of speech — increased padding so quiet beginnings aren't cut off

Linux

Hyprland Wayland support — native global shortcuts using hyprctl keybindings + D-Bus

Bug Fixes

Fixed wl-copy failing silently on Wayland due to a too-short timeout

Fixed media staying paused after recording silence with "Pause media on dictation" enabled

View on GitHub →

v1.6.2March 11, 2026

1.6.2

System Audio Capture

Record system audio alongside your microphone in Notes — capture meeting audio, lectures, and more. Automatically enabled when screen recording permission is granted (macOS).

Smarter Meeting Detection

Meeting detection now uses native OS events instead of polling, cutting background CPU usage to near-zero.

Bug Fixes

Fixed hotkey reliability on Windows 11 (modifier-only shortcuts)

Fixed macOS Globe key failing silently on fresh installs — now prompts for accessibility permission

Fixed initial audio being dropped during realtime streaming

Fixed custom dictionary errors on large dictionaries

Fixed Parakeet model extraction on Windows 10

View on GitHub →

v1.6.1March 9, 2026

1.6.1

Even faster transcription on OpenWhispr Pro + Websockets for gpt4o

Bug fixes

View on GitHub →

v1.6.0March 7, 2026

1.6.0

Agent mode! Separate hotkey to spin up a chat, now you don't have to open a claude/chatgpt window. Check it out in settings.

Link your Google calendar and record meetings without a bot

Search your transcripts and notes

Brought back the ability to clear all transcripts

Choose voice recorder start position

Toggle to minimize the dashboard on startup

Toggle to pause music etc. when dictating

A bunch of other bug fixes and performance improvements

Massive thank you to @xAlcahest for all the major contributions to linux stability and other improvements!

View on GitHub →

v1.5.5March 2, 2026

1.5.5

Added Google Gemma local LLMs

Bug fixes and performance improvements

View on GitHub →

v1.5.4February 26, 2026

1.5.4

Word edit detection + auto add to dictionary

View on GitHub →

windows-text-monitor-v1.0.0February 26, 2026

Windows Text Monitor v1.0.0

Prebuilt Windows text monitor binary for auto-learn correction monitoring.

This binary uses Windows UI Automation to detect text field changes after paste.

View on GitHub →

v1.5.3February 25, 2026

1.5.3

Bug fixes and performance improvements

View on GitHub →

v1.5.2February 24, 2026

1.5.2

Bug fixes and performance improvements

View on GitHub →

v1.5.1February 24, 2026

1.5.1

CUDA support for whisper-server (local whisper models)

GPU support for llama.cpp (Local LLMs)

Other small bug fixes and performance improvements

View on GitHub →

v1.5.0February 23, 2026

1.5.0

Create and manage notes

Upload audio files and automatically transcribe them into notes

AI actions in notes

Referral scheme (give 1 month get 1 month)

Improved dark mode

View on GitHub →

v1.4.11February 13, 2026

1.4.11

Japanese as an interface language

Paste improvements on Windows

View on GitHub →

v1.4.10February 13, 2026

1.4.10

Better support for Simplified vs Traditional Chinese on OpenWhispr Pro

Reliability improvements for OpenWhispr Pro

View on GitHub →

v1.4.9February 13, 2026

1.4.9

Better auto-paste support for Linux users

Better language support for OpenWhispr Pro

View on GitHub →

windows-fast-paste-v1.0.0February 13, 2026

Windows Fast Paste v1.0.0

Prebuilt Windows fast-paste binary for clipboard paste operations.

Uses Win32 SendInput API with automatic terminal detection (Ctrl+V for normal apps, Ctrl+Shift+V for terminals).

View on GitHub →