Changelog
Release notes and updates for TypeWhisper.
May 2026
May 14, 2026
TypeWhisper v0.8.1
TypeWhisper 0.8.1 is a focused Windows release that expands cloud provider coverage, improves plugin marketplace reliability, and fixes several dictation and local transcription issues found after 0.8.0.
New Features
- Added OpenRouter model catalog support so compatible models can be discovered and selected more easily.
- Added OpenRouter transcription support for cloud dictation workflows.
- Added the xAI Grok provider plugin with streaming transcription, text generation, and TTS support.
- Added branded plugin logos for major bundled providers.
Improvements
- Hardened plugin registry category handling so marketplace entries stay compatible as categories expand.
- Made the Live Transcript window movable and improved its settings experience.
- Improved local GPU acceleration reload behavior for local transcription providers.
Fixes
- Fixed final decoding for cloud dictation responses.
- Fixed Live Transcript failure handling so provider errors are surfaced more reliably.
- Improved model acceleration layout coverage and related plugin system behavior.
May 14, 2026
Bug Fixes
- Clear blank sensitive header secrets
- Gate local API token enforcement
- Store sensitive headers in secrets
- Secure local API access
Other Changes
- Add WhisperKit Distil Large v3 Turbo model (#539)
- Recover failed dictation audio (#534)
- Add Parakeet live transcription session
- Fix echo handling localization labels (#529)
- Make plugin settings windows scrollable (#527)
- Fix Supertonic plugin release build
- Fix Supertonic synthesizer cache synchronization
- Add experimental Supertonic TTS plugin
- Clean up plugin registry release metadata
- Fix onboarding welcome layout and transcription helper compatibility
- Fix Parakeet onboarding completion
- Extend OpenAI compatible transcription timeout
- Handle multichannel input capture formats
May 13, 2026
TypeWhisper v0.8.0
TypeWhisper 0.8.0 is a major Windows release focused on faster local transcription, a broader plugin marketplace, richer speech and TTS provider support, and steadier everyday dictation behavior.
New Features
- Added GPU acceleration for local transcription so supported local models can run faster on capable Windows systems.
- Added multi-category plugin marketplace support, making discovery clearer as plugins expand across transcription, text generation, TTS, memory, and workflow integrations.
- Added the Supertonic TTS provider plugin and a dedicated TTS marketplace category.
- Added side-specific modifier hotkeys for workflows that need distinct left/right keyboard shortcuts.
- Added Russian localization for the Windows app.
- Added Windows indicator style variants for a more flexible recording status experience.
- Added licensed industry term packs for specialized vocabulary workflows.
- Expanded OpenAI and ChatGPT plugin capabilities.
Improvements
- Improved plugin marketplace behavior and polish, including category handling and uninstall state feedback.
- Improved whisper.cpp native runtime packaging so bundled local transcription dependencies are more reliable.
- Improved touch scrolling across ScrollViewer controls.
- Limited Supertonic TTS runtime packaging to keep bundled plugin output cleaner.
Fixes
- Fixed microphone reconnect handling.
- Reduced repeated short transcription phrases and improved dictation ellipsis cleanup.
- Fixed plaintext insertion for email clients.
- Fixed live preview font size updates.
- Fixed live transcript no-speech handling.
- Fixed dictionary usage count localization.
- Fixed marketplace plugin uninstall state handling.
May 13, 2026
Bug Fixes
- Clear blank sensitive header secrets
- Gate local API token enforcement
- Store sensitive headers in secrets
- Secure local API access
Other Changes
- Fix Supertonic plugin release build
- Fix Supertonic synthesizer cache synchronization
- Add experimental Supertonic TTS plugin
- Clean up plugin registry release metadata
- Fix onboarding welcome layout and transcription helper compatibility
- Fix Parakeet onboarding completion
- Extend OpenAI compatible transcription timeout
- Handle multichannel input capture formats
May 13, 2026
Other Changes
- Add TTS marketplace category
- Limit Supertonic TTS runtime packaging
- Add Supertonic TTS provider plugin (#119)
- Fix dictation ellipsis cleanup (#116)
- Reduce short repeated transcription phrases (#117)
- Fix plaintext insertion for email clients (#118)
- Fix live preview font size updates (#114)
- Fix dictionary usage count localization (#115)
- Fix live transcript no-speech handling (#113)
May 12, 2026
TypeWhisper 1.4.0-rc3 refreshes the 1.4 release-candidate line from current main (855a5823). This candidate focuses on lower-latency dictation starts, workflow engine overrides, audio diagnostics, model/plugin refreshes, licensed term packs, watch-folder file jobs, and memory extraction fixes since 1.4.0-rc2.
Highlights
- Added the OpenAI voice refresh for TypeWhisper 1.4 and refreshed the Qwen3 ASR plugin for 1.1.0.
- Reduced hotkey recording start latency and cold-start latency.
- Fixed exact modifier-combo hotkey matching.
- Added dictation workflow engine overrides.
- Removed legacy profile runtime rules.
- Added support diagnostics export for audio device failures.
- Allowed USB input selection without preflight capture.
- Added licensed industry term packs.
- Fixed Parakeet startup model selection status.
- Fixed memory extraction for normal dictations.
- Added watch folder file-job automation.
Testing Focus
- Verify hotkey start/cold-start latency and exact modifier combinations.
- Smoke-test dictation workflow engine overrides and normal dictation memory extraction.
- Export audio diagnostics and select USB input devices without preflight capture.
- Check OpenAI, Qwen3, and Parakeet plugin/model settings.
- Download and apply licensed industry term packs.
- Run watch folder file-job automation.
- Confirm the RC release publishes DMG/ZIP assets and updates Sparkle on the
release-candidatechannel.
May 12, 2026
Bug Fixes
- remove legacy profile runtime rules (#502)
- reduce hotkey cold-start latency (#497)
- reduce hotkey recording start latency
Other Changes
- Add watch folder file job automation (#514)
- Fix memory extraction for normal dictations (#512)
- Add SDK auth role status for OpenAI transcription (#511)
- Add audio diagnostics export (#510)
- Allow USB input selection without preflight capture (#508)
- Add dictation workflow engine overrides (#509)
- Add licensed industry term packs
- Fix Parakeet startup model selection status (#501)
- Fix scheduled app test CI (#500)
- Refresh Qwen3 ASR plugin for 1.1.0
- Clarify GPL and commercial licensing copy
- Fix exact modifier combo matching
- Add OpenAI voice refresh for TypeWhisper 1.4
May 11, 2026
TypeWhisper 1.3.3 is a stable hotfix release for the 1.3 line. It fixes selected USB audio interfaces still being marked as not compatible before preview or recording can start.
Bug Fixes
- Fixed USB microphones and audio interfaces, including RME Babyface-style devices, being blocked by selection-time compatibility checks.
- Kept non-Bluetooth input selection non-blocking so TypeWhisper can reach the input-only HAL preview and recording path added in
1.3.2. - Preserved Bluetooth route activation, stabilization, and aggregate-route validation behavior.
Testing Focus
- If a USB interface still showed
(Not compatible)in TypeWhisper1.3.2, test selecting it again in1.3.3. - After selection, use
Test Microphoneto confirm the device now reaches preview capture.
May 11, 2026
May 11, 2026
Other Changes
- Add Windows indicator style variants (#102)
- Add licensed industry term packs
May 11, 2026
TypeWhisper 1.3.2 is a stable hotfix release for the 1.3 line. It fixes selected USB audio interfaces being marked as not compatible when the selected input and current output device use different clocks or sample rates.
Bug Fixes
- Fixed selected USB microphones and audio interfaces that could show
(Not compatible)before preview or recording started. - Routed explicit non-Bluetooth input devices through an input-only CoreAudio HAL capture path, avoiding stale AVAudioEngine full-duplex formats when input and output clocks differ.
- Kept the existing AVAudioEngine route for system default and Bluetooth inputs, preserving the established Bluetooth stabilization and recovery behavior.
- Downmixed multi-channel input buffers to mono before the existing 16 kHz transcription conversion path.
Testing Focus
- If a USB interface such as an RME Babyface, Elgato Wave XLR, or similar device showed
(Not compatible)in TypeWhisper1.3.1, test selecting it again in1.3.2. - If the device still fails, also test with the interface set as the macOS default input and with input/output sample rates aligned at 48 kHz.
May 10, 2026
May 9, 2026
Bug Fixes
- reduce hotkey recording start latency
Other Changes
- Refresh Qwen3 ASR plugin for 1.1.0
- Clarify GPL and commercial licensing copy
- Fix exact modifier combo matching
- Add OpenAI voice refresh for TypeWhisper 1.4
May 9, 2026
Bug Fixes
- enable touch scroll on all ScrollViewer controls (#99)
May 8, 2026
Other Changes
May 7, 2026
TypeWhisper v0.7.2
TypeWhisper 0.7.2 is a stability and polish release for Windows. It improves dictation reliability, adds Japanese localization, makes the preview bubble easier to control, and hardens the newest workflow and paste behavior from the 0.7 line.
New Features
- Added Japanese localization for the Windows app.
- Added an auto-hide setting for the preview bubble.
- Added manual workflow controls and advanced workflow triggers on Windows.
Improvements
- Kept the preview bubble compact and scrolled so longer live transcription output stays easier to follow.
- Improved Parakeet transcription tail handling with optional tail padding, better final-result selection, and internal diagnostics for clipped endings.
- Added audio tail telemetry around recording stop so hard-to-reproduce tail clipping can be diagnosed more clearly.
Fixes
- Reduced repeated transcription phrases in final dictation output.
- Fixed paste handoff latency so inserted text is less likely to race with the target app.
- Preserved clean no-speech handling for short and quiet clips while improving final transcription padding.
May 7, 2026
TypeWhisper 1.4.0-rc2 refreshes the 1.4 release-candidate line on the current main branch. This candidate is based on e9808037 and focuses on workflow ordering, recorder reliability, plugin SDK polish, and integration fixes since 1.4.0-rc1.
Highlights
- Added drag-and-drop ordering for workflows, with matching priority following the saved workflow order.
- Kept workflow reordering disabled while search is filtering the list, so filtered subsets do not rewrite global priority by accident.
- Added combined workflow trigger support and spoken-punctuation compatibility coverage for older workflow behavior.
- Improved recorder reliability for FaceTime/system audio sample handling and selected USB microphones with mismatched output clocks.
- Fixed Gemma plugin cache recovery after failed downloads.
- Added structured speaker diarization support for AssemblyAI through the Plugin SDK path.
- Tightened the 1.4 community plugin registry automation and permissions.
- Replaced spoken-language flag icons with neutral language badges.
- Fixed fixed-display indicators in fullscreen Spaces and clarified the German recorder settings label.
Testing Focus
- Verify workflow drag reorder affects matching priority and remains disabled while filtering.
- Smoke-test combined trigger workflows and spoken-punctuation compatibility controls.
- Record with selected USB microphones and system audio or FaceTime paths.
- Retry a Gemma model download after failure and verify cache recovery.
- Check AssemblyAI diarization settings and structured speaker labels.
- Confirm the 1.4 community registry feed and Integrations tabs still resolve plugins correctly.
- Verify fixed-display indicators in fullscreen Spaces.
May 7, 2026
Bug Fixes
- disambiguate German recorder settings label
- support combined workflow triggers (#466)
Other Changes
- Fix fixed-display indicators in fullscreen spaces
- Add Swift CodeQL quality workflow
- Set AssemblyAI minimum host version for diarization release
- Add structured speaker diarization SDK path
- Fix selected USB mic capture with mismatched output clocks (#474)
- Fix system audio recorder sample handling
- Replace spoken language flags with neutral badges
- Limit community registry workflow permissions
- Add community plugin registry workflow
- Refactor first-party plugin sources into the SDK package (#464)
- Mirror v1 plugin releases to community registry (#462)
- Refactor RTF clipboard formatting (#463)
- Add xAI Grok plugin (#461)
May 6, 2026
Bug Fixes
- disambiguate German recorder settings label
- support combined workflow triggers (#466)
Other Changes
- Fix system audio recorder sample handling
- Replace spoken language flags with neutral badges
- Limit community registry workflow permissions
- Add community plugin registry workflow
- Refactor first-party plugin sources into the SDK package (#464)
- Mirror v1 plugin releases to community registry (#462)
- Refactor RTF clipboard formatting (#463)
- Add xAI Grok plugin (#461)
May 5, 2026
May 5, 2026
May 4, 2026
TypeWhisper 1.4.0-rc1 opens the 1.4 release-candidate line with the first community-plugin preparation work and a clearer Integrations hub. This work is intentionally scoped to 1.4; the 1.3.x line continues to use the existing official plugin registry feed.
Highlights
- Added the
plugins-community-v1.jsonregistry feed for1.4preview and stable builds. - Kept
1.3.xbuilds on the existingplugins-v1.jsonmarketplace feed. - Added registry
sourcemetadata so official and community plugins can be separated in the app UI. - Redesigned Integrations around a compact grouped list with dedicated Installed, Discover, and Manual tabs.
- Added source, hosting, and multi-capability badges plus compact capability filtering so bundled, marketplace, community, manual, local, cloud, transcription, LLM, action, and memory integrations can be scanned together.
- Exposed user workflows to plugins through read-only Plugin SDK snapshots for community plugin use cases such as workflow-aware post-processing.
- Removed the 1.3-era Legacy cleanup surface from Settings. The 1.4 settings navigation goes directly to Workflows without a Legacy migration page.
Testing Focus
- Verify official marketplace plugins still resolve from
plugins-v1.jsonon1.3.x. - Verify
1.4.0-rc*,1.4.0daily, and1.4.0stable builds resolve fromplugins-community-v1.json. - Smoke-test the Integrations hub tabs, Discover search, Community toggle, capability filter menu, multi-capability badges, manual bundle installation, and incompatible bundle warnings.
- Verify plugins can read
host.availableWorkflowswithout receiving internal app model objects. - Verify the Settings sidebar no longer shows a Legacy page and old profile/prompt navigation requests land in Workflows.
May 4, 2026
May 4, 2026
TypeWhisper 1.3.1 is a stable maintenance release for the 1.3 line. It brings the workflow improvements from the 1.3.1 release candidates to stable users, adds direct workflow text hotkeys and a spoken-language dictation template, improves plugin metadata, and includes reliability fixes for Deepgram code-switching, Bluetooth audio, indicators, local LLM cleanup, and large automation requests.
New Features
- Added an Apple Translate workflow processor for local translation workflows on supported macOS configurations.
- Added a manual workflow trigger so workflows can stay out of automatic dictation matching and run from the Workflow Palette instead.
- Added a recorder toggle in the menu bar for faster access to app audio recording.
- Added packaging support for the bundled Filler Words plugin.
- Added a workflow dictation template for spoken-language transformations.
- Added direct workflow text hotkeys so workflows can run on selected or supplied text without starting a dictation.
Bug Fixes
- Exposed Deepgram multilingual code-switching support so mixed-language dictation can use the provider capability correctly.
- Reduced workflow post-processing latency so prompt and processor workflows return results faster after transcription.
- Wrapped Apple Intelligence workflow input so dictated text is treated as source content instead of executable instructions.
- Stabilized Bluetooth dictation capture and guarded AirPods output volume during audio routing changes.
- Opted indicators out of fullscreen spaces to avoid unexpected overlay behavior when apps use fullscreen mode.
- Restored indicator visibility across macOS spaces.
- Disabled automatic Live Transcript opening by default.
- Auto-unloaded local LLM post-processing resources after use to reduce background memory pressure.
- Supported large CLI transcriptions without truncating or failing oversized requests.
- Kept the language list available before transcription plugins finish loading.
- Fixed a Parakeet live preview fallback race.
Other Changes
- Classified plugin hosting explicitly so local and cloud plugins appear with clearer metadata in plugin listings.
- Documented plugin hosting metadata in the SDK manifest guidance.
- Updated Parakeet vocabulary boosting capability detection for current local-model behavior.
- Expanded automated coverage for workflow text hotkeys, spoken-language workflow templates, plugin hosting compatibility, and Deepgram multilingual code-switching.
May 3, 2026
May 2, 2026
TypeWhisper 1.3.1-rc2 is planned as the final release candidate before 1.3.1. This update rounds out the 1.3.1 line with spoken-language workflow dictation templates, direct text hotkeys for workflows, and clearer plugin hosting metadata before the stable release.
New Features
- Added a workflow dictation template for spoken-language transformations.
- Added direct workflow text hotkeys so workflows can run on selected or supplied text without starting a dictation.
Bug Fixes
- Classified plugin hosting explicitly so local and cloud plugins appear with clearer metadata in plugin listings.
Other Changes
- Documented plugin hosting metadata in the SDK manifest guidance.
- Expanded automated coverage for workflow text hotkeys, spoken-language workflow templates, and plugin hosting compatibility.
May 2, 2026
May 1, 2026
TypeWhisper 1.3.1-rc1 starts the 1.3.1 release-candidate line with new workflow processors, manual workflow controls, plugin improvements, and a focused reliability pass for Bluetooth audio, indicators, local LLM post-processing, and large automation requests.
New Features
- Added an Apple Translate workflow processor for local translation workflows on supported macOS configurations.
- Added a manual workflow trigger so workflows can stay out of automatic dictation matching and run from the Workflow Palette instead.
- Added a recorder toggle in the menu bar for faster access to app audio recording.
Bug Fixes
- Reduced workflow post-processing latency so prompt and processor workflows return results faster after transcription.
- Wrapped Apple Intelligence workflow input so dictated text is treated as source content instead of executable instructions.
- Stabilized Bluetooth dictation capture and guarded AirPods output volume during audio routing changes.
- Opted indicators out of fullscreen spaces to avoid unexpected overlay behavior when apps use fullscreen mode.
- Disabled automatic Live Transcript opening by default.
- Auto-unloaded local LLM post-processing resources after use to reduce background memory pressure.
- Supported large CLI transcriptions without truncating or failing oversized requests.
- Kept the language list available before transcription plugins finish loading.
Other Changes
- Updated Parakeet vocabulary boosting capability detection for current local-model behavior.
- Added packaging support for the bundled Filler Words plugin.
May 1, 2026
New Features
- add Apple Translate workflow processor (#427)
- add recorder menu bar toggle (#414)
- add manual workflow trigger (#411)
Bug Fixes
- opt indicators out of fullscreen spaces (#434)
- wrap Apple Intelligence workflow input (#432)
- disable live transcript auto-open by default (#430)
- auto-unload local LLM post-processing (#425)
- reduce workflow post-processing latency (#426)
- update Parakeet vocabulary boosting capability (#423)
- support right-side modifier hotkeys (#413)
- support large CLI transcriptions (#410)
- keep language list available before plugins load (#404)
- guard AirPods output volume during audio routing (#403)
Other Changes
April 2026
April 30, 2026
New Features
- add Apple Translate workflow processor (#427)
- add recorder menu bar toggle (#414)
- add manual workflow trigger (#411)
Bug Fixes
- auto-unload local LLM post-processing (#425)
- reduce workflow post-processing latency (#426)
- update Parakeet vocabulary boosting capability (#423)
- support right-side modifier hotkeys (#413)
- support large CLI transcriptions (#410)
- keep language list available before plugins load (#404)
- guard AirPods output volume during audio routing (#403)
Other Changes
April 28, 2026
April 26, 2026
April 25, 2026
TypeWhisper v0.7.0
TypeWhisper 0.7.0 is a major workflow and automation release for Windows. It brings the workflow system into the main settings experience, expands file transcription automation, adds new provider capabilities, and tightens dictation reliability for everyday use.
New Features
- Added the unified Workflows surface for prompt actions, matching rules, workflow-specific hotkeys, language and model overrides, output formatting, and action routing.
- Added batch file transcription queues with per-file status tracking, export options, and improved model selection behavior.
- Added watch folder transcription so folders can be monitored and processed automatically with configurable export output.
- Added the recent transcriptions palette for quickly finding, editing, copying, and reusing recent dictation results.
- Added the ElevenLabs transcription plugin, including streaming support and localized settings.
- Added TTS provider plugin support, including host services, playback sessions, voice metadata, and a built-in Windows SAPI provider.
- Added stable, release-candidate, and daily update channel support in the app and release packaging.
Improvements
- Expanded the local HTTP API and bundled CLI with per-request engine and model overrides, language hints, translation targets, dictionary term management, and dictation controls.
- Improved automation compatibility for external tools through richer API parsing, status responses, and CLI installation support.
- Improved file transcription handling for larger queues, subtitle export, media formats, and watch folder output.
- Improved plugin marketplace and plugin host behavior so bundled and external plugins can expose more capabilities consistently.
- Improved short and quiet speech handling with configurable padding and gain behavior.
- Refined README, issue templates, PR templates, package dry runs, plugin smoke checks, and release workflow coverage.
Fixes
- Fixed dictation tail clipping so the end of short recordings is less likely to be cut off.
- Hardened automatic paste reliability after transcription completes.
- Fixed prompt provider selection and related history/prompt provider race conditions.
- Fixed autostart toggle handling.
- Hardened watch folder exception handling so background folder processing is more resilient.
April 25, 2026
TypeWhisper 1.3 is the stable macOS release centered on the new Workflows experience, more capable automation surfaces, and a broad round of dictation, UI, and local-model reliability work across the 1.3.0-rc* cycle.
Highlights
- Unified Workflows - TypeWhisper now brings prompt actions and matching rules into one dedicated Workflows surface, with a native editor, wizard flow, and a global
Alwaysfallback trigger. - Better automation control - The local HTTP API and CLI now support per-request STT engine and model selection, while multilingual language hints make mixed-language dictation easier to steer.
- More polished dictation feedback - Spoken feedback, adjustable transcript sizing, faster recent-transcription recovery, and grouped menu bar actions make day-to-day use smoother.
- Stronger local-model path - The
1.3cycle hardens local plugin downloads, Hugging Face token handling, model restore behavior, and Parakeet language filtering.
Workflows & Prompt Processing
- Replaced the old Prompt Actions and Rules settings surfaces with the new unified Workflows experience.
- Added a native macOS sidebar and wizard-style create/edit flow for workflows.
- Added a global
Alwaysworkflow trigger that acts as the fallback after website and app-specific matches. - Hardened workflow prompts so dictated text is always treated as source text to transform, not instructions to execute.
- Added per-workflow temperature controls for supported LLM providers.
- Preserved focus in the original target app during local prompt processing.
Dictation, Feedback & UI Polish
- Added spoken feedback through the bundled
System Voiceplugin, scoped to transcription readback only. - Added multilingual language hints with multi-select search and selected-count feedback.
- Added shortcuts for reopening recent transcriptions and a recovery palette for recent results.
- Added adjustable live transcript preview sizing for dictation overlays.
- Improved the notch and overlay indicator behavior, including active-screen re-anchoring and timer layout fixes.
- Grouped menu bar actions into clearer sections for faster navigation.
- Fixed Fn hotkey handling so both press-and-release and press-and-hold strategies work reliably.
Local Models, Plugins & Reliability
- Upgraded the local model runtime and download flows for bundled on-device plugins.
- Added shared Hugging Face token handling for the bundled local-model plugins and clearer validation where required.
- Fixed Parakeet language filtering against the current FluidAudio upstream behavior.
- Improved WhisperKit restore and unload handling.
- Split the marketplace feeds by compatibility line and hardened plugin loading so incompatible external bundles are skipped instead of breaking launch.
- Made dictionary prompts engine-aware and kept streaming dictionary terms working across AssemblyAI, Soniox, and SpeechAnalyzer.
- Added subtitle exports for Watch Folder jobs.
Power Users & Automation
- Added per-request STT engine and model selection through the local HTTP API and the
typewhisperCLI. - Kept the documented
/v1/*API and CLI command surface stable for the1.xline. - Improved plugin compatibility validation and diagnostics for marketplace and bundled plugins.
April 25, 2026
Other Changes
- Add WhisperKit lifecycle regression tests (#401)
April 25, 2026
Other Changes
- Add recent transcriptions palette (#72)
- Add short and quiet clip handling controls
- Add TTS provider plugin support
- Add advanced API and CLI settings
- Add batch file transcription queue
- Harden watch folder exception handling
- Implement watch folder file transcription
- Add Windows workflow automation
- Add Raycast-compatible Windows API support
- Add About update channel selector
- Align automation API and CLI
April 24, 2026
TypeWhisper 1.3.0-rc5 is planned as the final release candidate before 1.3.0. This update tightens the new Workflows experience with a global fallback trigger and safer prompt boundaries, adds a round of menu bar and indicator polish, and refreshes the local-model path with the Parakeet language-filter fix plus shared Hugging Face token handling.
New Features
- Added a global
Alwaysworkflow trigger that acts as the fallback after website and app-specific workflow matches. - Grouped the menu bar actions into clearer sections so the most common dictation and app actions are faster to scan.
Bug Fixes
- Hardened workflow prompts so dictated text is treated as source content to transform instead of instructions to execute.
- Preserved focus in the original target app during local prompt processing so on-device workflows do not foreground TypeWhisper unexpectedly.
- Fixed notch timer wrapping so longer timers stay readable without breaking the indicator layout.
- Fixed Parakeet language filtering to match the current FluidAudio upstream behavior more reliably.
Other Changes
- Added a shared Hugging Face token helper for the bundled local-model plugins, including Parakeet, Gemma 4, Granite, Qwen3, Voxtral, and WhisperKit.
- Expanded automated coverage around workflow matching, workflow prompt boundaries, menu bar indicator settings, notch layout, and plugin manifest validation.
April 24, 2026
Other Changes
- Add Raycast-compatible Windows API support
- Add About update channel selector
- Align automation API and CLI
April 23, 2026
TypeWhisper 1.3.0-rc4 makes the new unified Workflows experience the headline feature of the 1.3 release-candidate line, backed by watch-folder subtitle exports, adjustable live transcript sizing, recent-transcription shortcuts, and a round of dictation, indicator, and WhisperKit fixes since rc3.
New Features
- Replaced the old Prompt Actions and Rules settings surfaces with the new unified Workflows experience, including the new dedicated settings area for creating, editing, and organizing workflows.
- Added subtitle exports for Watch Folder jobs.
- Added an adjustable live transcript preview size for dictation overlays.
- Added shortcuts for reopening recent transcriptions.
Bug Fixes
- Restored the prompt palette shortcut when the default provider is unavailable.
- Fixed active-screen indicator placement so overlays re-anchor more reliably.
- Reduced recorder live preview lag during dictation.
- Improved WhisperKit restore and unload handling.
Other Changes
- Added the German Medicine community term pack.
April 23, 2026
Bug Fixes
- restore prompt palette shortcut when default provider is unavailable
Other Changes
- Add adjustable live transcript preview size
- Fix active screen indicator placement
- Fix WhisperKit restore and unload handling
- remove duplicate settings sidebar toggle
- replace prompts and rules with workflows
- add missing 1.3.0 rc2 and rc3 release notes
April 22, 2026
Bug Fixes
- restore prompt palette shortcut when default provider is unavailable
Other Changes
- add missing 1.3.0 rc2 and rc3 release notes
April 21, 2026
TypeWhisper 1.3.0-rc3 refreshes the 1.3 release-candidate line with the redesigned prompt workflow, per-prompt temperature controls, a recent-transcription recovery palette, and the new post-update licensing prompt, plus a round of dictation, dictionary, and plugin-loading fixes since rc2.
New Features
- Redesigned Prompt Actions settings with a native macOS sidebar and a wizard-style create/edit flow.
- Added per-prompt temperature controls for supported LLM providers, including recommended defaults for the built-in presets.
- Added a recent transcription recovery palette so you can quickly reopen and reinsert recent results.
- Added a post-update licensing prompt that routes upgraded installs into the refreshed licensing flow.
Bug Fixes
- Refined quiet-dictation and short-speech handling for more reliable low-signal recordings.
- Made dictionary prompts engine-aware so providers can clip global dictionary terms to their documented limits without destabilizing the session.
- Isolated disabled plugins so they stay registered without being eagerly loaded, which reduces startup and settings regressions from incompatible bundles.
- Removed the duplicate settings sidebar button on macOS 14.
Other Changes
- Documented the new
DictionaryTermsBudgetProvidingplugin SDK hook for engines with explicit dictionary-term limits.
April 21, 2026
New Features
- add prompt temperature controls and lazy plugin loading
Bug Fixes
- remove duplicate macOS 14 settings sidebar button
April 19, 2026
TypeWhisper 1.3.0-rc2 rolls up the stabilization work that landed after rc1, with a focus on push-to-talk reliability, short-speech handling, audio recovery hardening, and the refreshed local-model and licensing flows.
Reliability & Input Handling
- Fixed modifier-combo push-to-talk so transient
flagsChangedchurn does not stop recording before the final key release. - Added the default push-to-talk discard behavior when extra keys interrupt an active hold, while keeping the stop point predictable on the actual hotkey release.
- Added an advanced opt-in to transcribe more short and quiet clips instead of classifying them as no speech.
- Hardened audio recovery, preview recovery, and recording-abort cleanup around route changes and device loss.
- Delayed media playback resume after dictation to avoid premature restarts.
- Ignored benign startup configuration changes that should not trigger unnecessary recovery work.
Local Models & Plugins
- Upgraded the shared local-model runtime stack and added Gemma 4 support with clearer model-policy messaging.
- Improved local-model download, cancellation, and Hugging Face token flows across the bundled local plugins.
- Fixed Gemini model discovery and prompt-processing fallback handling.
Settings & UX
- Redesigned the in-app licensing settings flow.
- Aligned spoken-language picker labels and search behavior with the app locale.
April 19, 2026
New Features
- redesign the licensing settings flow
Bug Fixes
- ignore benign startup config change on recording start
- align language picker search with app locale
- bounded recovery + terminal error propagation
Other Changes
- Fix Gemini model discovery and prompt fallback handling
- Trigger website rebuilds for preview releases
April 17, 2026
TypeWhisper v0.6.1
TypeWhisper 0.6.1 is a focused stability release that tightens dictation behavior, file transcription model selection, provider state handling, and release packaging.
Improvements
- Improved provider and model state handling so the app is less likely to lose the selected configuration while switching between engines and prompts.
- Refined the bundled Groq integration with better LLM model selection and refresh behavior.
Fixes
- Fixed recorder-compatible hotkey parsing so captured shortcuts round-trip correctly through the UI and runtime hotkey pipeline.
- Fixed file transcription to load the model the user actually selected instead of falling back unexpectedly.
- Fixed the silent-stop overlay reset path and closed a streaming session leak that could leave overlay state behind after recording ended.
Release Prep
- Hardened the release workflow so app tags prefer curated notes from
docs/releases/and fall back to commit-generated notes only when no curated file exists. - Scoped previous-tag lookup to app tags only, preventing plugin tags from polluting app release note ranges.
- Prepared the Windows release metadata for
0.6.1, including CLI version reporting and Groq plugin metadata alignment forplugin-groq-v1.0.2.
April 17, 2026
TypeWhisper 1.3.0-rc1 opens the 1.3 release-candidate line. The release adds spoken feedback, per-request STT control through the HTTP API and CLI, multilingual language hints, and the new plugin-compatibility safeguards introduced after #327, on top of a set of hotkey, audio, and plugin reliability fixes.
New Features
- Spoken feedback (text-to-speech) with a new
System Voicemarketplace plugin that uses the built-in macOSAVSpeechSynthesizervoices. Voice and speed are configurable in the settings, and the feedback is scoped to transcription readback only. - Per-request STT engine and model selection through the HTTP API (
engine/modelon/v1/transcribe) and the CLI (typewhisper transcribe --engine <id> --model <id>). When omitted, the active profile defaults are used. - Multilingual language hints with a multi-select picker, inline search, and a selected-count indicator, wired through to the transcription providers.
Bug Fixes
- Restored GPT-5 and Codex compatibility for the bundled OpenAI plugin.
- Fixed Fn-hotkey handling so both press-and-release and press-and-hold strategies work reliably.
- Spoken feedback is now limited to transcription readback and no longer narrates unrelated UI events.
- Addressed the 1.2.3 review blockers (K1-K4, M1, M7, M8) in the release pipeline and plugin registry.
- Split the marketplace into legacy and
v1feeds, moved plugin and term-pack fetches out of startup, and hardened plugin loading so incompatible external bundles are skipped instead of breaking launch. - Hardened audio recovery during Bluetooth route changes to avoid crashes.
- Dictionary terms are forwarded through the streaming providers (AssemblyAI, Soniox, SpeechAnalyzer) without breaking the live session.
Other Changes
- Refactored the Qwen3 context-bias formatter for cleaner prompt injection and easier future tuning.
- Incremental live transcription, Google Cloud STT release mapping, and the multi-release plugin registry are carried forward from the late
1.2.xwork. - All current marketplace plugins now have
v1releases for the RC line, includingSystem Voiceas an installable TTS plugin.
April 17, 2026
New Features
- per-request STT engine/model selection (#317) (#325)
- add TTS providers for spoken feedback (#323)
- add multilingual language hints (#312) (#319)
Bug Fixes
- dedupe build-file UUID collision between Qwen3ContextBiasFormatter and MinimalIndicatorView
- limit spoken feedback to transcription readback (#324)
- address release 1.2.3 review blockers (K1-K4, M1/M7/M8) (#320)
- support Fn press/release hotkey strategies (#316)
- restore GPT-5 OpenAI plugin compatibility (#313)
Other Changes
April 16, 2026
New Features
- wire dictionary terms through transcription providers
Bug Fixes
- keep AssemblyAI streaming active with dictionary terms
Other Changes
- Update localized string catalog
- Allow optional profile scope and default fallback
- Fix PR guard bot handling
- Add Google Cloud STT plugin release mapping
- Keep plugin registry backward compatible
- Fix legacy plugin release migration
- Add multi-release registry and incremental live transcription
- fix audio recovery crash during Bluetooth route changes
- remove temporary plugin debug logging
- Remove deprecated activation option
- Allow empty correction replacements
- Add manual uninstall instructions to README
- Fix managed window activation on repeated menu bar opens
- Update localized string catalog ordering
- Fix #287 validate selected audio inputs on selection
April 14, 2026
TypeWhisper 1.2.2 is a stable maintenance release centered on the rebuilt OpenAI integration. The bundled OpenAI plugin is now presented as OpenAI / ChatGPT and can use either a traditional API key or a ChatGPT/Codex login for prompt processing.
Highlights
- OpenAI / ChatGPT login - the bundled OpenAI plugin now supports browser-based ChatGPT/Codex OAuth in addition to API-key auth.
- Better GPT-5 and Codex support - reasoning effort can be tuned directly in plugin settings, and the bundled model list now includes current GPT-5/Codex variants such as
GPT-5.3-Codex-Spark. - Fixed Codex response handling - the ChatGPT/Codex path now speaks the actual Codex Responses protocol, including the required request shape and SSE response parsing.
Fixes & Compatibility
- Fixed
400failures from the ChatGPT/Codex endpoint caused by the old request format. - Improved OpenAI OAuth compatibility for the browser login flow and Codex login import.
- Restored SDK compatibility for external plugins after extending the OpenAI chat helper, so older plugin builds keep loading correctly.
- The marketplace OpenAI / ChatGPT plugin now correctly requires TypeWhisper
1.2.2or newer.
April 14, 2026
Bug Fixes
- guard incompatible transcription engines on Intel
- disable macOS 14 settings sidebar animation
- show script/region variants in language pickers
- use curated notes for GitHub releases
Other Changes
- cover script and region language display variants
- update stable release wording
- update stable release wording in README
- rename release readiness guide
- hide legacy 1.1 filename in README
April 14, 2026
New Features
- add whisper.cpp marketplace plugin
Bug Fixes
- add new plugins to marketplace registry workflow
- clear whisper model selection on unload
Other Changes
- harden CI and prepare v0.6.0 release
- Refine settings shell and localization
- Implement custom protocol handling and improve Discord claim integration
- Add supporter Discord claims and live profile context
- Refactor structured settings pages
- Polish settings, wizard, and snippets UX
- Overhaul Windows settings UI
- Fix Win hotkey modifier handling
- Add vocabulary boosting for local Parakeet transcription
April 14, 2026
TypeWhisper 1.2 is the new stable macOS release, focused on speed, reliability, and day-to-day polish. Dictation starts noticeably faster, background CPU use is lower while idle, and audio behavior is more resilient on real-world setups like AirPods, Bluetooth devices, and changing input routes.
Highlights
- Much faster dictation start – metadata capture and browser URL resolution no longer block the critical startup path.
- Lower idle CPU usage – background listeners were tightened up to reduce unnecessary CPU activity while TypeWhisper is idle.
- More reliable audio behavior – better recovery after AirPods, Bluetooth profile switches, and
AVAudioEnginereconfiguration. - Better day-to-day usability – a new Minimal indicator, transcript preview controls, and a more guided Rules workflow make TypeWhisper easier to use.
Speed & Efficiency
- Reduced dictation start latency by moving metadata capture and browser URL resolution off the critical path.
- Reduced unnecessary background CPU usage while TypeWhisper is idle.
- Improved handling for very short clips, especially when streaming preview is active or Parakeet is selected.
- Fixed streaming-preview and no-speech-guard behavior so brief speech is retained more reliably.
Reliability
- Hardened audio engine startup and teardown around
AVAudioEnginerace conditions. - Improved recovery after AirPods and Bluetooth profile switches.
- Restored the selected Apple Speech model on app launch.
- Fixed engine startup so TypeWhisper auto-selects the configured engine more reliably.
- Fixed dictation start when no microphone is available, with a cleaner failure path.
- Fixed a hotkey event tap crash.
- Improved compatibility with remapped Hyperkey shortcuts and combo hotkeys, including Caps Lock edge cases.
- Fixed WhisperKit download progress so it stays attached to the selected model.
- Hardened bundled plugin installation and update flows so installed plugin bundles remain managed correctly.
Usability & Polish
- Added the new Minimal indicator style for a compact floating status view.
- Added a toggle to disable live transcript preview for Notch and Overlay indicators.
- Reframed Profiles as Rules with a guided editor for app- and website-based matching.
- Backported the modern settings sidebar so the updated navigation also works on macOS 14.
- Added an app visibility picker to choose between menu bar, Dock, or Dock-only-while-window-open behavior.
- Polished the notch indicator layout and matched its height more closely to the display safe area.
- Fixed multiple German localization issues and other small copy and layout regressions.
- Avoided unwanted clipboard history writes when clipboard preservation is enabled.
Add-ons & Platform Updates
- Added optional HuggingFace token support to the bundled Qwen3, Granite, and Voxtral MLX plugins for higher download limits and clearer download errors.
- Added the new Gemma 4 local LLM plugin via MLX for on-device prompt processing.
- Improved Gemma 4 prompt processing and bumped the bundled Gemma 4 plugin to
1.0.1. - Localized built-in term pack metadata in English and German.
- Added the supporter Discord claim flow for eligible supporters.
Power Users & Developers
- Added dictation session IDs to API responses for local automation workflows.
- Added GPT-5 token parameter support to the OpenAI chat helper for SDK and plugin integrations.
- Improved plugin load diagnostics with structured OSLog logging.
April 13, 2026
New Features
- improve Gemma4 prompt processing
- polish notch indicator layout and presentation (#257)
- backport settings sidebar to macOS 14
Bug Fixes
- keep WhisperKit download progress on selected model
- restore minimal indicator sources after merge
- keep installed dev plugins from deleting themselves
- add words label to usage tooltip
Other Changes
- add 1.2.0-rc5 release notes
- add PR guard workflow and CODEOWNERS
- enable Gemma4 plugin releases
- Add missing German localizations
- Fix hotkey event tap crash (Fixes #263)
- Add Gemma 4 local LLM plugin via MLX (#193)
- recreate build-release before logging (#261)
- Fix German localization for recording status feedback
April 13, 2026
Bug Fixes
- restore plugin ABI compatibility and log install failures
Other Changes
- Prepare v1.2.0-rc2 release notes and docs
- Avoid idle MediaRemote listener CPU usage
April 12, 2026
Bug Fixes
- ignore caps lock origin for combo hotkeys
- use OSLog interpolation for plugin load errors
- harden plugin installs and prepare Parakeet 1.2.3 release
- auto-select configured engine (#228)
Other Changes
- Prepare v1.2.0-rc2 release notes and docs
- Fix dictation start when no microphone is available
- Support GPT-5 token parameters in OpenAI chat helper
- Add supporter Discord claim flow (#232)
April 11, 2026
New Features
Bug Fixes
- harden audio engine startup for AirPods profile changes (#224)
- support remapped hyperkey hotkeys (#223)
- reduce dictation start latency for issue 212
- localize built-in term packs (#220)
- honor streaming preview in no-speech guard (#214)
Other Changes
- refresh 1.2 rc1 release checklist (#227)
- update 1.1 release docs (#226)
- Fix AVAudioEngine teardown race in stopRecording (#209) (#210)
- Bump plugin versions: Qwen3 1.0.14, Granite 1.0.1, Voxtral 1.0.4
- Fix EXC_BAD_ACCESS crash in AppDelegate.windowDidBecomeKey (#208)
- Add HuggingFace token support and fix error messages for MLX plugins (#207)
- Fix AVAudioEngine race condition crash in stopRecording (#204)
- Disable SWIFT_EMIT_LOC_STRINGS for plugin targets to prevent Xcode from modifying String Catalog
- Suppress indicator streaming text when plugin provides own display (#203)
- Add lifetime licenses and supporter tiers for private users (#202)
April 11, 2026
Other Changes
- Fix AVAudioEngine teardown race in stopRecording (#209) (#210)
- Bump plugin versions: Qwen3 1.0.14, Granite 1.0.1, Voxtral 1.0.4
- Fix EXC_BAD_ACCESS crash in AppDelegate.windowDidBecomeKey (#208)
- Add HuggingFace token support and fix error messages for MLX plugins (#207)
- Fix AVAudioEngine race condition crash in stopRecording (#204)
- Disable SWIFT_EMIT_LOC_STRINGS for plugin targets to prevent Xcode from modifying String Catalog
- Suppress indicator streaming text when plugin provides own display (#203)
- Add lifetime licenses and supporter tiers for private users (#202)
April 10, 2026
Other Changes
- Fix EXC_BAD_ACCESS crash in AppDelegate.windowDidBecomeKey (#208)
- Add HuggingFace token support and fix error messages for MLX plugins (#207)
- Fix AVAudioEngine race condition crash in stopRecording (#204)
- Disable SWIFT_EMIT_LOC_STRINGS for plugin targets to prevent Xcode from modifying String Catalog
- Suppress indicator streaming text when plugin provides own display (#203)
- Add lifetime licenses and supporter tiers for private users (#202)
April 9, 2026
Other Changes
- Fix AVAudioEngine race condition crash in stopRecording (#204)
- Disable SWIFT_EMIT_LOC_STRINGS for plugin targets to prevent Xcode from modifying String Catalog
- Suppress indicator streaming text when plugin provides own display (#203)
- Add lifetime licenses and supporter tiers for private users (#202)
April 9, 2026
Bug Fix
- Fixed marketplace not showing new plugins —
RegistryPlugindeserialization failed when optional fields were missing from the registry JSON. Switched from positional record to init properties with defaults.
April 8, 2026
What's New in v0.5.0
Major Architecture Change
- Replaced SQLite with JSON file storage — All data (history, dictionary, snippets, profiles, prompts) is now stored as simple JSON files. No more database migrations. Faster startup, easier debugging, and human-readable data files.
- Removed Microsoft.Data.Sqlite dependency — Smaller application footprint.
New Features
- Error Log — Structured error logging with categories (transcription, recording, prompt, plugin). View errors and export diagnostics in Settings → About.
- App-aware Formatting — Transcribed text is automatically formatted based on the target app: Markdown for Obsidian/Notion, HTML for Outlook, plain for code editors.
- Audio History — Recordings are now saved as WAV files alongside transcription history. Play back past recordings directly from the history view.
- Watch Folder — Monitor a folder for audio files and automatically transcribe them with sidecar .txt output.
- Memory Service — Automatically extract lasting personal facts from transcriptions using LLM. Stored via memory storage plugins.
- Spoken Feedback — Text-to-speech readback of transcribed text after each dictation (Settings → Advanced).
- Accessibility Announcements — Screen reader support via UI Automation live region events for Narrator, NVDA, and JAWS.
- System Audio Capture — WASAPI loopback capture for recording system audio output.
- CLI Tool —
typewhispercommand-line tool withstatus,models, andtranscribecommands. - License & Supporter Tiers — Polar.sh license activation with Bronze/Silver/Gold supporter badges.
- Release Channels — Stable, Release Candidate, and Daily update channels.
- Audio Recorder — Standalone recording feature in Settings → Recorder with auto-transcription and recording history.
UI Improvements
- Redesigned Settings Sidebar — Restructured to match Mac app: Home, General, Recording, Models, Recorder, History, Dictionary, Snippets, Profiles, Prompts, Integrations, Advanced, License, About.
- Dashboard Enhancements — Trend indicators (↑↓%) on all stat cards, "All Time" period option, recent transcriptions section.
- Overlay Redesign — Centered text layout, subtle border, larger font, more padding. Matches the new preview in Settings.
- Overlay Preview — Live preview in General settings showing actual widgets, updates in real-time when changing widget selections.
- Marketplace Grouping — Plugins are now grouped by category (Transcription Engines, LLM Providers, Memory, Actions, Utilities) with count badges.
New Plugins (14)
LLM Providers: Claude (Anthropic), OpenRouter, Cerebras, Cohere, Fireworks Transcription Engines: Gladia, Speechmatics, Soniox, Cloudflare ASR, Google Cloud STT, Qwen3 STT, Voxtral (Mistral) Memory Storage: File Memory, OpenAI Vector Memory
Plugin SDK Enhancements
ActionCompletedEventfor tracking action plugin results- App context (process name, window title, URL) in transcription events
IMemoryStoragePlugininterface for memory storage backendsIPluginSettingsActivityfor progress reporting in plugin settingsSetStreamingDisplayActive()for plugins that handle their own streaming UICategoryandIsLocalfields in plugin manifest for marketplace grouping
Bug Fixes
- #21 — Added option to disable saving transcriptions to history (Settings → Advanced)
- #22 — Fixed intermittent hotkey toggle failures with 300ms debounce
- #24 — Hotkey events are now swallowed to prevent triggering media playback in background apps
- Hotkeys can now be cleared with Escape, Delete, or Backspace
- Timer and other widgets show proper fallback values in overlay preview
April 8, 2026
New Features
- Add vocabulary boosting to ParakeetPlugin, closes #129 (#130)
- Add V2/V3 model version selection to ParakeetPlugin, fixes #137 (#146)
- Add dynamic model fetching to GeminiPlugin, fixes #136 (#145)
- Add Google Cloud Speech-to-Text plugin (#147)
- Add dictionary JSON import and export (#148)
- Non-blocking model downloads (#149)
- Add configurable sound feedback settings, fixes #139 (#150)
- Add post-processing transparency to history and indicators, fixes #134 (#151)
- Record app audio with separate tracks, refs #141 (#152)
- Add community term pack registry (#154)
- Add Prompts & AI wizard step and cross-tab navigation, fixes #135 (#155)
- Add drag-and-drop reordering for prompts, fixes #153 (#156)
- Add media pause during recording, fixes #159 (#160, #163)
- Add mouse button shortcuts, fixes #171 (#172)
- Add Auto Enter profile setting, fixes #167 (#169)
- Add toggle to disable history saving (#168)
- Show both STT and AI-processed text in history, fixes #176 (#183)
- Sort marketplace plugins by download count, closes #186 (#187)
- Add per-profile translation toggle, fixes #199 (#201)
- Add business licensing with Polar.sh integration (#195)
- Add lifetime licenses and supporter tiers for private users (#202)
- Suppress indicator streaming text when plugin provides own display (#203)
New Plugins
- OpenRouterPlugin with model search, pricing, and credits display (#173)
- Soniox cloud transcription plugin (#174)
- Cohere cloud transcription plugin, fixes #177 (#185)
- Speechmatics cloud transcription plugin, fixes #192
Bug Fixes
- Detect Fn hotkey on release to avoid Fn+key conflicts, fixes #132 (#144)
- Update FluidAudio for Swift 6 actor safety, fixes #129 (#131)
- Fix community term pack loading state
- Fix ultrashort push-to-talk dictation (#157)
- Fix prompt toggle opening edit screen instead of toggling (#158)
- Fix notch indicator not displaying on other virtual desktops, fixes #165 (#166)
- Embed MediaRemoteAdapter framework in app bundle, fixes #159
- Fix 'No model loaded' error when engine plugin is still loading, fixes #182 (#184)
- Fix prompt toggle not responding to clicks, fixes #186 (#188)
- Fix recording started sound not playing, fixes #189 (#194)
- Fix silent recording on multi-channel audio interfaces (#197)
- Fix clipboard preservation toggle not working, fixes #198 (#200)
- Add error checking and logging for audio device selection, fixes #181 (#196)
Other Changes
- Remove App Store build support (#161)
- Disable implicit build dependencies to fix Xcode memory crashes
April 8, 2026
April 7, 2026
New Features
- Add vocabulary boosting to ParakeetPlugin, closes #129 (#130)
- Add V2/V3 model version selection to ParakeetPlugin, fixes #137 (#146)
- Add dynamic model fetching to GeminiPlugin, fixes #136 (#145)
- Add Google Cloud Speech-to-Text plugin (#147)
- Add dictionary JSON import and export (#148)
- Non-blocking model downloads (#149)
- Add configurable sound feedback settings, fixes #139 (#150)
- Add post-processing transparency to history and indicators, fixes #134 (#151)
- Record app audio with separate tracks, refs #141 (#152)
- Add community term pack registry (#154)
- Add Prompts & AI wizard step and cross-tab navigation, fixes #135 (#155)
- Add drag-and-drop reordering for prompts, fixes #153 (#156)
- Add media pause during recording, fixes #159 (#160, #163)
- Add mouse button shortcuts, fixes #171 (#172)
- Add Auto Enter profile setting, fixes #167 (#169)
- Add toggle to disable history saving (#168)
- Show both STT and AI-processed text in history, fixes #176 (#183)
- Sort marketplace plugins by download count, closes #186 (#187)
- Add per-profile translation toggle, fixes #199 (#201)
- Add business licensing with Polar.sh integration (#195)
New Plugins
- OpenRouterPlugin with model search, pricing, and credits display (#173)
- Soniox cloud transcription plugin (#174)
- Cohere cloud transcription plugin, fixes #177 (#185)
- Speechmatics cloud transcription plugin, fixes #192
Bug Fixes
- Detect Fn hotkey on release to avoid Fn+key conflicts, fixes #132 (#144)
- Update FluidAudio for Swift 6 actor safety, fixes #129 (#131)
- Fix community term pack loading state
- Fix ultrashort push-to-talk dictation (#157)
- Fix prompt toggle opening edit screen instead of toggling (#158)
- Fix notch indicator not displaying on other virtual desktops, fixes #165 (#166)
- Embed MediaRemoteAdapter framework in app bundle, fixes #159
- Fix 'No model loaded' error when engine plugin is still loading, fixes #182 (#184)
- Fix prompt toggle not responding to clicks, fixes #186 (#188)
- Fix recording started sound not playing, fixes #189 (#194)
- Fix silent recording on multi-channel audio interfaces (#197)
- Fix clipboard preservation toggle not working, fixes #198 (#200)
- Add error checking and logging for audio device selection, fixes #181 (#196)
Other Changes
- Remove App Store build support (#161)
- Disable implicit build dependencies to fix Xcode memory crashes
April 6, 2026
April 5, 2026
New Features
- add local Gemma 4 LLM plugin (LLamaSharp/GGUF)
Bug Fixes
- include NuGet dependencies and native runtimes in plugin ZIP
Other Changes
- add gemma-local to plugin publish workflow
April 5, 2026
New Features
- Silence hallucination filter — Two-layer detection (client-side energy gate + server-side
no_speech_prob) prevents cloud Whisper models from hallucinating random text on silent audio (#19) - Visual hybrid hotkey indicators — Toggle mode now shows amber overlay border and mode-aware status text ("Recording (Toggle) - press hotkey to stop") so you always know recording is active (#7)
- Plugin localization — All 6 cloud plugins (OpenAI, Groq, Gemini, Deepgram, AssemblyAI, OpenAI Compatible) now fully support English and German in their settings views (#14)
- IBM Granite Speech plugin — New local transcription engine available in the Marketplace
Bug Fixes
- App no longer freezes when no microphone is connected; shows feedback and auto-recovers when a mic is plugged in (#5)
- Groq plugin now has a 30-second HTTP timeout instead of infinite, preventing hangs on slow/unreachable API (#6)
- Prompt actions now show a clear warning when no LLM provider is configured, instead of silently doing nothing (#8)
- Translation is correctly skipped when the configured source language already matches the target language (#9)
- System language detection now uses the Win32 API directly, fixing incorrect language when launched via the Velopack installer (#15)
- Granite Speech plugin download now includes retry logic and validation (#4)
April 4, 2026
New Features
- Add vocabulary boosting to ParakeetPlugin, closes #129 (#130)
- Add V2/V3 model version selection to ParakeetPlugin, fixes #137 (#146)
- Add dynamic model fetching to GeminiPlugin, fixes #136 (#145)
- Add Google Cloud Speech-to-Text plugin (#147)
- Add dictionary JSON import and export (#148)
- Non-blocking model downloads (#149)
- Add configurable sound feedback settings, fixes #139 (#150)
- Add post-processing transparency to history and indicators, fixes #134 (#151)
- Record app audio with separate tracks, refs #141 (#152)
- Add community term pack registry (#154)
- Add Prompts & AI wizard step and cross-tab navigation, fixes #135 (#155)
- Add drag-and-drop reordering for prompts, fixes #153 (#156)
- Add media pause during recording, fixes #159 (#160, #163)
- Add mouse button shortcuts, fixes #171 (#172)
- Add Auto Enter profile setting, fixes #167 (#169)
- Add toggle to disable history saving (#168)
- Show both STT and AI-processed text in history, fixes #176 (#183)
- Sort marketplace plugins by download count, closes #186 (#187)
New Plugins
- OpenRouterPlugin with model search, pricing, and credits display (#173)
- Soniox cloud transcription plugin (#174)
- Cohere cloud transcription plugin, fixes #177 (#185)
Bug Fixes
- Detect Fn hotkey on release to avoid Fn+key conflicts, fixes #132 (#144)
- Update FluidAudio for Swift 6 actor safety, fixes #129 (#131)
- Fix community term pack loading state
- Fix ultrashort push-to-talk dictation (#157)
- Fix prompt toggle opening edit screen instead of toggling (#158)
- Fix notch indicator not displaying on other virtual desktops, fixes #165 (#166)
- Embed MediaRemoteAdapter framework in app bundle, fixes #159
- Fix 'No model loaded' error when engine plugin is still loading, fixes #182 (#184)
- Fix prompt toggle not responding to clicks, fixes #186 (#188)
Other Changes
- Remove App Store build support (#161)
April 4, 2026
Other Changes
- Add SonioxPlugin to plugin-release workflow, fixes #175
- Add Soniox cloud transcription plugin (#174)
- Add OpenRouterPlugin with model search, pricing, and credits display (#173)
- bump actions/checkout from 4 to 6 (#170)
- Add mouse button shortcuts, fixes #171 (#172)
- Add Auto Enter profile setting, fixes #167 (#169)
- Add toggle to disable history saving (#168)
April 2, 2026
April 1, 2026
New Features
- Add vocabulary boosting to ParakeetPlugin, closes #129 (#130)
- Add V2/V3 model version selection to ParakeetPlugin, fixes #137 (#146)
- Add dynamic model fetching to GeminiPlugin, fixes #136 (#145)
- Add Google Cloud Speech-to-Text plugin (#147)
- Add dictionary JSON import and export (#148)
- Non-blocking model downloads (#149)
- Add configurable sound feedback settings, fixes #139 (#150)
- Add post-processing transparency to history and indicators, fixes #134 (#151)
- Record app audio with separate tracks, refs #141 (#152)
- Add community term pack registry (#154)
- Add Prompts & AI wizard step and cross-tab navigation, fixes #135 (#155)
- Add drag-and-drop reordering for prompts, fixes #153 (#156)
- Add media pause during recording, fixes #159 (#160, #163)
Bug Fixes
- Detect Fn hotkey on release to avoid Fn+key conflicts, fixes #132 (#144)
- Update FluidAudio for Swift 6 actor safety, fixes #129 (#131)
- Fix community term pack loading state
- Fix ultrashort push-to-talk dictation (#157)
- Fix prompt toggle opening edit screen instead of toggling (#158)
- Fix notch indicator not displaying on other virtual desktops, fixes #165 (#166)
- Embed MediaRemoteAdapter framework in app bundle, fixes #159
Other Changes
- Remove App Store build support (#161)
April 1, 2026
Other Changes
- Embed MediaRemoteAdapter framework in app bundle, fixes #159
- Fix notch indicator not displaying on other virtual desktops, fixes #165 (#166)
- Add media pause during recording, fixes #159 (#163)
- Remove App Store build support (#161)
- Add media pause during recording, fixes #159 (#160)
- Fix prompt toggle opening edit screen instead of toggling (#158)
- Fix ultrashort push-to-talk dictation (#157)
March 2026
March 31, 2026
March 30, 2026
New Features
- Add vocabulary boosting to ParakeetPlugin, closes #129 (#130)
- Add V2/V3 model version selection to ParakeetPlugin, fixes #137 (#146)
- Add dynamic model fetching to GeminiPlugin, fixes #136 (#145)
- Add Google Cloud Speech-to-Text plugin (#147)
- Add dictionary JSON import and export (#148)
- Non-blocking model downloads (#149)
- Add configurable sound feedback settings, fixes #139 (#150)
- Add post-processing transparency to history and indicators, fixes #134 (#151)
- Record app audio with separate tracks, refs #141 (#152)
- Add community term pack registry (#154)
- Add Prompts & AI wizard step and cross-tab navigation, fixes #135 (#155)
- Add drag-and-drop reordering for prompts, fixes #153 (#156)
Bug Fixes
March 30, 2026
New Features
- add V2/V3 model version selection to ParakeetPlugin, fixes #137 (#146)
- add dynamic model fetching to GeminiPlugin, fixes #136 (#145)
- add vocabulary boosting to ParakeetPlugin, closes #129 (#130)
- option to preserve clipboard content after dictation (#128)
- add ElevenLabs transcription plugin (#125)
- add Gladia transcription plugin (#126)
- add Claude LLM plugin for Anthropic Claude API (#124)
- add Cerebras plugin for ultra-fast LLM inference (#123)
- add Fireworks AI plugin for cloud transcription and LLM (#122)
- automatic daily releases via cron schedule (#117)
- dynamic LLM model fetching for OpenAI & Groq plugins, fixes #113
Bug Fixes
- detect Fn hotkey on release to avoid Fn+key conflicts, fixes #132 (#144)
- update FluidAudio for Swift 6 actor safety, fixes #129 (#131)
- repair corrupted pbxproj from ElevenLabs/Gladia merge
- improve short recording recognition and add user feedback (#118)
- add German translations for About tab, fixes #114 (#116)
Other Changes
- Record app audio with separate tracks, refs #141 (#152)
- Add post-processing transparency to history and indicators, fixes #134 (#151)
- Add configurable sound feedback settings, fixes #139 (#150)
- Non-blocking model downloads (#149)
- Add dictionary JSON import and export (#148)
- Add Google Cloud Speech-to-Text plugin (#147)
- Update README video attachment (#121)
- Update README demo video (#119)
- update screenshot assets for improved documentation
March 29, 2026
New Features
- add vocabulary boosting to ParakeetPlugin, closes #129 (#130)
- option to preserve clipboard content after dictation (#128)
- add ElevenLabs transcription plugin (#125)
- add Gladia transcription plugin (#126)
- add Claude LLM plugin for Anthropic Claude API (#124)
- add Cerebras plugin for ultra-fast LLM inference (#123)
- add Fireworks AI plugin for cloud transcription and LLM (#122)
- automatic daily releases via cron schedule (#117)
- dynamic LLM model fetching for OpenAI & Groq plugins, fixes #113
Bug Fixes
- update FluidAudio for Swift 6 actor safety, fixes #129 (#131)
- repair corrupted pbxproj from ElevenLabs/Gladia merge
- improve short recording recognition and add user feedback (#118)
- add German translations for About tab, fixes #114 (#116)
Other Changes
March 28, 2026
TypeWhisper 1.0 is the first stable release - a system-wide dictation app for macOS that lets you choose your own transcription engine.
Highlights
Your Engine, Your Choice Pick from local engines (WhisperKit, Parakeet, Qwen3, Granite, SpeechAnalyzer) or cloud services (Groq, OpenAI, Deepgram, AssemblyAI, Cloudflare) - all as plugins you can install and swap freely.
System-Wide Dictation Four hotkey modes (hybrid, push-to-talk, toggle, prompt palette) with a notch-integrated recording indicator. Dictation works in any app - just press your hotkey and talk.
Prompt Processing Send transcriptions through LLM providers (Apple Intelligence, Groq, OpenAI, Gemini) with bundled presets for fixing grammar, summarizing, extracting structured data, and more. Create your own custom prompt actions.
Profiles App- and URL-based profiles that automatically switch language, engine, prompt, and output format depending on where you're typing.
Plugin System A full Plugin SDK with four extension points (transcription engines, LLM providers, post-processors, action plugins). Install plugins from the built-in marketplace or build your own.
New Features
- Option to preserve clipboard content after dictation (#127)
- Reorganize Settings tabs for cleaner UX (#111)
- VoiceOver support, spoken feedback, and read-back TTS (#105)
- Error display UI with notch banner and persistent error log (#109)
- Enhance hotkey handling to distinguish modifier releases from key releases
- Inline command detection per profile, fixes #84 (#87)
- Double-tap hotkey support, fixes #106 (#107)
- Audio recorder with mic and system audio capture (#98)
- Structured data prompt presets (#95)
- Watch folder for automatic audio transcription (#99)
- CGEventTap for hotkey event suppression, fixes #101
- App-aware auto-formatting with profile-based output format, fixes #55
- Category grouping and hosting filter in plugin marketplace
- Memory plugin system with LLM extraction and correction tracking
- Kimi K2 LLM model support
Bug Fixes
- Handle Space key and special characters in hotkey recorder, fixes #94
- Resolve Cmd+V/Cmd+C on non-QWERTY keyboard layouts, fixes #92
- Improve LiveTranscriptPlugin text handling and UI (v1.0.2)
- Add confirmation dialog before clearing all memories
- Thread safety, migration, and race condition fixes
Other Changes
- Prepare TypeWhisper for 1.0 readiness (#112)
- Granular key event states, improved suppression logic, key repeat handling
- FileMemoryPlugin and OpenAIVectorMemoryPlugin added to plugin CI workflow
- German translations for all memory plugin UI strings
- Simplified memory plugin system, eliminated duplication
- 30s extraction cooldown, skip memory injection for extraction calls
For Developers
- HTTP API (
/v1/*) for local automation typewhisperCLI embedded in app bundle- Plugin SDK with stable contracts for 1.x
- WidgetKit widgets for stats, activity, and history
System Requirements
- macOS 14.0 or later (SpeechAnalyzer plugin requires macOS 26)
- Apple Silicon or Intel Mac
March 27, 2026
New Features
- option to preserve clipboard content after dictation (#128)
- add ElevenLabs transcription plugin (#125)
- add Gladia transcription plugin (#126)
- add Claude LLM plugin for Anthropic Claude API (#124)
- add Cerebras plugin for ultra-fast LLM inference (#123)
- add Fireworks AI plugin for cloud transcription and LLM (#122)
- automatic daily releases via cron schedule (#117)
- dynamic LLM model fetching for OpenAI & Groq plugins, fixes #113
Bug Fixes
- repair corrupted pbxproj from ElevenLabs/Gladia merge
- improve short recording recognition and add user feedback (#118)
- add German translations for About tab, fixes #114 (#116)
Other Changes
March 26, 2026
New Features
- add ElevenLabs transcription plugin (#125)
- add Gladia transcription plugin (#126)
- add Claude LLM plugin for Anthropic Claude API (#124)
- add Cerebras plugin for ultra-fast LLM inference (#123)
- add Fireworks AI plugin for cloud transcription and LLM (#122)
- automatic daily releases via cron schedule (#117)
- dynamic LLM model fetching for OpenAI & Groq plugins, fixes #113
Bug Fixes
- repair corrupted pbxproj from ElevenLabs/Gladia merge
- improve short recording recognition and add user feedback (#118)
- add German translations for About tab, fixes #114 (#116)
Other Changes
March 25, 2026
March 24, 2026
New Features
- reorganize Settings tabs for cleaner UX (#111)
- VoiceOver support, spoken feedback, and read-back TTS (#105)
- add error display UI with notch banner fix and persistent error log (#109)
- Enhance hotkey handling to distinguish modifier releases from key releases
- inline command detection per profile, fixes #84 (#87)
- add double-tap hotkey support, fixes #106 (#107)
- add audio recorder with mic and system audio capture (#98)
- add Kimi K2 LLM model
- add structured data prompt presets (#95)
- add watch folder for automatic audio transcription (#99)
- implement CGEventTap for hotkey event suppression, fixes #101
- add app-aware auto-formatting with profile-based output format, fixes #55
- add category grouping and hosting filter to plugin marketplace
- add Memory plugin system with LLM extraction and correction tracking
Bug Fixes
- handle Space key and special characters in hotkey recorder, fixes #94
- resolve Cmd+V/Cmd+C on non-QWERTY keyboard layouts, fixes #92
- improve LiveTranscriptPlugin text handling and UI (v1.0.2)
- add confirmation dialog before clearing all memories
- address code review findings - thread safety, migration, races
Other Changes
- Prepare TypeWhisper for 1.0 readiness (#112)
- Enhance hotkey event processing by introducing granular key event states, improving suppression logic, and handling key repeats.
- add FileMemoryPlugin and OpenAIVectorMemoryPlugin to plugin CI workflow
- add German translations for all memory plugin UI strings
- simplify memory plugin system, eliminate duplication
- add 30s extraction cooldown and skip memory injection for extraction calls
March 18, 2026
What's New
CloudflareASR Plugin
Community contribution by @yunluyl - new transcription plugin for OpenAI-compatible ASR servers behind Cloudflare tunnels.
- Supports CF-Access service token authentication and optional Bearer token
- Qwen3-ASR output parsing (
<asr_text>tags) - Model fetching and manual model selection
Audio Engine Resilience
- Recording now automatically recovers when macOS interrupts the audio engine (e.g. notification sounds, Bluetooth device changes)
- Previously buffered audio is preserved during engine restart
Menu Bar Fix
- Windows opened from the menu bar (Settings, History) now properly appear in front instead of behind other windows
March 16, 2026
What's New
- History Audio Playback: Optionally save and replay audio recordings from transcription history
Bug Fixes
- Cloud Transcription Reliability: Fixed cloud API requests (Groq, OpenAI, Deepgram, etc.) hanging after sleep/wake or network changes. All plugins now use ephemeral HTTP sessions with 30s timeouts instead of persistent connections that go stale
- Indicator Preview: Fixed color scheme not applying correctly in IndicatorPreviewView (#86)
- MLX Plugins: Fixed Qwen3 and Voxtral plugins failing to load on macOS 15 due to missing Swift compatibility libraries
- Plugin CI: All plugins now build with Xcode 26.2 for consistent package resolution
March 16, 2026
What's New in v0.4.1
Welcome Wizard Redesign
- Dark theme — Migrated to FluentWindow with Mica backdrop, matching the rest of the app. Previously rendered with invisible text on a white background
- Plugin-based setup — Step 1 now dynamically lists all available marketplace extensions with install/installed status and shows transcription models from installed engines. No more hardcoded model selection
- Streamlined flow — Reduced from 5 steps to 4: Extensions & Model → Microphone → Hotkeys → Done. The obsolete "Cloud Providers" step has been removed (now handled by the plugin system)
- Re-run anytime — Added "Open Wizard" shortcut on the Dashboard to re-run the setup at any time
Bug Fixes
- Fixed all Welcome Window elements being invisible on the dark theme (#3)
March 16, 2026
March 15, 2026
What's New in v0.4.0
This is a major feature release bringing full feature parity with the Mac version.
Post-Processing Pipeline
- Refactored text processing into a priority-based pipeline architecture
- Plugin post-processors can now hook into the processing chain with custom priorities
- Action plugins (Linear, Obsidian) can receive processed text instead of inserting it
HTTP API (7 new endpoints)
GET /v1/history— Search history with paginationDELETE /v1/history— Delete history entriesGET /v1/profiles— List all profilesPUT /v1/profiles/toggle— Toggle profile on/offPOST /v1/dictation/start/stop— Remote recording controlGET /v1/dictation/status— Check recording state- Extended
/v1/statuswith streaming and translation capabilities
Model Auto-Unload
- Models are now automatically unloaded after a configurable idle timeout
- Saves memory when TypeWhisper is running but not actively used
- Configurable via
ModelAutoUnloadSecondssetting (0 = disabled)
History Export
- Added Markdown (
.md) and JSON (.json) export formats - Export dialog now supports 4 formats: Text, CSV, Markdown, JSON
New Plugins (available in Marketplace)
- Google Gemini — LLM provider supporting Gemini 2.5 Flash, Pro, and Flash Lite
- Linear — Create Linear issues directly from transcriptions via GraphQL API
- Obsidian — Save transcriptions as Markdown notes in Obsidian vaults with YAML frontmatter
- Script Runner — Pipe transcriptions through custom shell scripts (cmd/PowerShell)
- Live Transcript — Floating semi-transparent window showing real-time transcription
Per-Profile Hotkeys
- Each profile can now have a dedicated hotkey for instant activation
- Bypasses automatic app/URL matching — press the hotkey to dictate with that specific profile
- Configure via the new hotkey recorder in the profile detail panel
Audio Device Preview
- Real-time microphone level meter in audio settings
- Visual feedback when selecting or testing microphone devices
- Preview automatically starts/stops when navigating to/from the Recording section
Database
- Schema upgraded to v6 with new columns for action plugin routing, profile hotkeys, and model tracking
March 15, 2026
New Plugins
- LiveTranscriptPlugin - Real-time live transcript overlay during recording (#77)
New Features
- Redesigned Home Dashboard as Bento Grid layout (#78)
Bug Fixes
- Fix iPhone Continuity mic causing recording to immediately stop (#76)
- Fix memory corruption crash in audio level update pipeline
- Fix WhisperKit auto-detect language detection (#75)
- Show guidance when no LLM provider plugins are installed
Updates
- WhisperKitPlugin 1.0.14
March 13, 2026
New Plugins
- ScriptPlugin - Post-process transcribed text via shell commands (e.g.
tr,sed, custom scripts) - ObsidianPlugin - Auto-export transcriptions to Obsidian vaults with daily notes support
- DeepgramPlugin - Real-time WebSocket streaming transcription via Deepgram
- AssemblyAIPlugin - WebSocket v3 streaming transcription via AssemblyAI
New Features
- Overlay indicator system with customizable styles and position presets
- Static indicator preview and style picker in General Settings
- Configurable notch display for multi-monitor setups (#65)
- Dynamic app group identifier for codesigning (PR #73)
DEVELOPMENT_TEAMxcconfig for easier contributor setup (#67)
Bug Fixes
- Fix crash on macOS 26 when starting recording -
EXC_BAD_ACCESSin StreamingHandler (PR #74) - Fix crash when saving history records after ScriptPlugin processing
- Fix API key persistence in cloud plugins (#72)
- Fix Prompt Palette text selection and replacement in Electron apps (#70)
- Fix auto-install of plugins - users now choose in setup wizard (#66)
- Fix mic preview not restarting when switching devices in setup wizard (#63)
- Fix
MACOSX_DEPLOYMENT_TARGETfor all plugin targets (#64) - Fix
CODE_SIGN_IDENTITYoverrides for automatic signing (#67)
March 8, 2026
What's New
Auto-Unload Models
- Configurable auto-unload - Local transcription models can now be automatically unloaded from memory after a configurable inactivity period. Choose from Immediate, 2/5/10/30/60 minutes, or Never.
- Automatic reload - When you start dictating after a model was auto-unloaded, it reloads automatically - no manual intervention needed.
- RAM savings - Frees up gigabytes of RAM when you're not actively transcribing, especially useful on 8 GB machines.
Settings: Recording > Engine > Auto-unload model
Fixes
- Bundled plugins - Fixed bundled plugins not being auto-enabled on first launch.
March 7, 2026
What's New
Instant Results with Streaming
When using Deepgram or AssemblyAI with live transcription enabled, the streaming result is now used as the final transcription. Previously, the app would discard the live result and run a redundant batch transcription after recording stopped — adding unnecessary delay. Now results appear instantly when you stop recording.
AssemblyAI Streaming Fixes
- Live transcription now works — fixed audio chunk sizing (AssemblyAI requires ≥50ms chunks)
- Multilingual support — automatically uses the multilingual streaming model for non-English languages
- No more duplicated text — fixed finality detection for the v3 streaming API
Bug Fix
- Fixed missing space in text stabilization when the transcription engine diverges mid-word
March 7, 2026
What's New
VoxtralPlugin
- Local Voxtral Mini 4B - New speech-to-text plugin for local transcription using Voxtral Mini 4B.
Setup Wizard
- Embedded Marketplace - Plugin marketplace is now embedded directly in Setup Wizard Step 2 with default engine and LLM pickers.
Homebrew
- Homebrew Tap support - Install TypeWhisper via
brew install typewhisper/tap/typewhisper.
Fixes
- Control+Space hotkey - Fixed hotkey recorder not capturing modifier+key combinations like Control+Space.
- Processing spinner - Show a spinner in the notch indicator during transcription processing.
- Microphone permission - Auto-open Settings when microphone permission is missing.
March 6, 2026
What's New
Widgets
- Home Screen Widgets - 4 new WidgetKit widgets: Stats (small), Last Transcription (small), Activity Chart (medium), and History (large). Add them to your desktop for quick dictation insights.
Simulated Streaming
- Streaming for all engines - Non-streaming engines (Parakeet, Qwen3, cloud engines) now show simulated streaming text output, giving visual feedback while transcription completes.
macOS 14 Support
- macOS Sonoma compatibility - Deployment target lowered to macOS 14. All plugins now load on Sonoma as well.
March 2, 2026
Performance
- History section: Opens instantly — async cache loading off the UI thread and real UI virtualization (ListBox) replace the previous synchronous load that froze the UI
- Dashboard: No longer blocks app startup — history data loads asynchronously in the background
- Stats caching: TotalRecords/Words/Duration are cached and updated incrementally instead of recalculated via LINQ on every access
March 2, 2026
Other Changes
- Fix SupportedLanguages DIM test by enabling CallBase on mock
- Refresh profile transcription model list on plugin state changes
- Improve OpenAI Compatible plugin connection UX and remove inline styles
- Migrate all settings sections to WPF-UI controls
- Migrate PluginsSection to WPF-UI with CardExpander and fix expand state
- Add WPF-UI Fluent Design and migrate SettingsWindow to FluentWindow
- Add OpenAI Compatible plugin for local LLM servers
- Extract local models into SherpaOnnx plugin and unify Models UI
- Refresh plugin and model lists live after marketplace install/uninstall
- Add plugin system parity: IActionPlugin, streaming, marketplace, and localization
- Add prompt palette for AI-powered text processing via hotkey
February 2026
February 27, 2026
What's New
Plugin Marketplace
Browse, install, update, and uninstall plugins directly from Settings > Integrations. Plugins are downloaded from GitHub Releases with progress feedback. Available/Installed tabs, update badges, and "Install from File" for local bundles.
Qwen3 ASR Engine
New local transcription engine powered by MLX on Apple Silicon. 30 languages, no API key required. Available as a plugin from the marketplace.
All Engines are Plugins
WhisperKit, Parakeet, SpeechAnalyzer, Groq, and OpenAI transcription engines have been migrated to the plugin system. The Models tab has been removed - each engine now manages its models in its own settings within the Integrations tab.
Hide Menu Bar Icon
New option in Settings > General > Appearance to hide the menu bar icon. When hidden, a Dock icon appears and the settings window opens on launch.
Plugin Localization
Plugins now support localized names and descriptions. German translations included for all bundled plugins and marketplace listings.
Improvements
- Engine and model picker moved to Recording settings for easier access
- History view redesigned with wider settings window and cached computed properties
- Plugins report supported languages for better profile matching
- Model download progress shown immediately for all engines
- Plugin auto-restores its model on app launch
- Settings window forwards plugin state changes for live UI updates
Fixes
- WhisperKit model download progress polling
- Prevent re-installing plugins after user uninstalls them
- SpeechAnalyzer settings view loads models on appear
- Plugin deactivation falls back to first ready engine
- WhisperKitConfig downloadBase set correctly to prevent Documents permission prompt
February 26, 2026
Simplification Release
This release focuses on removing complexity and cleaning up features that didn't carry their weight.
Removed
- Whisper mode (microphone gain boost) - removed for simplicity
- Silence detection - never worked reliably enough to ship
- Default prompt setting - unused, prompts are better managed per-profile
- Auto-submit in profiles - removed to reduce profile complexity
- Notch indicator extras - clock and battery removed, blue audio glow removed
- Preset labels from prompt actions list
- Dead code - toast overlay, hotkey migration, unused settings
Improved
- Settings reorganized - "Dictation" tab renamed to "Recording", audio sections regrouped
- Hotkey settings - more compact layout
- DictationViewModel - refactored with shared helpers, DRY event processing
- README - updated to reflect current feature set
February 24, 2026
New Features
- Dynamic Island overlay: Configurable floating overlay with widgets (profile, model, audio level) and multi-monitor support
- Dictionary page redesign: Search bar with filtering, rich empty states with icons, segmented type selector, colored accent stripes on entry cards, polished add form and edit overlay
Improvements
- Profile widget: Show "Kein Profil" placeholder when no profile is active
- Profile settings: Hide detail panel when no profile is selected
February 24, 2026
What's New
History Overhaul
- Standalone History window - Access history directly from the tray menu, independent of Settings
- Timeline grouping - Entries grouped by Today, Yesterday, Last 7 Days, etc.
- Filters - Filter by app and time range with native picker controls
- Bulk delete - Multi-select entries and delete via context menu
- "Delete All Visible" - Delete all entries matching current filters at once
- Auto-retention - Configure automatic cleanup (keep last 30/90/180/365 days) in Settings > Advanced
- Multi-select export - Export multiple entries as Markdown, Plain Text, or JSON
Profiles
- Per-profile hotkeys - Assign dedicated hotkeys to individual profiles
- Auto-submit option - Automatically submit text in chat apps (press Enter after insertion)
API & Integration
- HTTP API endpoints for history and profiles (#30)
Improvements
- Separate Dev and Production builds with different bundle IDs and data directories
- Disable streaming preview when cloud engine is selected
- Fix translation language normalization
- Windows no longer force-float above other apps
February 23, 2026
What's New
HTTP API for External Integrations
TypeWhisper now exposes a comprehensive HTTP API for controlling the app from external tools like Raycast, Alfred, or custom scripts.
New endpoints:
GET /v1/history- Search and browse transcription history (withq,limit,offsetparams)DELETE /v1/history?id=<uuid>- Delete a history entryGET /v1/profiles- List all profilesPUT /v1/profiles/toggle?id=<uuid>- Enable/disable a profilePOST /v1/dictation/start- Start voice recordingPOST /v1/dictation/stop- Stop voice recordingGET /v1/dictation/status- Check if currently recording
All date fields now use ISO 8601 format. CORS headers support PUT and DELETE methods.
Separate Dev and Production Builds
Debug and Release builds now use different bundle IDs and data directories, so you can run both side-by-side without conflicts:
- Production:
com.typewhisper.mac/~/Library/Application Support/TypeWhisper/ - Development:
com.typewhisper.mac.dev/~/Library/Application Support/TypeWhisper-Dev/
Other
- Updated README with screenshots for all settings tabs
- Added screenshot automation script
February 22, 2026
What's New
ActionPlugin System
- New
ActionPluginprotocol for routing LLM output to custom actions instead of text insertion - Bundled LinearPlugin: Create Linear issues directly from voice dictation
Bug Fixes
- Fix auto-paste not working in some apps (#29) - Text insertion now works reliably in all apps (VS Code, Notes, etc.) by removing an overly restrictive accessibility check
February 22, 2026
Bug Fixes
- Fix SmartScreen warnings for plugin DLLs on startup: Remove "Mark of the Web" (Zone.Identifier) from plugin files before loading assemblies. Prevents Windows SmartScreen balloon notifications for built-in plugins (e.g. Groq) after install or update via Velopack.
February 22, 2026
Bug Fixes
- Fix built-in plugins missing after install/update: OpenAI, Groq, and Webhook plugins are now included in the Velopack installer package. Previously they were only present in build output but not in the publish directory.
- Fix settings lost on updates: Settings persistence is now resilient against corruption with atomic writes, automatic backup before each save, and recovery from backup if the primary file is corrupt.
February 22, 2026
Bugfixes
Auto-Update: Balloon-Klick funktioniert jetzt
- Klick auf die Tray-Notification "Update verfügbar" startet jetzt direkt den Download und die Installation
- Vorher wurde der Klick ignoriert, obwohl der Text "Klicken Sie hier zum Aktualisieren" anzeigte
Auto-Start: Funktioniert nach Updates zuverlässig
--minimizedFlag wird jetzt korrekt ausgewertet- Nach Velopack-Updates wird der Auto-Start Registry-Eintrag automatisch mit dem neuen Exe-Pfad aktualisiert
- Onboarding-Fenster wird bei minimiertem Start übersprungen
- Logging in
startup.logfür einfacheres Debugging
February 22, 2026
What's New
Multi-Webhook System (Mac-Parität)
- Multiple Webhooks mit individueller Konfiguration (Name, URL, HTTP-Methode)
- Profil-Filter pro Webhook (nur bei bestimmten Profilen auslösen)
- Delivery Log mit den letzten 20 Zustellungen (Erfolg/Fehler)
- Automatischer Retry nach 5 Sekunden bei Fehlern
- Neuer Bearbeitungs-Dialog für Webhook-Konfiguration
Laufende Apps im Profil-Editor
- Apps-Feld zeigt jetzt laufende Anwendungen als klickbare Vorschläge
- Ein Klick fügt die App zum Profil-Filter hinzu
- Liste aktualisiert sich automatisch
SDK
TranscriptionCompletedEvententhält jetztProfileName
February 21, 2026
Bug Fixes
- Fix rapid back-to-back dictation hanging the app - When starting a new recording immediately after the previous one finished, the app could become unresponsive. The inserting-state reset timer now gets properly cancelled when a new recording begins.
Cleanup
- Removed unused "Always Paste" option from profiles
February 21, 2026
What's New in v0.9
Plugin System
TypeWhisper is now extensible through plugins. A new plugin SDK (TypeWhisperPluginSDK) enables developers to build custom integrations:
- Extension points: LLM providers, transcription engines, and post-processors can all be added via plugins
- Typed event bus: Plugins can observe dictation lifecycle events (recording start/stop, transcription complete, text inserted, etc.)
- Host services API: Plugins get scoped keychain access, UserDefaults, data directories, and profile name resolution
- Priority-based post-processing pipeline: Multiple post-processors run in defined order
Cloud Providers are now Plugins
Groq, OpenAI, and Gemini are now bundled as plugins instead of hardcoded providers. This makes them easier to update independently and serves as reference implementations for plugin developers.
- Gemini support - New! Use Google's Gemini models for prompt processing via OpenAI-compatible API
- Dual-purpose plugins - Groq and OpenAI plugins provide both transcription and LLM capabilities
- User-installed plugins in
~/Library/Application Support/TypeWhisper/Plugins/take priority over built-in ones
Webhook Plugin
A new built-in webhook plugin lets you send transcription results to external services:
- Configurable webhook URLs with per-profile filtering
- Automatic retry logic and delivery log
Bug Fixes
- Fixed media pause launching Apple Music when nothing is playing (#25)
- Fixed WhisperKit model disappearing from picker after first transcription
February 21, 2026
What's Changed
- Prompt Palette redesigned: Moved from notch overlay to a standalone floating panel for better usability and more screen space
- Project roadmap added: Added a roadmap document outlining planned features and development direction
February 20, 2026
What's Changed
- Add global default prompt setting (#17)
- Fix audio end being cut off on quick stop
- Move WhisperKit model storage from Documents to Application Support
- Add silence-pause mode for continuous dictation
- Remove unused DictationOverlay files
February 20, 2026
Other Changes
- Fix Velopack update discovery with per-architecture channels
February 20, 2026
Other Changes
- Add version display and update check to Info settings section
- Remove unused XAML namespace and explicit DataType from DictionarySection
February 19, 2026
February 18, 2026
Notch Indicator
New recording indicator that visually extends the MacBook notch - inspired by boring.notch.
New Features
- Notch-extending overlay: Three-zone layout (left | notch | right) with smooth Bezier ear curves, centered over the hardware notch
- Audio-reactive blue glow: Blue shimmer emanates from the notch shape, intensity reacts to voice level
- Configurable content: Both left and right sides independently configurable with 6 options: Recording Indicator, Timer, Waveform, Clock, Battery, None
- Expandable streaming text: Panel expands downward to show live transcription text with auto-scroll
- Non-notch support: Falls back to a compact centered bar on external displays without a notch
- Full buffer streaming: Removed 28-second sliding window limit - streaming preview now covers the entire recording
Changes
- Classic overlay removed (replaced by notch indicator)
- Display mode setting removed (simplified to visibility + left/right content pickers)
- 65 missing German translations added
Defaults
- Visibility: During activity
- Left: Timer
- Right: Waveform
February 17, 2026
Other Changes
- Non-blocking transcription pipeline with Channel<T> job queue
February 16, 2026
Other Changes
- Add cloud transcription providers (Groq, OpenAI) with provider architecture
February 16, 2026
February 15, 2026
Other Changes
- Fix release pipeline: apply RID only to publish step
- Build separate release artifacts for win-x64 and win-arm64
February 15, 2026
What's Changed
- Add project README with features, build instructions, architecture overview, and model recommendations
- Fix welcome wizard: replace outdated Whisper model list with Parakeet TDT 0.6B and Canary 180M Flash
February 15, 2026
Other Changes
- Add release pipeline with GitHub Actions and Velopack
- Redesign dashboard with activity chart and stats
- Fix dispose guards and missing connection opens
- Add DB migration for tags column and consume trailing punctuation in snippets
- Enhance snippets: tags, clipboard placeholder, import/export, editor dialog
- Add dictionary feature: cards UI, term packs, auto-learning
- Extend sound durations and remove stop sound
- Add sound feedback with NAudio and fix playback order
- Remove Whisper reference from info section
- Add audio ducking and media pause during recording
- Redesign history as timeline with inline-expandable cards
- Add .agents/ and .claude/ to gitignore
- Enhance overlay UI with partial transcription, badges, and timer
- Add live transcription with Silero VAD for real-time partial results
- Refactor hotkeys: support hybrid, toggle-only, and hold-only modes
- Remove Whisper engine, consolidate to Parakeet-only
- Add app icon for tray, taskbar, and exe from Mac icon set
- Add browser URL detection via Windows UI Automation for profile matching
- Initial commit: TypeWhisper Windows with local Opus-MT translation