Changelog

Release notes and updates for TypeWhisper.

April 2026

macOS

v1.3.0-daily.20260421

April 21, 2026

New Features

  • add prompt temperature controls and lazy plugin loading

Bug Fixes

  • remove duplicate macOS 14 settings sidebar button
macOS

v1.3.0-rc2

April 19, 2026

TypeWhisper 1.3.0-rc2 rolls up the stabilization work that landed after rc1, with a focus on push-to-talk reliability, short-speech handling, audio recovery hardening, and the refreshed local-model and licensing flows.

Reliability & Input Handling

  • Fixed modifier-combo push-to-talk so transient flagsChanged churn does not stop recording before the final key release.
  • Added the default push-to-talk discard behavior when extra keys interrupt an active hold, while keeping the stop point predictable on the actual hotkey release.
  • Added an advanced opt-in to transcribe more short and quiet clips instead of classifying them as no speech.
  • Hardened audio recovery, preview recovery, and recording-abort cleanup around route changes and device loss.
  • Delayed media playback resume after dictation to avoid premature restarts.
  • Ignored benign startup configuration changes that should not trigger unnecessary recovery work.

Local Models & Plugins

  • Upgraded the shared local-model runtime stack and added Gemma 4 support with clearer model-policy messaging.
  • Improved local-model download, cancellation, and Hugging Face token flows across the bundled local plugins.
  • Fixed Gemini model discovery and prompt-processing fallback handling.

Settings & UX

  • Redesigned the in-app licensing settings flow.
  • Aligned spoken-language picker labels and search behavior with the app locale.
macOS

v1.3.0-daily.20260419

April 19, 2026

New Features

  • redesign the licensing settings flow

Bug Fixes

  • ignore benign startup config change on recording start
  • align language picker search with app locale
  • bounded recovery + terminal error propagation

Other Changes

  • Fix Gemini model discovery and prompt fallback handling
  • Trigger website rebuilds for preview releases
macOS

v1.3.0-daily.20260418

April 18, 2026

Other Changes

  • Trigger website rebuilds for preview releases
Windows

v0.6.1

April 17, 2026

TypeWhisper v0.6.1

TypeWhisper 0.6.1 is a focused stability release that tightens dictation behavior, file transcription model selection, provider state handling, and release packaging.

Improvements

  • Improved provider and model state handling so the app is less likely to lose the selected configuration while switching between engines and prompts.
  • Refined the bundled Groq integration with better LLM model selection and refresh behavior.

Fixes

  • Fixed recorder-compatible hotkey parsing so captured shortcuts round-trip correctly through the UI and runtime hotkey pipeline.
  • Fixed file transcription to load the model the user actually selected instead of falling back unexpectedly.
  • Fixed the silent-stop overlay reset path and closed a streaming session leak that could leave overlay state behind after recording ended.

Release Prep

  • Hardened the release workflow so app tags prefer curated notes from docs/releases/ and fall back to commit-generated notes only when no curated file exists.
  • Scoped previous-tag lookup to app tags only, preventing plugin tags from polluting app release note ranges.
  • Prepared the Windows release metadata for 0.6.1, including CLI version reporting and Groq plugin metadata alignment for plugin-groq-v1.0.2.
macOS

v1.3.0-rc1

April 17, 2026

TypeWhisper 1.3.0-rc1 opens the 1.3 release-candidate line. The release adds spoken feedback, per-request STT control through the HTTP API and CLI, multilingual language hints, and the new plugin-compatibility safeguards introduced after #327, on top of a set of hotkey, audio, and plugin reliability fixes.

New Features

  • Spoken feedback (text-to-speech) with a new System Voice marketplace plugin that uses the built-in macOS AVSpeechSynthesizer voices. Voice and speed are configurable in the settings, and the feedback is scoped to transcription readback only.
  • Per-request STT engine and model selection through the HTTP API (engine/model on /v1/transcribe) and the CLI (typewhisper transcribe --engine <id> --model <id>). When omitted, the active profile defaults are used.
  • Multilingual language hints with a multi-select picker, inline search, and a selected-count indicator, wired through to the transcription providers.

Bug Fixes

  • Restored GPT-5 and Codex compatibility for the bundled OpenAI plugin.
  • Fixed Fn-hotkey handling so both press-and-release and press-and-hold strategies work reliably.
  • Spoken feedback is now limited to transcription readback and no longer narrates unrelated UI events.
  • Addressed the 1.2.3 review blockers (K1-K4, M1, M7, M8) in the release pipeline and plugin registry.
  • Split the marketplace into legacy and v1 feeds, moved plugin and term-pack fetches out of startup, and hardened plugin loading so incompatible external bundles are skipped instead of breaking launch.
  • Hardened audio recovery during Bluetooth route changes to avoid crashes.
  • Dictionary terms are forwarded through the streaming providers (AssemblyAI, Soniox, SpeechAnalyzer) without breaking the live session.

Other Changes

  • Refactored the Qwen3 context-bias formatter for cleaner prompt injection and easier future tuning.
  • Incremental live transcription, Google Cloud STT release mapping, and the multi-release plugin registry are carried forward from the late 1.2.x work.
  • All current marketplace plugins now have v1 releases for the RC line, including System Voice as an installable TTS plugin.
macOS

v1.3.0-daily.20260417

April 17, 2026

New Features

  • per-request STT engine/model selection (#317) (#325)
  • add TTS providers for spoken feedback (#323)
  • add multilingual language hints (#312) (#319)

Bug Fixes

  • dedupe build-file UUID collision between Qwen3ContextBiasFormatter and MinimalIndicatorView
  • limit spoken feedback to transcription readback (#324)
  • address release 1.2.3 review blockers (K1-K4, M1/M7/M8) (#320)
  • support Fn press/release hotkey strategies (#316)
  • restore GPT-5 OpenAI plugin compatibility (#313)

Other Changes

  • Fix plugin SDK compatibility handling (#328)
  • Add double-escape cancel confirmation (#326)
  • strip trailing newline from 1.3.0-rc1 release notes
  • Refactor Qwen3 context bias formatting (#322)
macOS

v1.2.2-daily.20260416

April 16, 2026

New Features

  • wire dictionary terms through transcription providers

Bug Fixes

  • keep AssemblyAI streaming active with dictionary terms

Other Changes

  • Update localized string catalog
  • Allow optional profile scope and default fallback
  • Fix PR guard bot handling
  • Add Google Cloud STT plugin release mapping
  • Keep plugin registry backward compatible
  • Fix legacy plugin release migration
  • Add multi-release registry and incremental live transcription
  • fix audio recovery crash during Bluetooth route changes
  • remove temporary plugin debug logging
  • Remove deprecated activation option
  • Allow empty correction replacements
  • Add manual uninstall instructions to README
  • Fix managed window activation on repeated menu bar opens
  • Update localized string catalog ordering
  • Fix #287 validate selected audio inputs on selection
Windows

v0.6.0

April 14, 2026

New Features

  • add whisper.cpp marketplace plugin

Bug Fixes

  • add new plugins to marketplace registry workflow
  • clear whisper model selection on unload

Other Changes

  • harden CI and prepare v0.6.0 release
  • Refine settings shell and localization
  • Implement custom protocol handling and improve Discord claim integration
  • Add supporter Discord claims and live profile context
  • Refactor structured settings pages
  • Polish settings, wizard, and snippets UX
  • Overhaul Windows settings UI
  • Fix Win hotkey modifier handling
  • Add vocabulary boosting for local Parakeet transcription
Windows

v0.5.1

April 9, 2026

Bug Fix

  • Fixed marketplace not showing new plugins — RegistryPlugin deserialization failed when optional fields were missing from the registry JSON. Switched from positional record to init properties with defaults.
Windows

v0.5.0

April 8, 2026

What's New in v0.5.0

Major Architecture Change

  • Replaced SQLite with JSON file storage — All data (history, dictionary, snippets, profiles, prompts) is now stored as simple JSON files. No more database migrations. Faster startup, easier debugging, and human-readable data files.
  • Removed Microsoft.Data.Sqlite dependency — Smaller application footprint.

New Features

  • Error Log — Structured error logging with categories (transcription, recording, prompt, plugin). View errors and export diagnostics in Settings → About.
  • App-aware Formatting — Transcribed text is automatically formatted based on the target app: Markdown for Obsidian/Notion, HTML for Outlook, plain for code editors.
  • Audio History — Recordings are now saved as WAV files alongside transcription history. Play back past recordings directly from the history view.
  • Watch Folder — Monitor a folder for audio files and automatically transcribe them with sidecar .txt output.
  • Memory Service — Automatically extract lasting personal facts from transcriptions using LLM. Stored via memory storage plugins.
  • Spoken Feedback — Text-to-speech readback of transcribed text after each dictation (Settings → Advanced).
  • Accessibility Announcements — Screen reader support via UI Automation live region events for Narrator, NVDA, and JAWS.
  • System Audio Capture — WASAPI loopback capture for recording system audio output.
  • CLI Tooltypewhisper command-line tool with status, models, and transcribe commands.
  • License & Supporter Tiers — Polar.sh license activation with Bronze/Silver/Gold supporter badges.
  • Release Channels — Stable, Release Candidate, and Daily update channels.
  • Audio Recorder — Standalone recording feature in Settings → Recorder with auto-transcription and recording history.

UI Improvements

  • Redesigned Settings Sidebar — Restructured to match Mac app: Home, General, Recording, Models, Recorder, History, Dictionary, Snippets, Profiles, Prompts, Integrations, Advanced, License, About.
  • Dashboard Enhancements — Trend indicators (↑↓%) on all stat cards, "All Time" period option, recent transcriptions section.
  • Overlay Redesign — Centered text layout, subtle border, larger font, more padding. Matches the new preview in Settings.
  • Overlay Preview — Live preview in General settings showing actual widgets, updates in real-time when changing widget selections.
  • Marketplace Grouping — Plugins are now grouped by category (Transcription Engines, LLM Providers, Memory, Actions, Utilities) with count badges.

New Plugins (14)

LLM Providers: Claude (Anthropic), OpenRouter, Cerebras, Cohere, Fireworks Transcription Engines: Gladia, Speechmatics, Soniox, Cloudflare ASR, Google Cloud STT, Qwen3 STT, Voxtral (Mistral) Memory Storage: File Memory, OpenAI Vector Memory

Plugin SDK Enhancements

  • ActionCompletedEvent for tracking action plugin results
  • App context (process name, window title, URL) in transcription events
  • IMemoryStoragePlugin interface for memory storage backends
  • IPluginSettingsActivity for progress reporting in plugin settings
  • SetStreamingDisplayActive() for plugins that handle their own streaming UI
  • Category and IsLocal fields in plugin manifest for marketplace grouping

Bug Fixes

  • #21 — Added option to disable saving transcriptions to history (Settings → Advanced)
  • #22 — Fixed intermittent hotkey toggle failures with 300ms debounce
  • #24 — Hotkey events are now swallowed to prevent triggering media playback in background apps
  • Hotkeys can now be cleared with Escape, Delete, or Backspace
  • Timer and other widgets show proper fallback values in overlay preview
Windows

v0.4.3

April 5, 2026

New Features

  • add local Gemma 4 LLM plugin (LLamaSharp/GGUF)

Bug Fixes

  • include NuGet dependencies and native runtimes in plugin ZIP

Other Changes

  • add gemma-local to plugin publish workflow
Windows

v0.4.2

April 5, 2026

New Features

  • Silence hallucination filter — Two-layer detection (client-side energy gate + server-side no_speech_prob) prevents cloud Whisper models from hallucinating random text on silent audio (#19)
  • Visual hybrid hotkey indicators — Toggle mode now shows amber overlay border and mode-aware status text ("Recording (Toggle) - press hotkey to stop") so you always know recording is active (#7)
  • Plugin localization — All 6 cloud plugins (OpenAI, Groq, Gemini, Deepgram, AssemblyAI, OpenAI Compatible) now fully support English and German in their settings views (#14)
  • IBM Granite Speech plugin — New local transcription engine available in the Marketplace

Bug Fixes

  • App no longer freezes when no microphone is connected; shows feedback and auto-recovers when a mic is plugged in (#5)
  • Groq plugin now has a 30-second HTTP timeout instead of infinite, preventing hangs on slow/unreachable API (#6)
  • Prompt actions now show a clear warning when no LLM provider is configured, instead of silently doing nothing (#8)
  • Translation is correctly skipped when the configured source language already matches the target language (#9)
  • System language detection now uses the Win32 API directly, fixing incorrect language when launched via the Velopack installer (#15)
  • Granite Speech plugin download now includes retry logic and validation (#4)

March 2026

Windows

v0.4.1

March 16, 2026

What's New in v0.4.1

Welcome Wizard Redesign

  • Dark theme — Migrated to FluentWindow with Mica backdrop, matching the rest of the app. Previously rendered with invisible text on a white background
  • Plugin-based setup — Step 1 now dynamically lists all available marketplace extensions with install/installed status and shows transcription models from installed engines. No more hardcoded model selection
  • Streamlined flow — Reduced from 5 steps to 4: Extensions & Model → Microphone → Hotkeys → Done. The obsolete "Cloud Providers" step has been removed (now handled by the plugin system)
  • Re-run anytime — Added "Open Wizard" shortcut on the Dashboard to re-run the setup at any time

Bug Fixes

  • Fixed all Welcome Window elements being invisible on the dark theme (#3)
Windows

v0.4.0

March 15, 2026

What's New in v0.4.0

This is a major feature release bringing full feature parity with the Mac version.

Post-Processing Pipeline

  • Refactored text processing into a priority-based pipeline architecture
  • Plugin post-processors can now hook into the processing chain with custom priorities
  • Action plugins (Linear, Obsidian) can receive processed text instead of inserting it

HTTP API (7 new endpoints)

  • GET /v1/history — Search history with pagination
  • DELETE /v1/history — Delete history entries
  • GET /v1/profiles — List all profiles
  • PUT /v1/profiles/toggle — Toggle profile on/off
  • POST /v1/dictation/start / stop — Remote recording control
  • GET /v1/dictation/status — Check recording state
  • Extended /v1/status with streaming and translation capabilities

Model Auto-Unload

  • Models are now automatically unloaded after a configurable idle timeout
  • Saves memory when TypeWhisper is running but not actively used
  • Configurable via ModelAutoUnloadSeconds setting (0 = disabled)

History Export

  • Added Markdown (.md) and JSON (.json) export formats
  • Export dialog now supports 4 formats: Text, CSV, Markdown, JSON

New Plugins (available in Marketplace)

  • Google Gemini — LLM provider supporting Gemini 2.5 Flash, Pro, and Flash Lite
  • Linear — Create Linear issues directly from transcriptions via GraphQL API
  • Obsidian — Save transcriptions as Markdown notes in Obsidian vaults with YAML frontmatter
  • Script Runner — Pipe transcriptions through custom shell scripts (cmd/PowerShell)
  • Live Transcript — Floating semi-transparent window showing real-time transcription

Per-Profile Hotkeys

  • Each profile can now have a dedicated hotkey for instant activation
  • Bypasses automatic app/URL matching — press the hotkey to dictate with that specific profile
  • Configure via the new hotkey recorder in the profile detail panel

Audio Device Preview

  • Real-time microphone level meter in audio settings
  • Visual feedback when selecting or testing microphone devices
  • Preview automatically starts/stops when navigating to/from the Recording section

Database

  • Schema upgraded to v6 with new columns for action plugin routing, profile hotkeys, and model tracking
Windows

v0.3.3

March 7, 2026

What's New

Instant Results with Streaming

When using Deepgram or AssemblyAI with live transcription enabled, the streaming result is now used as the final transcription. Previously, the app would discard the live result and run a redundant batch transcription after recording stopped — adding unnecessary delay. Now results appear instantly when you stop recording.

AssemblyAI Streaming Fixes

  • Live transcription now works — fixed audio chunk sizing (AssemblyAI requires ≥50ms chunks)
  • Multilingual support — automatically uses the multilingual streaming model for non-English languages
  • No more duplicated text — fixed finality detection for the v3 streaming API

Bug Fix

  • Fixed missing space in text stabilization when the transcription engine diverges mid-word
Windows

v0.3.2

March 3, 2026

Other Changes

  • Add UI localization with English default and German support
Windows

v0.3.1

March 2, 2026

Performance

  • History section: Opens instantly — async cache loading off the UI thread and real UI virtualization (ListBox) replace the previous synchronous load that froze the UI
  • Dashboard: No longer blocks app startup — history data loads asynchronously in the background
  • Stats caching: TotalRecords/Words/Duration are cached and updated incrementally instead of recalculated via LINQ on every access

Full Changelog →

Windows

v0.3.0

March 2, 2026

Other Changes

  • Fix SupportedLanguages DIM test by enabling CallBase on mock
  • Refresh profile transcription model list on plugin state changes
  • Improve OpenAI Compatible plugin connection UX and remove inline styles
  • Migrate all settings sections to WPF-UI controls
  • Migrate PluginsSection to WPF-UI with CardExpander and fix expand state
  • Add WPF-UI Fluent Design and migrate SettingsWindow to FluentWindow
  • Add OpenAI Compatible plugin for local LLM servers
  • Extract local models into SherpaOnnx plugin and unify Models UI
  • Refresh plugin and model lists live after marketplace install/uninstall
  • Add plugin system parity: IActionPlugin, streaming, marketplace, and localization
  • Add prompt palette for AI-powered text processing via hotkey

February 2026

Windows

v0.2.8

February 24, 2026

New Features

  • Dynamic Island overlay: Configurable floating overlay with widgets (profile, model, audio level) and multi-monitor support
  • Dictionary page redesign: Search bar with filtering, rich empty states with icons, segmented type selector, colored accent stripes on entry cards, polished add form and edit overlay

Improvements

  • Profile widget: Show "Kein Profil" placeholder when no profile is active
  • Profile settings: Hide detail panel when no profile is selected

Full Changelog →

Windows

v0.2.7

February 22, 2026

Bug Fixes

  • Fix SmartScreen warnings for plugin DLLs on startup: Remove "Mark of the Web" (Zone.Identifier) from plugin files before loading assemblies. Prevents Windows SmartScreen balloon notifications for built-in plugins (e.g. Groq) after install or update via Velopack.

Full Changelog →

Windows

v0.2.6

February 22, 2026

Bug Fixes

  • Fix built-in plugins missing after install/update: OpenAI, Groq, and Webhook plugins are now included in the Velopack installer package. Previously they were only present in build output but not in the publish directory.
  • Fix settings lost on updates: Settings persistence is now resilient against corruption with atomic writes, automatic backup before each save, and recovery from backup if the primary file is corrupt.

Full Changelog →

Windows

v0.2.5

February 22, 2026

Bugfixes

Auto-Update: Balloon-Klick funktioniert jetzt

  • Klick auf die Tray-Notification "Update verfügbar" startet jetzt direkt den Download und die Installation
  • Vorher wurde der Klick ignoriert, obwohl der Text "Klicken Sie hier zum Aktualisieren" anzeigte

Auto-Start: Funktioniert nach Updates zuverlässig

  • --minimized Flag wird jetzt korrekt ausgewertet
  • Nach Velopack-Updates wird der Auto-Start Registry-Eintrag automatisch mit dem neuen Exe-Pfad aktualisiert
  • Onboarding-Fenster wird bei minimiertem Start übersprungen
  • Logging in startup.log für einfacheres Debugging

Full Changelog →

Windows

v0.2.4

February 22, 2026

What's New

Multi-Webhook System (Mac-Parität)

  • Multiple Webhooks mit individueller Konfiguration (Name, URL, HTTP-Methode)
  • Profil-Filter pro Webhook (nur bei bestimmten Profilen auslösen)
  • Delivery Log mit den letzten 20 Zustellungen (Erfolg/Fehler)
  • Automatischer Retry nach 5 Sekunden bei Fehlern
  • Neuer Bearbeitungs-Dialog für Webhook-Konfiguration

Laufende Apps im Profil-Editor

  • Apps-Feld zeigt jetzt laufende Anwendungen als klickbare Vorschläge
  • Ein Klick fügt die App zum Profil-Filter hinzu
  • Liste aktualisiert sich automatisch

SDK

  • TranscriptionCompletedEvent enthält jetzt ProfileName

Full Changelog →

Windows

v0.2.3

February 20, 2026

Other Changes

  • Fix Velopack update discovery with per-architecture channels
Windows

v0.2.2

February 20, 2026

Other Changes

  • Add version display and update check to Info settings section
  • Remove unused XAML namespace and explicit DataType from DictionarySection
Windows

v0.2.1

February 17, 2026

Other Changes

  • Non-blocking transcription pipeline with Channel<T> job queue
Windows

v0.2.0

February 16, 2026

Other Changes

  • Add cloud transcription providers (Groq, OpenAI) with provider architecture
Windows

v0.1.2

February 15, 2026

Other Changes

  • Fix release pipeline: apply RID only to publish step
  • Build separate release artifacts for win-x64 and win-arm64
Windows

v0.1.1

February 15, 2026

What's Changed

  • Add project README with features, build instructions, architecture overview, and model recommendations
  • Fix welcome wizard: replace outdated Whisper model list with Parakeet TDT 0.6B and Canary 180M Flash

Full Changelog →

Windows

v0.1.0

February 15, 2026

Other Changes

  • Add release pipeline with GitHub Actions and Velopack
  • Redesign dashboard with activity chart and stats
  • Fix dispose guards and missing connection opens
  • Add DB migration for tags column and consume trailing punctuation in snippets
  • Enhance snippets: tags, clipboard placeholder, import/export, editor dialog
  • Add dictionary feature: cards UI, term packs, auto-learning
  • Extend sound durations and remove stop sound
  • Add sound feedback with NAudio and fix playback order
  • Remove Whisper reference from info section
  • Add audio ducking and media pause during recording
  • Redesign history as timeline with inline-expandable cards
  • Add .agents/ and .claude/ to gitignore
  • Enhance overlay UI with partial transcription, badges, and timer
  • Add live transcription with Silero VAD for real-time partial results
  • Refactor hotkeys: support hybrid, toggle-only, and hold-only modes
  • Remove Whisper engine, consolidate to Parakeet-only
  • Add app icon for tray, taskbar, and exe from Mac icon set
  • Add browser URL detection via Windows UI Automation for profile matching
  • Initial commit: TypeWhisper Windows with local Opus-MT translation