SpeakToCodeSpeakToCode
SpeakToCodeSpeakToCode

Changelog

New features, improvements, and fixes

1.0.0-betaFeb 1, 2026

v1.0.0-beta: Public Beta Launch

We're thrilled to announce the public beta of SpeakToCode! After months of development and testing, voice-to-text for developers is here.

New Features

  • Voice-to-text transcription with OpenAI Whisper running 100% locally
  • 7 Whisper model sizes, from Tiny (75 MB) to Large (2.9 GB)
  • Global hotkeys for push-to-talk and toggle modes
  • Auto-paste to any active window
  • Multi-monitor overlay to see recording status across all screens
  • Transcription history with search and statistics
  • Cloud sync for optional backup of transcriptions and settings
  • Dark and light themes

Technical Details

  • Built with Tauri v2 for native Windows performance
  • whisper.cpp for optimized local inference
  • Supabase for authentication and cloud sync
  • 15-day free trial, no credit card required
0.2.0Jan 15, 2026

v0.2.0: AI Cleanup & Settings Sync

This release adds AI-powered text cleanup and cross-device settings sync.

New Features

  • AI text cleanup: Use Groq API to clean up and format your dictations
  • Settings sync: Your preferences now sync across devices via cloud
  • Improved overlay: Customizable position and display duration
  • Better hotkey support: More key combinations available

Bug Fixes

  • Fixed overlay flickering on some multi-monitor setups
  • Fixed clipboard paste delay on certain applications
  • Improved Whisper model download reliability

Performance

  • 15% faster transcription with optimized whisper.cpp build
  • Reduced memory usage for Tiny and Base models