Voice Notes — AI Transcription
Desktop transcription app with AI-powered speech-to-text, hotkey recording, and real-time processing.
Role
Solo Developer
Timeline
1 Month
Stack
Tauri, React, TypeScript
Focus
Desktop Engineering
The Challenge
Building a desktop app with web technologies that still feels native: capturing system-level audio, responding to global hotkeys, and streaming audio for real-time transcription with minimal latency.
Architectural Decisions
◆Rust Backend
Tauri + Rust for native performance, small binary size, and direct access to system audio APIs.
◆Streaming Pipeline
Real-time audio streaming to Deepgram, with audio sent in small chunks so transcription feedback appears while the user is still speaking.
◆Global Hotkeys
System-wide keyboard shortcuts for start/stop recording without switching window focus.
◆Clipboard Integration
One-click copy of transcribed text directly to the system clipboard, so it can be pasted into any other app.
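To illustrate the chunked processing behind the Streaming Pipeline decision, here is a minimal sketch in TypeScript. It is not the app's actual code: `PcmChunker`, `samplesPerChunk`, the `onChunk` callback, and the 16 kHz / 100 ms figures are all hypothetical, and the transport that would forward each chunk to the transcription service is deliberately left out. The idea is only that captured 16-bit PCM samples are regrouped into fixed-size chunks and handed off as soon as each one is full.

```typescript
// Hypothetical helper (not from the original project): regroups incoming
// 16-bit PCM samples into fixed-size chunks and emits each chunk as soon
// as it is full, so it can be forwarded to a streaming transcription API.
class PcmChunker {
  private readonly chunkSamples: number;
  private readonly onChunk: (chunk: Int16Array) => void;
  private buffer: Int16Array;
  private filled = 0;

  constructor(chunkSamples: number, onChunk: (chunk: Int16Array) => void) {
    if (!Number.isInteger(chunkSamples) || chunkSamples <= 0) {
      throw new RangeError("chunkSamples must be a positive integer");
    }
    this.chunkSamples = chunkSamples;
    this.onChunk = onChunk;
    this.buffer = new Int16Array(chunkSamples);
  }

  // Accepts a capture buffer of any length; may emit zero or more chunks.
  push(samples: Int16Array): void {
    let offset = 0;
    while (offset < samples.length) {
      const room = this.chunkSamples - this.filled;
      const take = Math.min(room, samples.length - offset);
      this.buffer.set(samples.subarray(offset, offset + take), this.filled);
      this.filled += take;
      offset += take;
      if (this.filled === this.chunkSamples) {
        this.onChunk(this.buffer);
        // Allocate a fresh buffer so the emitted chunk is never overwritten.
        this.buffer = new Int16Array(this.chunkSamples);
        this.filled = 0;
      }
    }
  }

  // Emits any remaining partial chunk, e.g. when recording stops.
  flush(): void {
    if (this.filled > 0) {
      this.onChunk(this.buffer.slice(0, this.filled));
      this.buffer = new Int16Array(this.chunkSamples);
      this.filled = 0;
    }
  }
}

// Converts a chunk duration into a sample count for a given sample rate.
function samplesPerChunk(sampleRateHz: number, chunkMs: number): number {
  return Math.round((sampleRateHz * chunkMs) / 1000);
}
```

In use, something like `new PcmChunker(samplesPerChunk(16000, 100), send)` would hand a hypothetical `send` function one chunk per 100 ms of audio, and `flush()` would be called when the stop hotkey fires. Smaller chunks lower the delay before the first partial transcript but increase the number of messages sent.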
Impact & Results
<500ms
Transcription Latency
15MB
App Size
12+
Languages Supported