# WHISPER VOICE ### SOVEREIGN SPEECH RECOGNITION
**Your Voice. Your Machine. Your Data.**
*A high-performance, locally-run dictation agent for the liberated desktop.* [Download Release](https://git.lashman.live/lashman/whisper_voice/releases) • [View Source](https://git.lashman.live/lashman/whisper_voice) • [Report Issue](https://git.lashman.live/lashman/whisper_voice/issues)
--- ## ✊ The Manifesto **We hold these truths to be self-evident: That user data is an extension of the self, and its exploitation by centralized clouds is a violation of digital autonomy.** Whisper Voice is built on the principle of **technological sovereignty**. It provides state-of-the-art speech recognition without renting your cognitive output to corporate oligarchies. By running entirely on your own hardware, it reclaims the means of digital production, ensuring that your words remain exclusively yours. ## ⚡ Technical Core Under the hood, Whisper Voice exploits the raw power of **Faster-Whisper**, a highly optimized implementation of OpenAI's Whisper model using CTranslate2. This delivers: * **Zero Latency Loop**: By eliminating network round-trips, transcription happens as fast as your hardware can think. * **Privacy by Physics**: Data physically cannot leave your machine because the engine has no cloud uplink. The cable is cut. * **Precision Engineering**: Leveraging 8-bit quantization to run even the `Large-v3` models on consumer GPUs with minimal memory footprint. ## ✨ Capabilities ### 🧠 Adaptive Intelligence Choose the model that fits your rig. From `Tiny` (low resource, high speed) to `Large` (human-level accuracy). The agent automatically configures itself for your available Compute Device (CUDA GPU or CPU). ### 🚀 Inputs & Injection * **Global Hotkey**: A rigorous system-wide hook (default `F9`) puts the ear of the machine at your fingertips. * **Simulated Typing**: Bylaws of some applications block pasting? No problem. Our engine simulates keystrokes at supersonic speeds (up to **6000 CPM**), bypassing restrictions like water flowing around a rock. * **Clipboard Mode**: Standard, lightning-fast text injection for permissive environments. ### 🛡️ System Integration * **Glassmorphic UI**: A modern, non-intrusive QML interface that respects your screen real estate. * **Tray Agent**: Retracts to the system tray, maintaining a low profile until summoned. * **Bootstrapper**: A self-assembling runtime that provisions its own dependencies using an isolated embedded Python environment. No pollution of your system PATH. --- ## �️ Usage Guide ### Installation 1. **Acquire**: Download the latest portable executable from the [Releases](https://git.lashman.live/lashman/whisper_voice/releases) page. 2. **Deploy**: Place `WhisperVoice.exe` in a directory of your choosing. 3. **Initialize**: Run the executable. It will autonomously hydrate its runtime environment (approx. 2GB) on the first launch. ### Operation 1. **Configure**: Open Settings via the tray icon. Select your **Model Size** and **Compute Device**. 2. **Engage**: Press `F9` (or your custom hotkey) to open the channel. 3. **Dictate**: Speak clearly. The noise gate will isolate your voice. 4. **Execute**: Release the key. The machine interprets the signal and injects the text into your active window immediately. --- ## 🧪 Model Performance | Model | VRAM (Approx) | Speed | Capabilities | | :--- | :--- | :--- | :--- | | **Tiny** | < 1 GB | Supersonic | Quick commands, simple dictation. | | **Base** | 1 GB | Very Fast | Good balance for older hardware. | | **Small** | 2 GB | Fast | Standard daily driver. High English accuracy. | | **Medium** | 5 GB | Moderate | High precision, handles accents well. | | **Large-v3** | 8 GB+ | Heavy | Professional grade. Near-perfect understanding. | *Note: Performance scales with your GPU capabilities.* --- ## 🤝 Mutual Aid This software is free as in freedom. It is a commons, not a commodity. Contributions are welcome from all who share the vision of decentralized, local-first computing. Whether it is code, documentation, or design—labor given freely enriches the community whole. **[Fork the Repository](https://git.lashman.live/lashman/whisper_voice)** ---
"The master's tools will never dismantle the master's house."
Build your own tools. Run them locally.