diff --git a/README.md b/README.md index e86e374..d33994e 100644 --- a/README.md +++ b/README.md @@ -42,7 +42,40 @@ This operates on the metal. It is not a wrapper. It is an engine. --- -## 📊 Intelligence Matrix +## 🌎 Native Translation Engine + +Whisper Voice v1.0.1 introduces a powerful **Universal Translator** built directly into the core. This is not a web-request to Google Translate. This is a neural network running on your GPU that understands the semantic meaning of speech and reconstructs it in fluent English. + +* **Any Language Source**: Speak in French, Japanese, Russian, or 96 other languages. +* **English Output**: The engine instantly transcribes the audio into English text. +* **Zero Latency**: Translation happens in real-time as you speak (sentence-by-sentence). + +### Dual-Channel Operation +You do not need to switch modes manually. The application listens on two separate channels simultaneously. + +* **F9 (Default)** -> **Transcribe**: Types exactly what you say, in the language you speak. +* **F10 (Default)** -> **Translate**: Translates whatever you say in *any* language into English. + +This allows for seamless bilingual workflows. Dictate a message to a local friend on `F9`, then instantly reply to an international colleague on `F10` without touching a single setting. + +--- + +## đŸ•šī¸ Controls & Configuration + +### Global Hotkeys +The system runs silently in the background. Control it via global shortcuts: + +* **Transcribe (Default: F9)**: Use this for normal speech-to-text. It respects the language set in Settings (or Auto-Detect). +* **Translate (Default: F10)**: Use this to force translation to English. +* **Customization**: Both keys can be remapped in the Settings menu. The recorder supports complex combinations (e.g., `Ctrl + Alt + Space`). + +### Input Modes +* **Clipboard Paste**: Injects text via OS clipboard. Instant, but some games disable paste. +* **Simulate Typing**: Mimics physical keystrokes. Bypasses anti-cheat and anti-paste blocks. Configurable speed (default 6000 CPM) to prevent game kicks. + +--- + +## 📊 Intelligence Matrix (Models) Select the model that aligns with your hardware capabilities. @@ -65,82 +98,20 @@ Select the model that aligns with your hardware capabilities. 1. **Download**: Grab `WhisperVoice.exe` from [Releases](https://git.lashman.live/lashman/whisper_voice/releases). 2. **Deploy**: Place it anywhere. It is portable. 3. **Bootstrap**: Run it. The agent will self-provision an isolated Python environment (~2GB) on first launch. +4. **Updates**: Simply replace the `.exe`. The **Smart Bootstrapper** will detect the update and sync only the changed files, preserving your settings and skipping unnecessary downloads. -### đŸ•šī¸ Controls -* **Global Hook**: `F9` (Default). Press to open the channel. Release to inject text. -* **Tray Agent**: Retracts to the system tray. Right-click for **Settings** or **File Transcription**. - -### 📡 Input Modes -| Mode | Description | Speed | -| :--- | :--- | :--- | -| **Clipboard Paste** | Standard text injection via OS clipboard. | Instant | -| **Simulate Typing** | Mimics physical keystrokes. Bypasses anti-paste blocks. | Up to **6000** CPM | +### īŋŊ Troubleshooting +* **App crashes on start**: Ensure you have [Microsoft Visual C++ Redistributable 2015-2022](https://learn.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist) installed. +* **"Simulate Typing" is slow**: Some applications (remote desktops, older games) choke on super-fast input. Lower the typing speed in Settings to ~1200 CPM. +* **No Audio**: The agent listens to the **Default Communication Device**. Check your Windows Sound Control Panel. --- -## 🌐 Universal Translation +## 🌐 Supported Languages -The model listens in **99 languages** and translates them to English or transcribes them natively. +The engine supports 99 languages. You can lock the engine to a specific language in Settings to improve accuracy, or leave it on **Auto-Detect** for multilingual usage. -
-Click to view supported languages -
- -| | | | | -| :--- | :--- | :--- | :--- | -| Afrikaans đŸ‡ŋđŸ‡Ļ | Albanian đŸ‡Ļ🇱 | Amharic đŸ‡Ē🇹 | Arabic 🇸đŸ‡Ļ | -| Armenian đŸ‡Ļ🇲 | Assamese đŸ‡ŽđŸ‡ŗ | Azerbaijani đŸ‡ĻđŸ‡ŋ | Bashkir 🇷đŸ‡ē | -| Basque đŸ‡Ē🇸 | Belarusian 🇧🇾 | Bengali 🇧🇩 | Bosnian 🇧đŸ‡Ļ | -| Breton đŸ‡Ģ🇷 | Bulgarian 🇧đŸ‡Ŧ | Burmese 🇲🇲 | Castilian đŸ‡Ē🇸 | -| Catalan đŸ‡Ē🇸 | Chinese đŸ‡¨đŸ‡ŗ | Croatian 🇭🇷 | Czech 🇨đŸ‡ŋ | -| Danish 🇩🇰 | Dutch đŸ‡ŗđŸ‡ą | English đŸ‡ē🇸 | Estonian đŸ‡ĒđŸ‡Ē | -| Faroese đŸ‡Ģ🇴 | Finnish đŸ‡Ģ🇮 | Flemish 🇧đŸ‡Ē | French đŸ‡Ģ🇷 | -| Galician đŸ‡Ē🇸 | Georgian đŸ‡ŦđŸ‡Ē | German 🇩đŸ‡Ē | Greek đŸ‡Ŧ🇷 | -| Gujarati đŸ‡ŽđŸ‡ŗ | Haitian 🇭🇹 | Hausa đŸ‡ŗđŸ‡Ŧ | Hawaiian đŸ‡ē🇸 | -| Hebrew 🇮🇱 | Hindi đŸ‡ŽđŸ‡ŗ | Hungarian 🇭đŸ‡ē | Icelandic 🇮🇸 | -| Indonesian 🇮🇩 | Italian 🇮🇹 | Japanese đŸ‡¯đŸ‡ĩ | Javanese 🇮🇩 | -| Kannada đŸ‡ŽđŸ‡ŗ | Kazakh 🇰đŸ‡ŋ | Khmer 🇰🇭 | Korean 🇰🇷 | -| Lao 🇱đŸ‡Ļ | Latin đŸ‡ģđŸ‡Ļ | Latvian 🇱đŸ‡ģ | Lingala 🇨🇩 | -| Lithuanian 🇱🇹 | Luxembourgish 🇱đŸ‡ē | Macedonian 🇲🇰 | Malagasy 🇲đŸ‡Ŧ | -| Malay 🇲🇾 | Malayalam đŸ‡ŽđŸ‡ŗ | Maltese 🇲🇹 | Maori đŸ‡ŗđŸ‡ŋ | -| Marathi đŸ‡ŽđŸ‡ŗ | Moldavian 🇲🇩 | Mongolian đŸ‡˛đŸ‡ŗ | Myanmar 🇲🇲 | -| Nepali đŸ‡ŗđŸ‡ĩ | Norwegian đŸ‡ŗđŸ‡´ | Occitan đŸ‡Ģ🇷 | Panjabi đŸ‡ŽđŸ‡ŗ | -| Pashto đŸ‡ĻđŸ‡Ģ | Persian 🇮🇷 | Polish đŸ‡ĩ🇱 | Portuguese đŸ‡ĩ🇹 | -| Punjabi đŸ‡ŽđŸ‡ŗ | Romanian 🇷🇴 | Russian 🇷đŸ‡ē | Sanskrit đŸ‡ŽđŸ‡ŗ | -| Serbian 🇷🇸 | Shona đŸ‡ŋđŸ‡ŧ | Sindhi đŸ‡ĩ🇰 | Sinhala 🇱🇰 | -| Slovak 🇸🇰 | Slovenian 🇸🇮 | Somali 🇸🇴 | Spanish đŸ‡Ē🇸 | -| Sundanese 🇮🇩 | Swahili 🇰đŸ‡Ē | Swedish 🇸đŸ‡Ē | Tagalog đŸ‡ĩ🇭 | -| Tajik đŸ‡šđŸ‡¯ | Tamil đŸ‡ŽđŸ‡ŗ | Tatar 🇷đŸ‡ē | Telugu đŸ‡ŽđŸ‡ŗ | -| Thai 🇹🇭 | Tibetan đŸ‡¨đŸ‡ŗ | Turkish 🇹🇷 | Turkmen 🇹🇲 | -| Ukrainian đŸ‡ēđŸ‡Ļ | Urdu đŸ‡ĩ🇰 | Uzbek đŸ‡ēđŸ‡ŋ | Vietnamese đŸ‡ģe | -| Welsh đŸ´ķ §ķ ĸķ ˇķ Ŧķ ŗķ ŋ | Yiddish 🇮🇱 | Yoruba đŸ‡ŗđŸ‡Ŧ | | - -
- ---- - -## 🔧 Troubleshooting - -
-đŸ”Ĩ App crashes on start -
-The underlying engine requires standard C++ libraries. Install the Microsoft Visual C++ Redistributable (2015-2022). -
-
- -
-🐌 "Simulate Typing" is slow -
-Some apps (games, RDP) can't handle supersonic input. Go to Settings and lower the Typing Speed to ~1200 CPM. -
-
- -
-🎤 No Audio / Silence -
-The agent listens to the Default Communication Device. Ensure your microphone is set correctly in Windows Sound Settings. -
-
+Afrikaans, Albanian, Amharic, Arabic, Armenian, Assamese, Azerbaijani, Bashkir, Basque, Belarusian, Bengali, Bosnian, Breton, Bulgarian, Burmese, Castilian, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Faroese, Finnish, Flemish, French, Galician, Georgian, German, Greek, Gujarati, Haitian, Hausa, Hawaiian, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Lao, Latin, Latvian, Lingala, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Maori, Marathi, Moldavian, Mongolian, Myanmar, Nepali, Norwegian, Occitan, Panjabi, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Sanskrit, Serbian, Shona, Sindhi, Sinhala, Slovak, Slovenian, Somali, Spanish, Sundanese, Swahili, Swedish, Tagalog, Tajik, Tamil, Tatar, Telugu, Thai, Tibetan, Turkish, Turkmen, Ukrainian, Urdu, Uzbek, Vietnamese, Welsh, Yiddish, Yoruba. ---