Feat: Integrated Local LLM (Llama 3.2 1B) for Intelligent Correction -- New Core: Added LLMEngine utilizing llama-cpp-python for local private text post-processing. -- Forensic Protocol: Engineered strict system prompts to prevent LLM refusals, censorship, or assistant chatter. -- Three Modes: Grammar, Standard, Rewrite. -- Start/Stop Logic: Consolidated conflicting recording methods. -- Hotkeys: Added dedicated F9 (Correct) vs F8 (Transcribe). -- UI: Updated Settings. -- Build: Updated portable_build.py. -- Docs: Updated README.
This commit is contained in:
18
README.md
18
README.md
@@ -68,14 +68,20 @@ At its core, Whisper Voice is the ultimate bridge between thought and text. It l
|
||||
### Workflow: `F9 (Default)`
|
||||
The primary channel for native-language transcription. It transcribes precisely what it hears in the language you speak (or the one you've locked in Settings).
|
||||
|
||||
### ✨ Style Prompting (New in v1.0.2)
|
||||
Whisper Voice replaces traditional "grammar correction models" with a native **Style Prompting** engine. By injecting a specific "pre-prompt" into the model's context window, we can guide its internal style without external post-processing.
|
||||
### 🧠 Intelligent Correction (New in v1.1.0)
|
||||
Whisper Voice now integrates a local **Llama 3.2 1B** LLM to act as a "Silent Consultant". It post-processes transcripts to fix grammar or polish style without effectively "chatting" back.
|
||||
|
||||
* **Standard (Default)**: Forces the model to use full sentences, proper capitalization, and periods. Ideal for dictation.
|
||||
* **Casual**: Encourages a relaxed, lowercase style (e.g., "no way that's crazy lol").
|
||||
* **Custom**: Allows you to seed the model with your own context (e.g., "Here is a list of medical terms:").
|
||||
It is strictly trained on a **Forensic Protocol**: it will never lecture you, never refuse to process explicit language, and never sanitize your words. Your profanity is yours to keep.
|
||||
|
||||
This approach incurs **zero latency penalty** and **zero extra VRAM** usage.
|
||||
#### Correction Modes:
|
||||
* **Standard (Default)**: Fixes grammar, punctuation, and capitalization while keeping every word you said.
|
||||
* **Grammar Only**: Strictly fixes objective errors (spelling/agreement). Touches nothing else.
|
||||
* **Rewrite**: Polishes the flow and clarity of your sentences while explicitly preserving your original tone (Casual stays casual, Formal stays formal).
|
||||
|
||||
#### Supported Languages:
|
||||
The correction engine is optimized for **English, German, French, Italian, Portuguese, Spanish, Hindi, and Thai**. It also performs well on **Russian, Chinese, Japanese, and Romanian**.
|
||||
|
||||
This approach incurs a ~2s latency penalty but uses **zero extra VRAM** when in Low VRAM mode.
|
||||
|
||||
<br>
|
||||
|
||||
|
||||
Reference in New Issue
Block a user