190 Commits

Author SHA1 Message Date
Alexey Haidamaka
8293bc27b7
Merge branch 'main' into feat/paste-last-enhancement 2025-09-12 21:49:17 +02:00
Alexey Haidamaka
154368e616 Added capability to paste last enhancement 2025-09-12 20:03:19 +02:00
Beingpax
0429af6678 Use whisper module in VoiceActivityDetector to resolve calling-convention conflict with whisper_vad_default_params; no functional changes 2025-09-12 14:13:31 +05:45
Beingpax
7161bc3f71 Improved the comma seperated replacement values to be consolidated 2025-09-12 11:58:26 +05:45
Beingpax
f1fb2168c2 Added support for comma-separated words for word replacement. 2025-09-12 11:21:00 +05:45
Prakash Joshi Pax
8c985c31ea
Merge pull request #290 from n0an/updateGroqModel
Update Groq Kimi K2 model name
2025-09-12 10:14:44 +05:45
Beingpax
2a4f48ac0d Optimize recorder startup by backgrounding media controls 2025-09-12 10:14:10 +05:45
Anton Novoselov
574e381b0a Update Groq Kimi K2 model name 2025-09-11 09:42:10 +01:00
Beingpax
53d1507a53 improve hallucination filter and integrate with parakeet transcription service 2025-09-06 16:56:28 +05:45
Beingpax
95e57bf3f7 Added support for retry functionality 2025-09-06 09:36:55 +05:45
Beingpax
5eacee467a Feat: Respect VAD user setting in ParakeetTranscriptionService 2025-09-06 08:57:32 +05:45
Beingpax
c0ed2dc78a Improved VAD for Parakeet model 2025-09-06 07:13:06 +05:45
Beingpax
106fd653ea feat: Integrate experimental VAD for Parakeet
This change introduces a standalone Voice Activity Detection (VAD) service and integrates it into the ParakeetTranscriptionService.

The VAD preprocesses the audio to remove silent segments, aiming to improve transcription accuracy.

This is considered experimental due to a discovered anomaly in the Swift/C bridge where timestamps were being multiplied by 100. A workaround has been implemented to correct this.
2025-09-05 18:37:16 +05:45
Beingpax
12c850f77b Open With and title cleanup 2025-09-05 11:22:14 +05:45
Alexey Haidamaka
c06d01f61e Handle retry for API rate limit errors 2025-09-04 11:10:54 +02:00
Prakash Joshi Pax
26362bb50e
Merge pull request #263 from gdmka/feature/ai-enhancement-retry-logic
Implement Retry Logic for AI Enhancement Service
2025-08-28 09:00:01 +05:45
Alexey Haidamaka
c83afac031 feat: Add retry logic to AI enhancement service 2025-08-28 03:34:12 +02:00
Brandon Weng
95061cda40 spacing 2025-08-27 14:51:36 -04:00
Brandon Weng
620b3a8d3b Remove cleanup state 2025-08-27 14:50:09 -04:00
Brandon Weng
2ea220dfed use default configs from upstream for parakeet 2025-08-27 14:48:07 -04:00
Beingpax
2aac1020f8 Added support for preserving the prompt used during enhancement on 2025-08-26 21:34:54 +05:45
Prakash Joshi Pax
6980857750
Merge pull request #258 from gdmka/gdmka-mistral-medium-latest
added mistral-medium-latest
2025-08-26 20:57:30 +05:45
Beingpax
b3257eef61 Fixed license validation and activation issue due to missing API header on 2025-08-26 20:51:29 +05:45
Alexey Haidamaka
d666343121
added mistral-medium-latest
https://docs.mistral.ai/getting-started/models/models_overview/#api-versioning

Medium offers a reasonable balance between Small and Large in output and speed. Especially now when Large is sometimes is frequently over capacity.

This is my go to model for Mistral.
2025-08-26 16:32:43 +02:00
Beingpax
68e8a21746 Clipboard context with their own tags 2025-08-26 16:55:01 +05:45
Beingpax
71fdbdfc75 Add obfuscated trial storage to prevent perpetual trial 2025-08-26 09:41:42 +05:45
Beingpax
9e29b34db1 Fix decoder state cleanup blocking transcription start with Parakeet model 2025-08-25 13:50:07 +05:45
Beingpax
6a308b81bf Update app to support Parakeet B3 model 2025-08-25 13:00:35 +05:45
Beingpax
2708cc502a Added support for Gemini models for transcription 2025-08-25 12:00:15 +05:45
Beingpax
49a3cd0f4a Add support for word replacement in non-space languages 2025-08-25 10:46:01 +05:45
Beingpax
405b7ed143 Add predefined dictionary words 2025-08-23 23:34:14 +05:45
Beingpax
edd3145bf3 Remove toggle from dictionary items 2025-08-23 23:04:08 +05:45
Beingpax
73bb3e765a Remove dictionary item dependency on local Whisper model 2025-08-23 22:15:13 +05:45
Beingpax
abe12d0dfe Fix AI assistant prompt using dictionary items as context 2025-08-23 19:32:36 +05:45
Beingpax
f13e559536 Add support for retrying the last transcription from menu bar 2025-08-23 17:16:17 +05:45
Beingpax
881957793f Log successful license validation responses 2025-08-22 10:00:39 +05:45
Beingpax
d39c660bca Logging for license validation issue 2025-08-22 02:13:09 +05:45
Beingpax
57b58ec80e Add dictionary context to AI enhancement 2025-08-19 21:48:22 +05:45
Beingpax
4ddd35504a Logging for License Verification 2025-08-19 15:00:39 +05:45
Beingpax
3c84bcf757 Logging of failed API verification and adding GPT‑4 models again. 2025-08-19 09:57:32 +05:45
Beingpax
889162633c remove api header 2025-08-18 19:48:09 +05:45
Beingpax
08657224d4 Add gpt-oss from groq 2025-08-18 17:00:40 +05:45
Beingpax
36fa030d12 Fix audio processing overflow error 2025-08-14 14:22:28 +05:45
Beingpax
8d319aa99a Fix Mistral enhancement endpoint 2025-08-14 09:26:41 +05:45
Beingpax
79ba95ccad Add announcement service for important notice/updates 2025-08-13 17:26:30 +05:45
Beingpax
a8db010900 Added support for adding fine-tuned local whisper models. 2025-08-13 14:04:34 +05:45
Beingpax
a3c302b50b Update TranscriptCleanup with interval settings 2025-08-13 09:07:08 +05:45
Beingpax
0b8eb71048 Enable native Apple Speech Analyzer in build 2025-08-11 09:07:29 +05:45
Danny Ricciotti
e2b04a2ac3 Fix to build on Mac OS15 2025-08-10 18:58:16 -04:00
Beingpax
0e2d9017c6 Move pause media feature inside an experimental section 2025-08-10 18:48:48 +05:45