12 Commits

Author SHA1 Message Date
Beingpax
5eacee467a Feat: Respect VAD user setting in ParakeetTranscriptionService 2025-09-06 08:57:32 +05:45
Beingpax
c0ed2dc78a Improved VAD for Parakeet model 2025-09-06 07:13:06 +05:45
Beingpax
106fd653ea feat: Integrate experimental VAD for Parakeet
This change introduces a standalone Voice Activity Detection (VAD) service and integrates it into the ParakeetTranscriptionService.

The VAD preprocesses the audio to remove silent segments, aiming to improve transcription accuracy.

This is considered experimental due to a discovered anomaly in the Swift/C bridge where timestamps were being multiplied by 100. A workaround has been implemented to correct this.
2025-09-05 18:37:16 +05:45
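
The two ideas in the commit message above (dropping silent audio before transcription, and undoing the 100x timestamp scaling) can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: `SpeechSegment`, `correctedSegments`, and `removeSilence` are hypothetical names, and it assumes the VAD reports segments in seconds and that dividing by 100 is the appropriate correction.

```swift
// Hypothetical type: one span of detected speech, in seconds (assumed unit).
struct SpeechSegment: Equatable {
    var start: Double
    var end: Double
}

// Workaround sketch: undo the assumed x100 scaling applied to timestamps
// coming back across the Swift/C bridge.
func correctedSegments(_ raw: [SpeechSegment], scale: Double = 100.0) -> [SpeechSegment] {
    raw.map { SpeechSegment(start: $0.start / scale, end: $0.end / scale) }
}

// Keep only the samples that fall inside speech segments, dropping silence.
func removeSilence(from samples: [Float],
                   sampleRate: Double,
                   keeping segments: [SpeechSegment]) -> [Float] {
    var output: [Float] = []
    for segment in segments {
        // Clamp indices to the buffer so out-of-range timestamps are ignored.
        let lower = max(0, Int(segment.start * sampleRate))
        let upper = min(samples.count, Int(segment.end * sampleRate))
        if lower < upper {
            output.append(contentsOf: samples[lower..<upper])
        }
    }
    return output
}
```

For example, with three seconds of 16 kHz audio and a raw segment reported as 50 to 150, the corrected segment is 0.5 s to 1.5 s, and `removeSilence` keeps the 16,000 samples in that range. Clamping to the buffer bounds means a segment left uncorrected would be truncated or skipped rather than crash.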
Brandon Weng
95061cda40 spacing 2025-08-27 14:51:36 -04:00
Brandon Weng
620b3a8d3b Remove cleanup state 2025-08-27 14:50:09 -04:00
Brandon Weng
2ea220dfed use default configs from upstream for parakeet 2025-08-27 14:48:07 -04:00
Beingpax
9e29b34db1 Fix decoder state cleanup blocking transcription start with Parakeet model 2025-08-25 13:50:07 +05:45
Beingpax
6a308b81bf Update app to support Parakeet B3 model 2025-08-25 13:00:35 +05:45
Beingpax
3eebbc4e3b Better Parakeet error handling 2025-08-03 12:44:13 +05:45
Beingpax
29722d0a31 more logging in parakeettranscription service 2025-08-03 09:35:49 +05:45
Beingpax
b5eaf647db 🦜 Add Parakeet logging 2025-08-02 21:26:37 +05:45
Beingpax
d09a9fba7f Experimental new models 2025-08-01 17:26:08 +05:45