This change introduces a standalone Voice Activity Detection (VAD) service and integrates it into the ParakeetTranscriptionService.
The VAD preprocesses the audio to remove silent segments, aiming to improve transcription accuracy.
This is considered experimental due to a discovered anomaly in the Swift/C bridge where timestamps were being multiplied by 100. A workaround has been implemented to correct this.