vOOice

Author	SHA1	Message	Date
Jake Shore	de1c1e51aa	Add hybrid streaming transcription for improved accuracy - Implement real-time streaming preview using Parakeet EOU (160ms chunks) - Add batch transcription on completion for accurate final result - Prefer Whisper large-v3-turbo (2.7% WER) over Parakeet (6.05% WER) when available - Remove audio preprocessing that hurts ASR accuracy (gain control, noise reduction) - Add streaming audio callback support in Recorder and CoreAudioRecorder - Raw audio passthrough - SDK handles resampling internally Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>	2026-01-16 07:35:53 -05:00
Beingpax	5d6a2208cf	Update selectedtextkit package	2025-12-21 22:48:18 +05:45
Beingpax	078e02c503	Parakeet model validation & Intel mac local model warning	2025-12-17 09:41:57 +05:45
Beingpax	1e612d9987	Support org.nspasteboard conventions for transient clipboard handlin	2025-11-28 13:03:30 +05:45
Deborah Mangan	2840bf08c2	chore: Update FluidAudio to v0.7.8 Update FluidAudio from v0.7.7 (2dd0bd1) to v0.7.8 (8136bd0) Performance improvements: - 5% faster ASR inference - 10% fewer missing words on long audio files - 0.5% improved WER for v2 and v3 models Stability improvements: - Fixed ANE concurrency crashes (<3% latency impact) - Switched ASR to stateless for better batching support - Improved concurrency safety This is a backward-compatible update with no API breaking changes. No code changes required - all existing Parakeet integration works as-is. Full changelog: https://github.com/FluidInference/FluidAudio/compare/v0.7.7...v0.7.8 Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2025-11-05 13:57:44 +10:00
Beingpax	7c86ac6928	Added Swift atomics dependency.	2025-10-31 11:31:25 +05:45
Beingpax	ea0e5d840c	Rewrote the default prompts to make them simpler and better.	2025-10-29 23:27:49 +05:45
Beingpax	357804c03d	feat: Integrate SelectedTextKit for robust text retrieval with multi-strategy support	2025-10-28 18:30:25 +05:45
Anton Lvovych	cc086c1d92	Update FluidAudio to latest with ESpeakNG framework fix - Update FluidAudio to f47209a which includes ESpeakNG framework fix - Fix VadConfig API compatibility: threshold -> defaultThreshold - Add WorkspaceSettings to allow FluidAudio's unsafe build flags This resolves the dyld crash: "Library not loaded: ESpeakNG.framework/Versions/A/ESpeakNG" Fixed upstream in FluidInference/FluidAudio#159 and FluidInference/FluidAudio#160 Tested: All 4 UI tests pass	2025-10-27 15:16:40 +07:00
Beingpax	cacd38f339	Fix SwiftData pagination race by isolating on MainActor	2025-10-24 22:31:47 +05:45
Beingpax	a0e4dd1367	Added support for V2 and V3 models	2025-10-19 14:01:31 +05:45
Beingpax	97c6234fb3	Respect VAD flag, downloading & updated to latest version	2025-09-20 17:00:28 +05:45
Beingpax	91734bda45	Native Fluid Audio VAD	2025-09-19 19:24:02 +05:45
Beingpax	12c850f77b	Open With and title cleanup	2025-09-05 11:22:14 +05:45
Brandon Weng	b052fb491f	Bump FluidAudio	2025-08-29 12:26:05 -04:00
Beingpax	68e8a21746	Clipboard context with their own tags	2025-08-26 16:55:01 +05:45
Beingpax	6a308b81bf	Update app to support Parakeet B3 model	2025-08-25 13:00:35 +05:45
Beingpax	3eebbc4e3b	Better Parakeet error handling	2025-08-03 12:44:13 +05:45
Beingpax	29722d0a31	more logging in parakeettranscription service	2025-08-03 09:35:49 +05:45
Beingpax	3c84ebdd76	Update README	2025-08-02 10:23:29 +05:45
Beingpax	d09a9fba7f	Experimental new models	2025-08-01 17:26:08 +05:45
Beingpax	50c7b9a354	Support for playing/pausing media during recording	2025-07-27 10:23:36 +05:45
Beingpax	21417711e5	removed VideoCTA	2025-04-19 18:02:53 +05:45
Beingpax	9fb36eddea	Fix license validation bug	2025-04-18 17:59:48 +05:45
Beingpax	76a154706c	Initial commit: Open-sourcing VoiceInk	2025-02-22 11:52:41 +05:45

25 Commits