89 Commits

Author SHA1 Message Date
Jake Shore
f6a613e8ef Use 320ms streaming chunks and fix bundle ID
- Switch from 160ms to 320ms chunk size for better streaming accuracy
- Fix corrupted bundle identifier (was "--com.jakeshore.VoiceInk-com.jakeshore.VoiceInk")

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-13 05:24:30 -05:00
Jake Shore
de1c1e51aa Add hybrid streaming transcription for improved accuracy
- Implement real-time streaming preview using Parakeet EOU (160ms chunks)
- Add batch transcription on completion for accurate final result
- Prefer Whisper large-v3-turbo (2.7% WER) over Parakeet (6.05% WER) when available
- Remove audio preprocessing that hurts ASR accuracy (gain control, noise reduction)
- Add streaming audio callback support in Recorder and CoreAudioRecorder
- Raw audio passthrough - SDK handles resampling internally

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-16 07:35:53 -05:00
Beingpax
652859414c v1.69 Update 2026-01-12 19:50:37 +05:45
Beingpax
c530367a04 Replace audio recorder with CoreAudio AUHAL
New low-level recorder targeting devices directly. Includes device switching during recording, enhanced logging (transport type, format, buffer), and log export feature.
2026-01-10 20:45:37 +05:45
Beingpax
072b57f365 Update to 1.67 2026-01-07 11:30:17 +05:45
Beingpax
5d6a2208cf Update selectedtextkit package 2025-12-21 22:48:18 +05:45
Beingpax
078e02c503 Parakeet model validation & Intel mac local model warning 2025-12-17 09:41:57 +05:45
Beingpax
557ac95478 1.64 update 2025-12-06 08:43:41 +05:45
Beingpax
ab9d6ad830 Add option to dismiss Affiliate promotion 2025-12-06 08:18:35 +05:45
Beingpax
c2666c3c24 Update to 1.63 2025-12-04 22:30:22 +05:45
Beingpax
88676362b4 update to v1.62 2025-11-29 08:49:08 +05:45
Beingpax
1e612d9987 Support org.nspasteboard conventions for transient clipboard handlin 2025-11-28 13:03:30 +05:45
Beingpax
e5e194de5f v1.61 update 2025-11-08 12:00:45 +05:45
Deborah Mangan
2840bf08c2 chore: Update FluidAudio to v0.7.8
Update FluidAudio from v0.7.7 (2dd0bd1) to v0.7.8 (8136bd0)

Performance improvements:
- 5% faster ASR inference
- 10% fewer missing words on long audio files
- 0.5% improved WER for v2 and v3 models

Stability improvements:
- Fixed ANE concurrency crashes (<3% latency impact)
- Switched ASR to stateless for better batching support
- Improved concurrency safety

This is a backward-compatible update with no API breaking changes.
No code changes required - all existing Parakeet integration works as-is.

Full changelog: https://github.com/FluidInference/FluidAudio/compare/v0.7.7...v0.7.8

Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>
2025-11-05 13:57:44 +10:00
Beingpax
7c86ac6928 Added Swift atomics dependency. 2025-10-31 11:31:25 +05:45
Beingpax
ea0e5d840c Rewrote the default prompts to make them simpler and better. 2025-10-29 23:27:49 +05:45
Beingpax
615a0c4fad Isolate whisper.cpp dependency build outside the project. 2025-10-29 11:59:49 +05:45
Beingpax
357804c03d feat: Integrate SelectedTextKit for robust text retrieval with multi-strategy support 2025-10-28 18:30:25 +05:45
Anton Lvovych
cc086c1d92
Update FluidAudio to latest with ESpeakNG framework fix
- Update FluidAudio to f47209a which includes ESpeakNG framework fix
- Fix VadConfig API compatibility: threshold -> defaultThreshold
- Add WorkspaceSettings to allow FluidAudio's unsafe build flags

This resolves the dyld crash: "Library not loaded: ESpeakNG.framework/Versions/A/ESpeakNG"
Fixed upstream in FluidInference/FluidAudio#159 and FluidInference/FluidAudio#160

Tested: All 4 UI tests pass
2025-10-27 15:16:40 +07:00
Beingpax
cacd38f339 Fix SwiftData pagination race by isolating on MainActor 2025-10-24 22:31:47 +05:45
Beingpax
a0e4dd1367 Added support for V2 and V3 models 2025-10-19 14:01:31 +05:45
Beingpax
29d139285a Update it to version 1.59 2025-10-10 22:37:35 +05:45
Beingpax
b7627c9deb Remove screen capture toggle gating 2025-10-06 17:26:38 +05:45
Beingpax
7dcf982b60 Updated to version 1.57 2025-09-24 21:08:22 +05:45
Beingpax
97c6234fb3 Respect VAD flag, downloading & updated to latest version 2025-09-20 17:00:28 +05:45
Beingpax
91734bda45 Native Fluid Audio VAD 2025-09-19 19:24:02 +05:45
Beingpax
abaa54eb02 Fix build no. 2025-09-16 10:43:50 +05:45
Beingpax
e6225050cc Prefer enhanced text with graceful fallback: paste enhanced falls back to original; copy last transcription now mirrors same behavior; updated Settings description to reflect single-shortcut-friendly design. 2025-09-16 10:30:58 +05:45
Beingpax
12c850f77b Open With and title cleanup 2025-09-05 11:22:14 +05:45
Brandon Weng
b052fb491f Bump FluidAudio 2025-08-29 12:26:05 -04:00
Beingpax
68e8a21746 Clipboard context with their own tags 2025-08-26 16:55:01 +05:45
Beingpax
f0ddbed5e8 Update to version 1.5 2025-08-25 14:09:17 +05:45
Beingpax
6a308b81bf Update app to support Parakeet B3 model 2025-08-25 13:00:35 +05:45
Beingpax
889162633c remove api header 2025-08-18 19:48:09 +05:45
Beingpax
a8db010900 Added support for adding fine-tuned local whisper models. 2025-08-13 14:04:34 +05:45
Beingpax
6935f58737 Recorder error loggin. 2025-08-12 21:05:53 +05:45
Beingpax
1b4f09731b 1.50 2025-08-12 10:30:44 +05:45
Beingpax
0b8eb71048 Enable native Apple Speech Analyzer in build 2025-08-11 09:07:29 +05:45
Danny Ricciotti
2c12003b84 Update project.pbxproj 2025-08-10 18:59:53 -04:00
Beingpax
b14bf8733e Added tip jar 2025-08-08 21:34:12 +05:45
Beingpax
4a14b7cd52 1.47 update 2025-08-08 09:02:22 +05:45
Beingpax
d363808a7b revert copy to clipboard 2025-08-07 10:09:02 +05:45
Beingpax
f7987cd4bd Redesign recorder views with reusable components 2025-08-06 13:10:21 +05:45
Beingpax
7c74f57b8f update version 2025-08-03 13:03:16 +05:45
Beingpax
3eebbc4e3b Better Parakeet error handling 2025-08-03 12:44:13 +05:45
Beingpax
29722d0a31 more logging in parakeettranscription service 2025-08-03 09:35:49 +05:45
Beingpax
cfd09ed36b Add correct model name for Qwen-3-235b-a22b-thinking-250 2025-08-02 12:24:28 +05:45
Beingpax
3c84ebdd76 Update README 2025-08-02 10:23:29 +05:45
Beingpax
d09a9fba7f Experimental new models 2025-08-01 17:26:08 +05:45
Beingpax
50c7b9a354 Support for playing/pausing media during recording 2025-07-27 10:23:36 +05:45