77 Commits

Author SHA1 Message Date
Beingpax
e5e194de5f v1.61 update 2025-11-08 12:00:45 +05:45
Deborah Mangan
2840bf08c2 chore: Update FluidAudio to v0.7.8
Update FluidAudio from v0.7.7 (2dd0bd1) to v0.7.8 (8136bd0)

Performance improvements:
- 5% faster ASR inference
- 10% fewer missing words on long audio files
- 0.5% improved WER for v2 and v3 models

Stability improvements:
- Fixed ANE concurrency crashes (<3% latency impact)
- Switched ASR to stateless for better batching support
- Improved concurrency safety

This is a backward-compatible update with no API breaking changes.
No code changes required - all existing Parakeet integration works as-is.

Full changelog: https://github.com/FluidInference/FluidAudio/compare/v0.7.7...v0.7.8

Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>
2025-11-05 13:57:44 +10:00
Beingpax
7c86ac6928 Added Swift atomics dependency. 2025-10-31 11:31:25 +05:45
Beingpax
ea0e5d840c Rewrote the default prompts to make them simpler and better. 2025-10-29 23:27:49 +05:45
Beingpax
615a0c4fad Isolate whisper.cpp dependency build outside the project. 2025-10-29 11:59:49 +05:45
Beingpax
357804c03d feat: Integrate SelectedTextKit for robust text retrieval with multi-strategy support 2025-10-28 18:30:25 +05:45
Anton Lvovych
cc086c1d92 Update FluidAudio to latest with ESpeakNG framework fix
- Update FluidAudio to f47209a which includes ESpeakNG framework fix
- Fix VadConfig API compatibility: threshold -> defaultThreshold
- Add WorkspaceSettings to allow FluidAudio's unsafe build flags

This resolves the dyld crash: "Library not loaded: ESpeakNG.framework/Versions/A/ESpeakNG"
Fixed upstream in FluidInference/FluidAudio#159 and FluidInference/FluidAudio#160

Tested: All 4 UI tests pass
2025-10-27 15:16:40 +07:00
Beingpax
cacd38f339 Fix SwiftData pagination race by isolating on MainActor 2025-10-24 22:31:47 +05:45
Beingpax
a0e4dd1367 Added support for V2 and V3 models 2025-10-19 14:01:31 +05:45
Beingpax
29d139285a Update it to version 1.59 2025-10-10 22:37:35 +05:45
Beingpax
b7627c9deb Remove screen capture toggle gating 2025-10-06 17:26:38 +05:45
Beingpax
7dcf982b60 Updated to version 1.57 2025-09-24 21:08:22 +05:45
Beingpax
97c6234fb3 Respect VAD flag, downloading & updated to latest version 2025-09-20 17:00:28 +05:45
Beingpax
91734bda45 Native Fluid Audio VAD 2025-09-19 19:24:02 +05:45
Beingpax
abaa54eb02 Fix build number 2025-09-16 10:43:50 +05:45
Beingpax
e6225050cc Prefer enhanced text with graceful fallback: paste enhanced falls back to original; copy last transcription now mirrors same behavior; updated Settings description to reflect single-shortcut-friendly design. 2025-09-16 10:30:58 +05:45
Beingpax
12c850f77b Open With and title cleanup 2025-09-05 11:22:14 +05:45
Brandon Weng
b052fb491f Bump FluidAudio 2025-08-29 12:26:05 -04:00
Beingpax
68e8a21746 Clipboard context with their own tags 2025-08-26 16:55:01 +05:45
Beingpax
f0ddbed5e8 Update to version 1.5 2025-08-25 14:09:17 +05:45
Beingpax
6a308b81bf Update app to support Parakeet B3 model 2025-08-25 13:00:35 +05:45
Beingpax
889162633c remove api header 2025-08-18 19:48:09 +05:45
Beingpax
a8db010900 Added support for adding fine-tuned local whisper models. 2025-08-13 14:04:34 +05:45
Beingpax
6935f58737 Recorder error logging. 2025-08-12 21:05:53 +05:45
Beingpax
1b4f09731b 1.50 2025-08-12 10:30:44 +05:45
Beingpax
0b8eb71048 Enable native Apple Speech Analyzer in build 2025-08-11 09:07:29 +05:45
Danny Ricciotti
2c12003b84 Update project.pbxproj 2025-08-10 18:59:53 -04:00
Beingpax
b14bf8733e Added tip jar 2025-08-08 21:34:12 +05:45
Beingpax
4a14b7cd52 1.47 update 2025-08-08 09:02:22 +05:45
Beingpax
d363808a7b revert copy to clipboard 2025-08-07 10:09:02 +05:45
Beingpax
f7987cd4bd Redesign recorder views with reusable components 2025-08-06 13:10:21 +05:45
Beingpax
7c74f57b8f update version 2025-08-03 13:03:16 +05:45
Beingpax
3eebbc4e3b Better Parakeet error handling 2025-08-03 12:44:13 +05:45
Beingpax
29722d0a31 more logging in parakeettranscription service 2025-08-03 09:35:49 +05:45
Beingpax
cfd09ed36b Add correct model name for Qwen-3-235b-a22b-thinking-250 2025-08-02 12:24:28 +05:45
Beingpax
3c84ebdd76 Update README 2025-08-02 10:23:29 +05:45
Beingpax
d09a9fba7f Experimental new models 2025-08-01 17:26:08 +05:45
Beingpax
50c7b9a354 Support for playing/pausing media during recording 2025-07-27 10:23:36 +05:45
Palash Oswal
e8d49b1797 Update identifiers to reflect latest release 2025-07-25 11:56:53 -04:00
Beingpax
f9adfd5269 cerebras AI provider integration 2025-07-23 21:08:14 +05:45
Beingpax
9968ac3dcc Add system info in analysis view 2025-07-11 19:11:34 +05:45
Beingpax
4a83c5400e Fix for paste-fail window interfering with paste operation 2025-07-04 05:38:47 +05:45
Beingpax
8c44efad64 Consistent background card 2025-06-30 11:27:45 +05:45
Beingpax
a0a1257e3b Add enhancement prompt support when power mode disabled 2025-06-22 23:37:03 +05:45
Beingpax
f7e160ac99 Fix cloud model default clearing 2025-06-15 23:46:53 +05:45
Beingpax
7aef8f04a6 Fix audio input mode persistence 2025-06-15 18:49:41 +05:45
Beingpax
b1524970c1 fix: power mode model selection & removed current model in favor of currenttranscriptionmodel 2025-06-10 23:23:43 +05:45
Beingpax
e2e6c3ac94 feat: Migrate Power Mode to support cloud models 2025-06-10 17:13:55 +05:45
Beingpax
d7c23f91d0 fix: Update menu bar icon to use AppIcon 2025-06-07 17:24:05 +05:45
Beingpax
7781ecc19b Improve AI trigger words and retry logic 2025-05-29 09:38:49 +05:45