Prakash Joshi Pax
45dbd57722
Merge pull request #429 from klaudworks/fix/ocr-window-detection
...
Fix OCR capturing VoiceInk status overlay instead of frontmost app window
2025-12-07 12:27:31 +05:45
Beingpax
85e26856df
Optimize recorder startup
2025-12-07 09:26:52 +05:45
Nico Duldhardt
d25ae52333
Fix OCR capturing VoiceInk status overlay instead of frontmost app window
...
The screen capture service was selecting the first layer-0 window, which
during recording was VoiceInk's own status indicator overlay. This caused
OCR to always return 'No text detected' since the overlay has no readable
content.
Changes:
- Filter out windows owned by VoiceInk's process
- Prioritize windows belonging to NSWorkspace.frontmostApplication
- Refactor WindowCandidate struct to class scope
2025-12-07 01:05:00 +01:00
Nico Duldhardt
df2a8d9901
remove check for windows < 120x120
2025-12-07 00:52:52 +01:00
Nico Duldhardt
3b043f4da9
Fix OCR capturing VoiceInk status overlay instead of frontmost app window
...
The screen capture service was selecting the first layer-0 window, which
during recording was VoiceInk's own status indicator overlay. This caused
OCR to always return 'No text detected' since the overlay has no readable
content.
Changes:
- Filter out windows owned by VoiceInk's process
- Prioritize windows belonging to NSWorkspace.frontmostApplication
- Filter out tiny windows (<120x120) to avoid tooltips/overlays
- Move CGWindowListCopyWindowInfo off main thread for better UI responsiveness
- Refactor WindowCandidate struct to class scope
2025-12-07 00:23:04 +01:00
Beingpax
557ac95478
1.64 update
2025-12-06 08:43:41 +05:45
Beingpax
ab9d6ad830
Add option to dismiss Affiliate promotion
2025-12-06 08:18:35 +05:45
Beingpax
d4fb2125e6
Remove virtual and aggregate device filtering from isValidInputDevice
2025-12-05 19:15:13 +05:45
Beingpax
c2666c3c24
Update to 1.63
2025-12-04 22:30:22 +05:45
Beingpax
8cc83f84dc
Exclude virtual devices and aggregate devices for Audio input devices
2025-12-04 21:50:23 +05:45
Beingpax
feea0d7f2a
Fix race conditions due to audio device selection in menu bar
2025-12-04 21:24:36 +05:45
Beingpax
2e7112c06a
Remove disabled state when enhancement turned off
2025-12-04 21:17:00 +05:45
Beingpax
1ab91ad4ef
Add audio input device selection option in menu bar
2025-12-04 21:12:30 +05:45
Beingpax
8ce84e9a69
Add llama3.1-8b from Cerebras
2025-12-04 20:55:13 +05:45
Beingpax
ea64ff3109
Update AI enhancement models
2025-12-04 20:53:15 +05:45
Beingpax
b754f39d89
Remove text formatting during Paste operation
2025-12-04 19:06:20 +05:45
Beingpax
45a2b72bfd
Resolve fatal crash due to text insertion
2025-12-02 08:19:20 +05:45
Beingpax
88676362b4
update to v1.62
2025-11-29 08:49:08 +05:45
Beingpax
fca509952f
Make the soundfeedback row clickable
2025-11-28 20:45:24 +05:45
Beingpax
ac1a85c056
Improved text formatting during paste operation
2025-11-28 20:39:43 +05:45
Beingpax
1e612d9987
Support org.nspasteboard conventions for transient clipboard handlin
2025-11-28 13:03:30 +05:45
Beingpax
e16c84e1ed
Add accessibility permission check to prevent pop-up when calling SelectedTextService
2025-11-20 22:44:45 +05:45
Beingpax
b6068bc881
Show raw API error responses on key verification failure
2025-11-20 22:29:21 +05:45
Beingpax
cd503ac298
Refactor modifier key handling to use direct await calls
2025-11-20 20:08:19 +05:45
Beingpax
a762070f3d
Remove redundant codefiles
2025-11-18 15:36:12 +05:45
Beingpax
ae308377a0
Merge branch 'main' of https://github.com/Beingpax/VoiceInk
2025-11-18 15:14:50 +05:45
Beingpax
7feef2b374
Add transient marker and fix clipboard race condition
2025-11-18 15:14:09 +05:45
Prakash Joshi Pax
875a0995fa
Merge pull request #405 from Beingpax/custom-start-stop-sound
...
Custom start stop sound
2025-11-18 14:39:38 +05:45
Beingpax
8525392f03
Prevent deletion when re-selecting already stored custom sound
2025-11-18 14:32:21 +05:45
Beingpax
f445f847ef
update model display names
2025-11-17 10:15:48 +05:45
Beingpax
d07e5e4967
Update model display names
2025-11-17 10:15:10 +05:45
Beingpax
308098694d
Add support for scribe v2 model
2025-11-17 10:14:32 +05:45
Beingpax
ff0be0092b
Fix: Prevent abrupt stop of custom sound playback
2025-11-16 22:45:03 +05:45
Beingpax
7365493366
Feat: Add custom start/stop sounds and fix race condition
2025-11-16 22:42:28 +05:45
Beingpax
35a08dce7b
Add Keyboardshortcutlview
2025-11-14 15:33:22 +05:45
Beingpax
fb69db2fe9
Simplify the import logic for dictionary
2025-11-14 11:12:41 +05:45
Beingpax
7d12c50fcb
Rename dictionary context to custom vocabulary
2025-11-14 09:02:12 +05:45
Beingpax
9d3d1482b0
Remove prepopulated dictionary items
2025-11-14 08:25:52 +05:45
Beingpax
38028768e8
Update PR policy
2025-11-12 09:40:39 +05:45
Prakash Joshi Pax
96433ddc27
Update appcast.xml
2025-11-08 20:20:35 +05:45
Beingpax
ba68ec429c
Fix activation policy being reapplied unnecessarily during app switching
2025-11-08 20:11:37 +05:45
Beingpax
e5e194de5f
v1.61 update
2025-11-08 12:00:45 +05:45
Beingpax
c4f847d7cf
Update system instructions
2025-11-08 11:03:38 +05:45
Beingpax
692bd5f9d4
Update Sidebar
2025-11-07 23:58:44 +05:45
Beingpax
f559d19390
Revert "Merge pull request #362 from tmm22/feature/qol-documentation"
...
This reverts commit 5c55d12cbf5e2919ecf0cacb2e623bf17cd8dff6, reversing
changes made to 2f2f1bc894d2a3fe69d4c6e2d8ce0a76071fe1a8.
2025-11-07 22:42:37 +05:45
Prakash Joshi Pax
5c55d12cbf
Merge pull request #362 from tmm22/feature/qol-documentation
...
Quality of Life Improvements - Documentation & Source Files
2025-11-07 22:36:38 +05:45
Beingpax
2f2f1bc894
Fix text display order in transcription card
2025-11-07 22:35:19 +05:45
Beingpax
fcc7b4783f
Add reasoning parameter support for Gemini and OpenAI models
2025-11-07 22:29:44 +05:45
Beingpax
2e5c737255
Remove reasoning effort parameter for openAI and gemini models
2025-11-07 22:28:18 +05:45
Beingpax
cf8d821436
Implement Dictionary Import export support for iOS companion app(v1)
2025-11-07 11:31:41 +05:45