Commit Graph

2 Commits

Author SHA1 Message Date
Josh Hawkins
595f94e1c8 Audio transcription tweaks (#18540)
* use model runner

* unload whisper model when live transcription is complete
2025-08-07 20:53:13 -06:00
Josh Hawkins
8911568fdf Audio transcription support (#18398)
* install new packages for transcription support

* add config options

* audio maintainer modifications to support transcription

* pass main config to audio process

* embeddings support

* api and transcription post processor

* embeddings maintainer support for post processor

* live audio transcription with sherpa and faster-whisper

* update dispatcher with live transcription topic

* frontend websocket

* frontend live transcription

* frontend changes for speech events

* i18n changes

* docs

* mqtt docs

* fix linter

* use float16 and small model on gpu for real-time

* fix return value and use requestor to embed description instead of passing embeddings

* run real-time transcription in its own thread

* tweaks

* publish live transcriptions on their own topic instead of tracked_object_update

* config validator and docs

* clarify docs
2025-08-07 20:53:12 -06:00