Voice system

Jarvis is designed to behave like a practical desktop voice assistant. It wakes on a spoken phrase, listens for a command, executes it, verifies the result, and responds in natural speech.

Wake words

The built-in wake phrases are "Hey Jarvis" and "Jarvis". When the wake loop detects a phrase, Jarvis does one of two things: if the phrase was spoken on its own, it acknowledges and listens for the next command; if a command follows the phrase in the same utterance, it executes that command directly.
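
The two outcomes can be sketched as a small parser over a recognized transcript. This is a minimal illustration under stated assumptions, not Jarvis's actual wake loop: the function name `split_wake_phrase` and its return shape are hypothetical.

```python
from typing import Optional, Tuple

# Longer phrase first so "Hey Jarvis" is not matched as plain "Jarvis".
WAKE_PHRASES = ("hey jarvis", "jarvis")


def split_wake_phrase(transcript: str) -> Tuple[bool, Optional[str]]:
    """Return (woke, command) for a transcript (hypothetical helper).

    woke is True when the transcript starts with a wake phrase.
    command is the text after the phrase, or None when only the wake
    phrase was spoken and the assistant should acknowledge and listen.
    """
    text = transcript.strip()
    for phrase in WAKE_PHRASES:
        if text[:len(phrase)].lower() != phrase:
            continue
        rest = text[len(phrase):]
        # Require a word boundary so "Jarvisa" does not count as a wake.
        if rest and rest[0].isalnum():
            continue
        command = rest.strip(" ,.!?")
        return True, command or None
    return False, None
```

For example, `split_wake_phrase("Hey Jarvis, open notepad")` yields a wake with the command `"open notepad"`, while `split_wake_phrase("Jarvis")` yields a wake with no command.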

Language support

The default speech languages are English and Malayalam. The UI exposes a language mode setting with three options: automatic detection, English, or Malayalam.
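
One way to picture the three modes is as a mapping from the selected mode to the language tags a recognizer should try. The sketch below is illustrative only: `LanguageMode`, `candidate_languages`, and the choice of the `en-IN` and `ml-IN` tags are assumptions, not names taken from the Jarvis codebase.

```python
from enum import Enum
from typing import List


class LanguageMode(Enum):
    """Hypothetical representation of the UI language mode."""
    AUTO = "auto"
    ENGLISH = "en"
    MALAYALAM = "ml"


# BCP 47 language tags; the specific English locale is an assumption.
_TAGS = {
    LanguageMode.ENGLISH: "en-IN",
    LanguageMode.MALAYALAM: "ml-IN",
}


def candidate_languages(mode: LanguageMode) -> List[str]:
    """Return the language tags to try for the given mode."""
    if mode is LanguageMode.AUTO:
        # Automatic detection considers every supported language.
        return list(_TAGS.values())
    return [_TAGS[mode]]
```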

Speech recognition providers

Provider	Use case
Google Speech Recognition	Default cloud fallback for quick local development.
Sarvam STT	Indic speech recognition when configured with a Sarvam key.
Local Whisper	Offline transcription when the Whisper package and model are installed.
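
The prerequisites in the table suggest a simple selection rule: use the preferred provider when its requirement is met, otherwise fall back to Google. The following is a rough sketch of that rule; the function name, the provider identifiers (`"google"`, `"sarvam"`, `"whisper"`), and the parameters are hypothetical and may not match how Jarvis is configured.

```python
from typing import Optional


def choose_stt_provider(
    preferred: str,
    sarvam_api_key: Optional[str] = None,
    whisper_ready: bool = False,
) -> str:
    """Pick a speech recognition provider name (hypothetical helper).

    Falls back to "google" whenever the preferred provider's
    prerequisite is missing.
    """
    if preferred == "sarvam" and sarvam_api_key:
        return "sarvam"
    # whisper_ready means the package and a model are both installed.
    if preferred == "whisper" and whisper_ready:
        return "whisper"
    return "google"
```

Falling back rather than raising keeps voice input usable during local development when a key or model is missing.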

Text to speech

Edge TTS is the default voice output. Sarvam TTS can be configured for Indian language voices. Generated audio files are written to backend runtime storage and are ignored by Git.

Permissions

Voice activation requires the `voice.listen` permission, and background wake mode requires the `voice.background` permission. If the required permission is disabled, Jarvis reports that the action is blocked instead of silently listening.
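
The permission gate can be sketched as a lookup that returns an explicit status before any audio is captured. Only the permission names `voice.listen` and `voice.background` come from this page; the action names, the `check_voice_action` function, and the message format are assumptions made for illustration.

```python
from typing import Mapping

# Maps a hypothetical action name to the permission it requires.
REQUIRED_PERMISSION = {
    "listen": "voice.listen",
    "background_wake": "voice.background",
}


def check_voice_action(action: str, permissions: Mapping[str, bool]) -> str:
    """Return "ok" or an explicit blocked message for the action."""
    needed = REQUIRED_PERMISSION[action]
    # A missing entry is treated as disabled.
    if not permissions.get(needed, False):
        return f"Blocked: '{action}' requires the {needed} permission."
    return "ok"
```

Returning a message rather than doing nothing mirrors the behavior described above: the user is told the action was blocked.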

Troubleshooting

Check Windows microphone permissions.
Confirm the backend has access to the selected audio device.
Disable background listening in Safe Mode if you want manual control.
If Local Whisper is selected, install the package and download a model first.
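
For the Local Whisper item, a quick way to confirm the package is visible to the backend's Python environment is to look up its import spec. This is a small sketch using only the standard library. The default module name `whisper` is an assumption about which Whisper package is in use, and the check does not verify that a model has been downloaded.

```python
import importlib.util


def module_installed(module_name: str = "whisper") -> bool:
    """Return True when the module can be found in this environment."""
    # find_spec returns None for a top-level module that is not installed.
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    status = "found" if module_installed() else "missing"
    print(f"whisper package: {status}")
```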

Created by Jojin John

JX Jarvis is created by Jojin John.