Features

  • Universal Audio Capture Record mic, system audio, or any combination via BlackHole (macOS) or equivalent loopback drivers.

  • Manual Control Start/Stop buttons let you define exactly the boundaries of your prompt—perfect for long, multi-sentence queries.

  • One-Shot Transcription Whisper processes the entire recording in a single call—no fragmented sentences.

  • Live AI Streaming Gemini’s response appears token by token, just like in ChatGPT’s streaming interface.

  • Configurable Model Swap between Gemini variants via environment (no code changes).

  • Zero-Install UI Bootstrapping On first run voxai auto-installs the minimal Electron UI source—included in the PyPI package—so you only ever need pip install voxai and voxai.