voicemode - Claude MCP Skill
Voice interaction for Claude Code. Use when users mention voice mode, speak, talk, converse, voice status, or voice troubleshooting.
## First-Time Setup
If VoiceMode isn't working or MCP fails to connect, run:
```
/voicemode:install
```
After install, reconnect MCP: `/mcp` → select voicemode → "Reconnect" (or restart Claude Code).
---
# VoiceMode
Natural voice conversations with Claude Code using speech-to-text (STT) and text-to-speech (TTS).
**Note:** The Python package is `voice-mode` (hyphen), but the CLI command is `voicemode` (no hyphen).
## When to Use MCP vs CLI
| Task | Use | Why |
|------|-----|-----|
| Voice conversations | MCP `voicemode:converse` | Faster - server already running |
| Service start/stop | MCP `voicemode:service` | Works within Claude Code |
| Installation | CLI `voice-mode-install` | One-time setup |
| Configuration | CLI `voicemode config` | Edit settings directly |
| Diagnostics | CLI `voicemode diag` | Administrative tasks |
## Usage
Use the `converse` MCP tool to speak to users and hear their responses:
```python
# Speak and listen for response (most common usage)
voicemode:converse("Hello! What would you like to work on?")
# Speak without waiting (for narration while working)
voicemode:converse("Searching the codebase now...", wait_for_response=False)
```
For most conversations, just pass your message - defaults handle everything else.
| Parameter | Default | Description |
|-----------|---------|-------------|
| `message` | required | Text to speak |
| `wait_for_response` | true | Listen after speaking |
| `voice` | auto | TTS voice |
For all parameters, see [Converse Parameters](../../docs/reference/converse-parameters.md).
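To make the parameter table concrete, the sketch below builds the kind of Model Context Protocol `tools/call` request an MCP client would send for `converse`. It assumes the tool is registered under the name `converse` and that the three parameters above map directly to argument keys. `build_converse_request` is a hypothetical helper for illustration, not part of VoiceMode; a client such as Claude Code normally constructs this message for you.

```python
import json
from typing import Any, Optional


def build_converse_request(
    message: str,
    wait_for_response: bool = True,
    voice: Optional[str] = None,
    request_id: int = 1,
) -> dict[str, Any]:
    """Build a JSON-RPC 2.0 `tools/call` request for the converse tool.

    Hypothetical helper: the tool name and argument keys are assumed
    from the parameter table, not taken from VoiceMode's source.
    """
    if not message:
        raise ValueError("message is required")
    arguments: dict[str, Any] = {
        "message": message,
        "wait_for_response": wait_for_response,
    }
    # Leave `voice` out so the server can auto-select one (the documented default).
    if voice is not None:
        arguments["voice"] = voice
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "converse", "arguments": arguments},
    }


if __name__ == "__main__":
    request = build_converse_request(
        "Searching the codebase now...", wait_for_response=False
    )
    print(json.dumps(request, indent=2))
```

Omitting `voice` from the arguments, rather than sending a placeholder, keeps the request in line with the advice to let VoiceMode pick a voice unless the user has asked for one.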
## Best Practices
1. **Narrate without waiting** - Use `wait_for_response=False` when announcing actions
2. **One question at a time** - Don't bundle multiple questions in voice mode
3. **Check status first** - Verify services are running before starting conversations
4. **Let VoiceMode auto-select** - Don't hardcode providers unless the user has a preference
5. **First run is slow** - Models download on first start (2-5 min); after that, startup is near-instant
## Check Status
```bash
voicemode service status # All services
voicemode service status whisper # Specific service
```
Shows service status including running state, ports, and health.
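If you want to run the same check from a script, a minimal sketch follows. It only wraps the `voicemode service status` command shown above. The helper names `status_command` and `run_status` are hypothetical, and the code makes no assumption about the format of the command's output.

```python
import shutil
import subprocess
from typing import Optional


def status_command(service: Optional[str] = None) -> list[str]:
    """Return the argv for `voicemode service status [service]`."""
    argv = ["voicemode", "service", "status"]
    if service:
        argv.append(service)
    return argv


def run_status(service: Optional[str] = None, timeout: float = 30.0) -> Optional[str]:
    """Run the status command and return its combined stdout and stderr.

    Returns None when the `voicemode` executable is not on PATH.
    """
    if shutil.which("voicemode") is None:
        return None
    result = subprocess.run(
        status_command(service),
        capture_output=True,
        text=True,
        timeout=timeout,
        check=False,  # a non-zero exit is reported via the output, not an exception
    )
    return result.stdout + result.stderr


if __name__ == "__main__":
    output = run_status("whisper")
    if output is None:
        print("voicemode is not installed or not on PATH")
    else:
        print(output)
```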
## Installation
```bash
# Install VoiceMode CLI and configure services
uvx voice-mode-install --yes
# Install local services (Apple Silicon recommended)
voicemode service install whisper
voicemode service install kokoro
```
See [Getting Started](../../docs/tutorials/getting-started.md) for detailed steps.
## Service Management
```python
# Start/stop services
voicemode:service("whisper", "start")
voicemode:service("kokoro", "start")
# View logs for troubleshooting
voicemode:service("whisper", "logs", lines=50)
```
| Service | Port | Purpose |
|---------|------|---------|
| whisper | 2022 | Speech-to-text |
| kokoro | 8880 | Text-to-speech |
| livekit | 7880 | Real-time rooms |
**Actions:** status, start, stop, restart, logs, enable, disable
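As a quick cross-check on the port table, the sketch below tests whether anything is accepting TCP connections on each listed port. It assumes the services listen on `127.0.0.1` with the default ports shown; `is_port_open` and `probe_services` are hypothetical helpers. An open port only shows that something is listening, so treat `voicemode service status` as the authoritative health check.

```python
import socket

# Default ports copied from the service table; adjust if you changed them.
SERVICE_PORTS = {"whisper": 2022, "kokoro": 8880, "livekit": 7880}


def is_port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts.
        return False


def probe_services(host: str = "127.0.0.1") -> dict[str, bool]:
    """Map each service name to whether its port accepts connections."""
    return {name: is_port_open(host, port) for name, port in SERVICE_PORTS.items()}


if __name__ == "__main__":
    for name, listening in probe_services().items():
        state = "listening" if listening else "not reachable"
        print(f"{name:8s} port {SERVICE_PORTS[name]}: {state}")
```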
## Configuration
```bash
voicemode config list # Show all settings
voicemode config set VOICEMODE_TTS_VOICE nova # Set default voice
voicemode config edit # Edit config file
```
Config file: `~/.voicemode/voicemode.env`
See [Configuration Guide](../../docs/guides/configuration.md) for all options.
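To inspect the config file from a script, the sketch below reads it as simple `KEY=VALUE` lines. That layout is an assumption based on the `.env` extension and is not confirmed here; `parse_env` is a hypothetical helper, and `voicemode config list` remains the supported way to view settings.

```python
from pathlib import Path

CONFIG_PATH = Path.home() / ".voicemode" / "voicemode.env"


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines, # comments, and lines without '='."""
    settings: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        value = value.strip()
        # Strip one pair of matching surrounding quotes, if present.
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        settings[key.strip()] = value
    return settings


if __name__ == "__main__":
    if CONFIG_PATH.exists():
        for key, value in parse_env(CONFIG_PATH.read_text()).items():
            print(f"{key}={value}")
    else:
        print(f"No config file at {CONFIG_PATH}")
```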
## DJ Mode
Background music during VoiceMode sessions with track-level control.
```bash
mpv-dj mfp 49 # Play Music For Programming episode
mpv-dj status # What's playing
mpv-dj next # Skip to next track
mpv-dj stop # Stop playback
```
**Configuration:** Set `VOICEMODE_DJ_VOLUME` in `~/.voicemode/voicemode.env` to customize startup volume (default: 50%).
See [DJ Mode docs](docs/dj-mode/README.md) for full documentation.
## CLI Cheat Sheet
```bash
# Service management
voicemode service status # All services
voicemode service start whisper # Start a service
voicemode service logs kokoro # View logs
# Diagnostics
voicemode deps # Check dependencies
voicemode diag info # System info
voicemode diag devices # Audio devices
# History search
voicemode history search "keyword"
voicemode history play <exchange_id>
# DJ Mode
mpv-dj play <file|url> # Start playback
mpv-dj status # What's playing
mpv-dj next/prev # Navigate chapters
mpv-dj stop # Stop playback
```
## Documentation Index
| Topic | Link |
|-------|------|
| Converse Parameters | [All Parameters](../../docs/reference/converse-parameters.md) |
| Installation | [Getting Started](../../docs/tutorials/getting-started.md) |
| Configuration | [Configuration Guide](../../docs/guides/configuration.md) |
| Claude Code Plugin | [Plugin Guide](../../docs/guides/claude-code-plugin.md) |
| Whisper STT | [Whisper Setup](../../docs/guides/whisper-setup.md) |
| Kokoro TTS | [Kokoro Setup](../../docs/guides/kokoro-setup.md) |
| Pronunciation | [Pronunciation Guide](../../docs/guides/pronunciation.md) |
| Troubleshooting | [Troubleshooting](../../docs/troubleshooting/index.md) |
| CLI Reference | [CLI Docs](../../docs/reference/cli.md) |
| DJ Mode | [Background Music](docs/dj-mode/README.md) |
Information
- Repository: mbailey/voicemode
- Author: mbailey
- Last Sync: 1/18/2026
- Repo Updated: 1/18/2026
- Created: 1/14/2026