This skill generates natural speech from text, supporting over 70 languages with multiple models that balance quality and latency. It is ideal for developers looking to convert written content into spoken language with fine-tuned voice characteristics.
$ npx skills add https://github.com/elevenlabs/skills --skill text-to-speechThis skill converts written text into natural-sounding speech using ElevenLabs' voice synthesis technology. It supports over 70 languages with six different models optimized for different use cases—from highest-quality emotional speech (eleven_v3) to ultra-low latency real-time applications (eleven_flash_v2_5). You can select from pre-built voices or create custom ones, and fine-tune output with stability, similarity boost, style, and speaker boost settings. The skill also offers request stitching to eliminate audio artifacts across long-form content, language enforcement for pronunciation control, text normalization for natural number and date reading, and multiple output formats including MP3, PCM, Opus, and telephony-standard μ-law. Developers building voice applications, voiceover systems, multilingual content platforms, or real-time conversational AI benefit from its flexible architecture and streaming capabilities.
Install the skill using the command provided and refer to the setup guide for additional details.
Generating audio for educational content
Creating voiceovers for videos
Transcribing meetings and events into spoken summaries
Developing interactive voice applications
$ npx skills add https://github.com/elevenlabs/skills --skill text-to-speechgit clone https://github.com/elevenlabs/skillsCopy the install command above and run it in your terminal.
Launch Claude Code, Cursor, or your preferred AI coding agent.
Use the prompt template or examples below to test the skill.
Adapt the skill to your specific use case and workflow.
Check the GitHub repository or documentation for usage examples.
transform text into lifelike voiceovers
Converts spoken words into summaries effortlessly
Transform text into high-quality, natural-sounding speech
High-accuracy voice AI models for transcription and translation
Transform text into natural, emotive speech across multiple Indian languages
Transform text into natural and smooth human voice
Take a free 3-minute scan and get personalized AI skill recommendations.
Take free scan