The tts skill converts any text into speech audio using two backends (Kokoro local and Noiz cloud). It supports features like voice cloning and timeline-accurate audio synthesis for dubbing and video narration.
$ npx skills add https://github.com/noizai/skills --skill ttsThe tts skill converts text into natural-sounding speech audio using two backends: Kokoro for local, privacy-first processing, and Noiz for cloud-based generation with advanced features. It supports precise duration control and timeline-aligned rendering, making it ideal for video dubbing, narration, and content creation workflows. The skill includes voice cloning capabilities powered by reference audio, allowing you to generate speech in specific voices. Setup requires either a local Kokoro installation or a Noiz API key for cloud processing, with the option to control emotion and speaking style through the characteristic-voice skill integration.
Install via `$ npx skills add https://github.com/noizai/skills --skill tts`
Convert text, files, or URLs to audio with voice cloning.
Align speech to SRT subtitles for video narration.
Control voice selection, language, and emotion in segments.
$ npx skills add https://github.com/noizai/skills --skill ttsgit clone https://github.com/noizai/skillsCopy the install command above and run it in your terminal.
Launch Claude Code, Cursor, or your preferred AI coding agent.
Use the prompt template or examples below to test the skill.
Adapt the skill to your specific use case and workflow.
Check the GitHub repository or documentation for usage examples.
Take a free 3-minute scan and get personalized AI skill recommendations.
Take free scan