Pixio Academy

Audio & Music

Music and voice generation.

Lyria 2

Google Lyria 2: audio/music generation with strong aesthetic sense and coherence—good for stylized and artistic audio.

Songcraft Generate

Generate full music tracks from a text description (Suno-style)—create songs with structure, style, and length you describe.

4 lessonsView course

Songcraft

Basic

Generate full songs from text with Songcraft (Suno). Control genre, mood, and lyrics. Extend songs, create covers, and split stems.

4 lessonsView course

Stable Audio 2.5

Basic

Create or transform audio: text-to-audio, inpainting (edit parts of a clip), or audio-to-audio for sound design and music.

4 lessonsView course

Tempolor

Basic

Work with song structure: extract vocals, instrumental, or split into stems for remixing and production.

4 lessonsView course

Mureka

Basic

Create music with AI lyrics and instrumental options, extend clips, or regenerate segments—full control over style and arrangement.

4 lessonsView course

Music V2

Basic

MiniMax music generation: create tracks from descriptions with a balance of quality and speed for drafts and finished pieces.

4 lessonsView course

MiniMax Music V2

Basic

Generate music from style and mood prompts plus lyrics. Text-to-music with control over composition and sample rate.

4 lessonsView course

Pixio Music

Basic

Pixio's music generation: create and shape music from text with integrated controls and workflows.

4 lessonsView course

Gemini 3.1 Flash TTS Preview

Basic

Google text-to-speech with natural single-speaker narration, selectable voices, and prompt-controlled style.

0 lessonsView course

ElevenLabs Music

Basic

Compose songs from a prompt or a composition plan. Create instrumentals and full tracks with ElevenLabs Music (Compose).

4 lessonsView course

Speech 02/2.5/2.6/2.8 Turbo & HD

Basic

MiniMax text-to-speech with multiple quality and speed tiers—from fast Turbo to high-fidelity HD for different use cases.

4 lessonsView course

MiniMax Speech

Basic

High-quality text-to-speech with MiniMax Speech 02, 2.5, 2.6, and 2.8 (Turbo and HD). Multiple preset voices and natural intonation.

4 lessonsView course

Voice Clone

Basic

Clone a voice from samples with MiniMax—create a consistent synthetic voice for narration, dialogue, or content at scale.

4 lessonsView course

ElevenLabs

Basic

Convert text to speech with ElevenLabs. Choose from a wide range of voices, adjust stability and style, and use custom voice clones (IVC).

4 lessonsView course

Text to Speech / Voice Clone (IVC) / Text to Dialogue

Basic

ElevenLabs TTS, voice cloning (IVC), and multi-voice dialogue—natural-sounding speech and character voices.

4 lessonsView course

ElevenLabs Text to Dialogue

Basic

Generate multi-speaker dialogue from text. Assign different voices to each speaker for podcasts, storytelling, and presentations.

4 lessonsView course

Music (Compose) / Sound Effects

Basic

ElevenLabs music composition and sound effects—generate background music and SFX from text for video and media.

4 lessonsView course

Kling Create Voice

Basic

Create a reusable custom voice from a clean 5–30 second audio sample. Use your Kling voice ID for consistent voiceovers and content.

4 lessonsView course