• Tools
  • Pricing
  • Workflows
  • All Models
    Maker Mode
  • Gallery
  • Academy
  • Documentation
  • API
  • Status
  • Blog
Pixio Logo
Sign InSign Up
Pixio Logo
  • Tools
  • Pricing
  • Workflows
  • All Models
    Maker Mode
  • Gallery
  • Academy
  • Documentation
  • API
  • Status
  • Blog
Sign InSign Up
Pixio Logo

Visualize the Future: Crafted by AI, Inspired by You

© Copyright 2026 Pixio. All Rights Reserved.

Privacy PolicyTerms of ServiceRefund Policy
ModelsAudio & Music
ModelsPixio audio model systemBuilt for voice, music, and structure

Audio & Music

Music and voice generation.

Audio is easier to navigate when you split the problem by function: speech, composition, dialogue, or transformation. The right tool usually becomes obvious once the role is clear.

Open in PixioExplore the academy

Browse by output role first, then use the model page to get into prompting, structure, and workflow details.

18 models in Audio & Music

Open any card for the full model brief
AudioModel brief
ElevenLabs
01

Convert text to speech with ElevenLabs. Choose from a wide range of voices, adjust stability and style, and use custom voice clones (IVC).

Voice
Open model brief
AudioModel brief
ElevenLabs Music
02

Compose songs from a prompt or a composition plan. Create instrumentals and full tracks with ElevenLabs Music (Compose).

Compose
Open model brief
AudioModel brief
ElevenLabs Text to Dialogue
03

Generate multi-speaker dialogue from text. Assign different voices to each speaker for podcasts, storytelling, and presentations.

Voice
Open model brief
AudioModel brief
Kling Create Voice
04

Create a reusable custom voice from a clean 5–30 second audio sample. Use your Kling voice ID for consistent voiceovers and content.

Voice
Open model brief
AudioModel brief
Lyria 2
05

Google Lyria 2: audio/music generation with strong aesthetic sense and coherence—good for stylized and artistic audio.

Compose
Open model brief
AudioModel brief
MiniMax Music V2
06

Generate music from style and mood prompts plus lyrics. Text-to-music with control over composition and sample rate.

Compose
Open model brief
AudioModel brief
MiniMax Speech
07

High-quality text-to-speech with MiniMax Speech 02, 2.5, 2.6, and 2.8 (Turbo and HD). Multiple preset voices and natural intonation.

Voice
Open model brief
AudioModel brief
Mureka
08

Create music with AI lyrics and instrumental options, extend clips, or regenerate segments—full control over style and arrangement.

Compose
Open model brief
AudioModel brief
Music (Compose) / Sound Effects
09

ElevenLabs music composition and sound effects—generate background music and SFX from text for video and media.

ComposeEdit
Open model brief
AudioModel brief
Music V2
10

MiniMax music generation: create tracks from descriptions with a balance of quality and speed for drafts and finished pieces.

Compose
Open model brief
AudioModel brief
Pixio Music
11

Pixio's music generation: create and shape music from text with integrated controls and workflows.

Compose
Open model brief
AudioModel brief
Songcraft
12

Generate full songs from text with Songcraft (Suno). Control genre, mood, and lyrics. Extend songs, create covers, and split stems.

ComposeEdit
Open model brief
AudioModel brief
Songcraft Generate
13

Generate full music tracks from a text description (Suno-style)—create songs with structure, style, and length you describe.

Compose
Open model brief
AudioModel brief
Speech 02/2.5/2.6/2.8 Turbo & HD
14

MiniMax text-to-speech with multiple quality and speed tiers—from fast Turbo to high-fidelity HD for different use cases.

Voice
Open model brief
AudioModel brief
Stable Audio 2.5
15

Create or transform audio: text-to-audio, inpainting (edit parts of a clip), or audio-to-audio for sound design and music.

ComposeEdit
Open model brief
AudioModel brief
Tempolor
16

Work with song structure: extract vocals, instrumental, or split into stems for remixing and production.

ComposeEdit
Open model brief
AudioModel brief
Text to Speech / Voice Clone (IVC) / Text to Dialogue
17

ElevenLabs TTS, voice cloning (IVC), and multi-voice dialogue—natural-sounding speech and character voices.

Voice
Open model brief
AudioModel brief
Voice Clone
18

Clone a voice from samples with MiniMax—create a consistent synthetic voice for narration, dialogue, or content at scale.

Voice
Open model brief