Pixio briefing

How to get the best out of Stable Audio 2.5

Compose

Best when the composition, mood, and arrangement need to come together from one brief.

Songs, instrumentals, background music, cue generation.

Structure

Best when you define pacing and sections instead of vague genre labels.

Hooks, transitions, timing, emotion, arrangement logic.

Refine

Best when the source audio is useful but needs cleanup, transformation, or separation.

Stem work, edits, polish passes.

Basic Info

Stable Audio on Pixio (e.g. Stable Audio 2.5) creates or transforms audio for music and sound design in three modes: text-to-audio, inpainting (editing part of a clip), and audio-to-audio. Use it when you need prompt-driven music or sound design and may also want to edit existing clips (inpainting) or transform them (audio-to-audio).

Stable Audio

Use this when

You need text-to-audio for music or sound design (describe genre, mood, and duration).
You want to edit part of a clip (inpainting): replace or fix a segment without regenerating the whole thing.
You need audio-to-audio: transform an existing clip with a prompt to change its style, mood, or content.
You are building sound design, background music, or SFX and want prompt-driven control in the style of Stable Diffusion.

Modes in Pixio

Mode	Input	Best for
Text to Audio	Prompt (genre, mood, duration)	New music or sound design from scratch
Inpainting	Existing clip + mask + prompt	Edit or replace a segment
Audio to Audio	Existing clip + prompt	Transform style, mood, or content
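
The table above can be read as a list of required inputs per mode. The following is a minimal Python sketch of that reading, not Pixio's actual API: the names `Mode`, `AudioRequest`, and `missing_inputs` are hypothetical, and the mask is assumed to be a start/end time in seconds, which Pixio may represent differently.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional, Tuple


class Mode(Enum):
    TEXT_TO_AUDIO = "text_to_audio"
    INPAINTING = "inpainting"
    AUDIO_TO_AUDIO = "audio_to_audio"


@dataclass
class AudioRequest:
    """Hypothetical request shape; field names are illustrative only."""
    mode: Mode
    prompt: str
    clip_path: Optional[str] = None  # existing clip, if the mode needs one
    mask_seconds: Optional[Tuple[float, float]] = None  # assumed (start, end) of the segment to edit


def missing_inputs(req: AudioRequest) -> List[str]:
    """Return the inputs the Modes table lists for this mode but the request lacks."""
    missing = []
    if not req.prompt.strip():
        missing.append("prompt")
    if req.mode in (Mode.INPAINTING, Mode.AUDIO_TO_AUDIO) and not req.clip_path:
        missing.append("existing clip")
    if req.mode is Mode.INPAINTING and req.mask_seconds is None:
        missing.append("mask")
    return missing
```

A check like this is only a pre-flight guard: an inpainting request with a clip but no mask would report `["mask"]` before anything is submitted.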

Options

Option	Values	Notes
Duration	Depends on backend (e.g. up to 90s or more)	Check Pixio for limits
Prompt	Genre, mood, instruments, structure	Be specific for best results
Credits	Plan-based	Check model card in Pixio
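
To make "be specific" concrete, here is a small Python sketch that assembles a prompt from the four elements in the Prompt row and caps the requested duration. The helper names `build_prompt` and `clamp_duration` are hypothetical, the prompt layout is just one plausible phrasing, and the 90-second default is a placeholder taken from the table's "e.g."; check Pixio for the real limit.

```python
from typing import Sequence


def build_prompt(genre: str, mood: str, instruments: Sequence[str],
                 structure: Sequence[str]) -> str:
    """Join genre, mood, instruments, and structure into one descriptive prompt."""
    parts = []
    if genre.strip():
        parts.append(genre.strip())
    if mood.strip():
        parts.append(f"{mood.strip()} mood")
    if instruments:
        parts.append("instruments: " + ", ".join(instruments))
    if structure:
        # Sections are listed in playback order.
        parts.append("structure: " + " then ".join(structure))
    return "; ".join(parts)


def clamp_duration(requested_s: float, max_s: float = 90.0) -> float:
    """Cap the requested duration at max_s (assumed limit, not a confirmed Pixio value)."""
    if requested_s <= 0:
        raise ValueError("duration must be positive")
    return min(requested_s, max_s)


prompt = build_prompt(
    "ambient techno",
    "tense",
    ["analog synth bass", "sparse hi-hats"],
    ["slow intro", "build", "drop"],
)
```

Naming sections in order ("slow intro then build then drop") gives the model pacing to follow, which a bare genre label does not.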

When to use Stable Audio vs other models

Scenario	Best choice
Text-to-audio + inpainting + audio-to-audio	Stable Audio
Music only (no edit)	Pixio Music, Lyria 2, MiniMax Music, Songcraft
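
One way to read the comparison above as a decision rule, sketched in Python. The function name `suggest_models` and its two flags are hypothetical, and the rule is an interpretation of the table rather than official Pixio guidance.

```python
from typing import List

# Alternatives the table lists for music-only work with no editing.
MUSIC_ONLY_MODELS = ["Pixio Music", "Lyria 2", "MiniMax Music", "Songcraft"]


def suggest_models(needs_editing: bool, music_only: bool) -> List[str]:
    """Suggest models based on whether clip editing or transformation is needed.

    needs_editing: inpainting or audio-to-audio is part of the job.
    music_only: the job is music generation with no sound design.
    """
    if needs_editing or not music_only:
        return ["Stable Audio"]
    return list(MUSIC_ONLY_MODELS)
```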

Tips