How to get the best out of Fabric 1.0 / 1.0 Fast
Fabric 1.0 / 1.0 Fast on Pixio is Veed Fabric’s talking-head pipeline: one face image + audio → lip-synced video. The model drives mouth and expression from the audio so the character speaks naturally. 1.0 Fast for speed and lower cost; 1.0 for higher quality when it matters. Use it when you need a spokesperson, avatar, or talking head that matches your script or voiceover.
Fabric 1.0 / 1.0 Fast
Fabric 1.0 / 1.0 Fast on Pixio is Veed Fabric’s talking-head pipeline: one face image + audio → lip-synced video. The model drives mouth and expression from the audio so the character speaks naturally. 1.0 Fast for speed and lower cost; 1.0 for higher quality when it matters. Use it when you need a spokesperson, avatar, or talking head that matches your script or voiceover.
Use this when
- You need talking-head video: one face image and audio (voiceover, podcast, script) → lip-synced clip.
- You want Veed Fabric quality and natural lip-sync without manual animation.
- You’re building spokesperson, avatar, or explainer content and have a clear face reference and audio track.
- You want Fast for drafts and 1.0 for final quality.
Modes in Pixio
| Mode | Input | Best for |
|---|---|---|
| Face + Audio to Video | One face image + audio file (or script) | Lip-synced talking head; expression driven by audio |
Options
| Option | Values | Notes |
|---|---|---|
| Variant | 1.0 Fast, 1.0 | Fast = speed/cost; 1.0 = higher fidelity |
| Face reference | One image (clear face, front or three-quarter) | Good lighting, neutral or slight expression |
| Audio | Voice track or script (when TTS supported) | Clean audio improves lip-sync |
Credits
Credits depend on variant (1.0 Fast vs 1.0) and duration. Fast costs less per clip. Check the model card in Pixio for current rates.
Why Fabric fits talking head
Fabric is built for one face + one audio → one talking-head video. The model handles lip-sync and expression from the audio; you don’t need to animate mouth or timing. Use a clear face reference (front or three-quarter, good lighting) and for best results. For character-driven (e.g. waving, gesturing), use or instead.
