How to get the best out of Gen-4 Act-Two
Gen-4 Act-Two on Pixio is Runway’s character-driven video model. You provide a reference image of a character (or person) and a text prompt that describes how they should move or act; the model generates video that keeps the character consistent across the clip. Use it when you need a specific character or spokesperson to perform an action or deliver a scene—talking, gesturing, or moving—without character drift.
Gen-4 Act-Two
Gen-4 Act-Two on Pixio is Runway’s character-driven video model. You provide a reference image of a character (or person) and a text prompt that describes how they should move or act; the model generates video that keeps the character consistent across the clip. Use it when you need a specific character or spokesperson to perform an action or deliver a scene—talking, gesturing, or moving—without character drift.
Use this when
- You have a character reference (photo, illustration, or design) and need them to perform in video—talking, gesturing, walking, or acting.
- You want character consistency—same face, look, and proportions across the generated clip.
- You need motion and expression driven by a text prompt (e.g. “waves at camera”, “explains product with hand gestures”).
- You’re building spokesperson, avatar, or character animation content without full lip-sync or voice (pair with Act-One or voice tools for speech).
Modes in Pixio
| Mode | Input | Best for |
|---|---|---|
| Character to Video | One character reference image + prompt | Character performs the described action; consistency from reference |
Options
| Option | Values | Notes |
|---|---|---|
| Reference | One image (character/person) | Clear face and body; front or three-quarter view works best |
| Duration | Depends on backend | Check Pixio for limits |
| Prompt | Action, expression, camera | Describe what the character does, not their appearance |
Credits
Credits depend on duration and plan; check the model card in Pixio for current rates.
Why Act-Two fits character-driven video
Act-Two is built for one character in, one character out: the reference image defines who we see, and the prompt defines what they do. The model keeps the character’s look consistent while animating motion and expression. Use it for spokesperson clips, character moments, or when you need a specific person/character to perform an action. For , combine with Runway or other voice/lip-sync tools.
