Gemini Omni Reference Consistency: Keep Characters and Products Stable

May 20, 2026

Gemini Omni makes references more important, not less. When a workflow can combine images, clips, and audio cues, the prompt must tell the model which details are anchors and which details are allowed to change.

Related guides:

Write the visual contract

List what must survive the generation. For a product, that may be silhouette, label, color, material, scale, and logo placement. For a character, it may be face shape, hairstyle, outfit, posture, voice policy, and expression range.

Separate appearance from motion

The reference should define appearance. The prompt should define movement. If you ask for a new outfit, new background, fast camera, expression change, and product rotation at once, you make drift harder to diagnose.

Use the uploaded image as the identity anchor. Preserve face shape, hair, outfit, and main silhouette. Add one restrained movement: a slow turn toward window light. Keep the background simple. Review: does the person remain recognizable and appropriate for this context?

Use source videos carefully

A source video can provide rhythm, pose, or camera movement. Name which part you want. If the reference is only for motion, say that the original subject, background, or water/objects should not appear in the final clip.

Takeaway

Gemini Omni reference consistency comes from a clear contract: preserve the anchor, move one thing, review the drift, then revise one variable.

Gemini Omni Team

Gemini Omni Team

Gemini Omni Reference Consistency: Keep Characters and Products Stable