Skip to main content

Generating Videos

Updated this week

Bring images to life, create dynamic videos and transitions, or restyle your videos on the Canvas with Superstudio Video Flows.

Wan

With 3 models to choose from Wan lets you easily turn your ideas into high-quality videos, giving you the freedom to create and experiment.

Wan 2.5

Generate videos with audio directly from text prompts, with an optional image.
Length 5 seconds | 10 seconds

  • Image optional

  • Generates with audio

  • 16:9; 9:16; 1:1 aspect ratios

Wan 2.2
Generate videos using first/last frame images, a start image, or from text prompts alone.

Length 5 seconds | 10 seconds

  • Image optional

  • Supports first/last frame images

  • 16:9; 9:16; 1:1 aspect ratios

Wan 2.2 Turbo

Generate videos using first/last frame images, or a start image.

Length 5 seconds

  • Image required

  • Supports first/last frame images

  • 16:9; 9:16; 1:1 aspect ratios

Example video: Wan 2.2 Turbo

Veo 3.1

Generate cinematic-quality video with an elevated take on Google's Veo 3.1—now with refined scene coherence, enhanced photorealism, and precise audio.
Length:
Text to video: 4 seconds | 6 seconds | 8 seconds
Image to video: 8 seconds
Models: Veo 3.1 Fast | Veo 3.1

Features:

  • Generate a video from a text prompt with optional audio generation.

  • Generate a video from up to 3 images.

  • Text to video 16:9, 9:16 and 1:1 aspect ratio.

  • Image to video 16:9 and 9:16 aspect ratio.

  • Create videos with realistic speech and character lip movement from a text prompt.

  • Create videos with background audio and sound effects from text prompts.

  • Optional prompt enhance.

Example video: Veo 3.

Veo 3.1 First/Last Frame

Veo 3.1 introduces first and last frame generation for seamless storytelling—pairing richer audio, stronger narrative control, and enhanced realism to redefine what’s possible in state-of-the-art audiovisual creation.

Length: 8 seconds
Models: Veo 3.1 Fast | Veo 3.1

Features:

  • Generate a video from start and end frames.

  • 16:9, 9:16, 1:1 aspect ratio.

Veo 2

Generate high-quality, photorealistic video with natural motion, rich detail, and stunning compositions.

Length: Minimum length: 5 seconds | Maximum length: 8 seconds.

Features:

  • Generate a video from a text prompt with optional Start Image.

  • 16:9 and 9:16 aspect ratio.

  • Realistic fluid movement.

  • Prompt for dynamic camera movements.

Example video: Veo 2.

Kling Standard 2.1

Create videos with Kling Standard's video model—focused on exceptional movement realism, dynamics, aesthetics, and prompt adherence.

Length: 5 seconds | 10 seconds.
Models: Kling Standard 2.1

Features:

  • Generate video from text with optional Start Image.

  • Strong prompt adherence. Use a slider to adjust prompt intensity to guide the influence your text prompt has on your generation.

  • Create videos with fluid movement realism.

  • Very fast.

Example video: Kling Standard 2.1

Kling Standard

A video model focused on producing stylistic, morphing renders, great for creative and artistic interpretations.

Length: 5 seconds | 10 seconds.
Models: Kling Standard 1.0 | Kling Standard 1.6.

Features:

  • Generate your video from a text prompt and/or a starting image.

  • Create fluid movement and realistic character motion.

  • Choose from 9:16, 19:9 and 1:1 aspect ratios.

Example video: Kling Standard 1.6.

Kling Pro

A video model focused on producing stylistic, moving yet stable renders, great for creative and artistic interpretations.

Length: 5 seconds | 10 seconds.
Models: Kling Pro 1.0 | Kling Pro 1.5 | Kling Pro 1.6.

Features:

  • Generate a video from a text prompt with optional Start Image.

  • Fluid polished motion.

  • Realistic character motion with animated and photorealistic images.

  • Crisp colours and exquisite fine detail.

  • Create videos in 9:16, 19:9 and 1:1 aspect ratios.

Example video: Kling Pro.


Kling 2.0 Master Video

Create a video using Kling 2.0 Master model. This model is focused on producing stylistic, stable renders.

Length: 5 seconds | 10 seconds.

Features:

  • Generate a video from a text prompt with optional Start Image.

  • Stunning realistic motion.

  • Crisp colours, texture and exquisite fine detail.

  • Create videos in 9:16, 19:9 and 1:1 aspect ratios.

Example video: Kling 2.0.

Minimax Hailuo 02

MiniMax Hailuo 2.3 & 02 Essential Tips for Creators in Kaiber Superstudio

Transform static images into dynamic, fluid video content with MiniMax Hailuo AI 02, a state-of-the-art image animation model designed for creators and developers who need professional-quality video generation.

Length: 6 seconds or 10 seconds

Features:

  • Generate a video using start and end images.

  • Generate a video from a start image.

  • Ultra-realistic physics.

  • Excellent consistency with characters, background characters and elements.

Minimax Hailuo 2.3

Minimax Hailuo 2.3 is a state-of-the-art AI video generation model supporting both text-to-video and image-to-video content creation, with significantly improved visual performance to Minimax Hailuo 2.0.
Length: 6 seconds or 10 seconds.

Versions: Pro | Standard | Fast.

Features:

  • Generate a video from a text prompt and/or a starting image.

  • Excellent consistency with background characters and elements.

  • Add emotional depth of expression to characters.

Example video: Minimax.

Luma Video Ray 2

Capture light, depth, and motion with Luma Labs' Dream Machine model. Make cinematic 3D scenes with unmatched detail and realism.

Length: 5 seconds.

Models: Luma Ray 2 | Luma Ray 2 Flash.

Features:

  • Generate a video from a text prompt with optional Start Image.

  • Use both a Start and End Image to create seamless transitions between images.

Example video: Luma Ray 2.

Flipbook

Create videos with frame by frame animation, camera movements and optional audio reactivity.
Length: Minimum length: 3 seconds | Maximum length: 8 minutes.

Features:

  • Generate a video from a text prompt with optional Start Image.

  • Choose a style from the Kaiber range of curated aesthetics or create your own.

  • Choose from a range of camera movements; combine multiple movements cameras.

  • Upload up to 8 minutes of audio to create an audio reactive video where the camera movements sync to the beat. Your audio reactive video will match track length.

Example video: Flipbook.

Image Lip Sync

Bring static images to life with perfectly synced lip movement. Animate faces and match speech seamlessly.
Length: Supports a maximum output of 300 seconds.
Settings: 480p or 720p.

Video Lip Sync

Effortlessly lip sync speech or vocals to any video of a person.
Length: Supports a maximum output of 30 seconds.

Features:

  • Matches audio upload up to 30 seconds.

  • Will loop your video to match audio.

  • 720p resolution.

Grok Imagine Edit Video

Edit a video clip with a text prompt with Grok Imagine.

Access: Click the "Edit Video" button next to any video on the Canvas, or double click on a video in a Collection, Assets or on the Canvas to open the Edit Video window.

Prompting: Use a text prompt to describe the change you want to see.

  • "Make it anime style"

  • "He is skateboarding through a swirling inferno"

  • "Change the character's jacket to green"

Video length: Up to 8 seconds. Longer videos will be automatically cropped to edit the first 8 seconds.

Video Restyle 1

Restyle a video to a specific aesthetic using text prompts.
Minimum length: 3 seconds | Maximum length: 240 seconds.

Features:

  • Add a text prompt to describe your video.

  • Add your own aesthetic text prompt or choose from the Kaiber range of curated aesthetics.

  • Adjust the intensity to control how much change you will see in your video.

Tips:

  • Video length and aspect ratio will match that of the uploaded video.

  • Test prompts and settings with a short video clip before longer generations.

Example video: Video Restyle 1.
Prompt: “male (android:1.4) with shiny chrome limbs a sleek, polished metal body composed of shiny silver plating, reflecting light off its smooth surfaces, (featureless robotic head:1.3) with glowing optical sensors, chest is broad, background is blank, flat wall in the style of futuristic sci fi, high dynamic range, rich colors, lifelike textures, 8K UHD”

Example video: Video Restyle 1.
Prompt: “girl balancing in the style of (line drawing:1.4) (flat anime:1.3), 2D, pencil art, line drawing, subdued matt colours, (lens compression:1.2), dull surface, makoto shinkai”

Video Restyle 2

Restyle a video into a specific aesthetic. Version 2.0 comes with the ability to input aesthetic images plus improved style transfer capabilities.
Length: Minimum: 3 seconds | Maximum: 180 second.

Features:

  • Add a text prompt to describe your video.

  • Adjust the intensity to control how much change you will see in your video.

  • Add your own aesthetic text prompt or choose from the Kaiber range of curated aesthetics.

  • Add an aesthetic image. Use the slider to guide how much influence this will have on your generation.

  • Adjust the intensity to control how much change you will see in your video.

Tips:

  • Video length and aspect ratio will match that of the uploaded video.

  • Test prompts and settings with a short video clip before longer generations.

Add Audio to Video

Create audio synced to a video for smooth integration of background music, soundscapes and sound effects.

Cost: 3 credits per second.

Duration: will match uploaded video length 3-90 seconds.

Output: mp4.


Prompting

Describe the kind of audio you want, consider individual sounds, sound sources and the overall mood or atmosphere.


Example prompts:

Soft wind rustling through leaves, distant birdsong echoing gently to create a peaceful woodland atmosphere

The hiss of a steaming espresso machine over soft background chatter, clinking cups and cutlery, gentle café music playing faintly

Did this answer your question?