How to Write Perfect AI Video Prompts: The Complete Guide
Master AI video prompt engineering with this comprehensive guide. Learn the techniques, structures, and specific words that produce stunning AI video results every time.
AI video generation is powerful — but the quality of your output depends almost entirely on the quality of your prompt. A vague prompt produces vague results. A well-structured prompt with the right vocabulary produces cinematic, production-ready video.
This guide covers everything: the anatomy of a great prompt, vocabulary that consistently improves results, model-specific tips, and common mistakes to avoid.
The Anatomy of a Great Video Prompt
Every strong video prompt has five components:
- Subject — Who or what is the focus?
- Action — What is happening?
- Environment — Where does it take place?
- Camera/Composition — How is it framed and shot?
- Mood/Style — What is the feel and aesthetic?
Not every prompt needs all five, but including all five consistently produces better results than omitting any of them.
Weak Prompt vs. Strong Prompt
Weak: "A woman on a beach"
Strong: "A woman in a white linen dress walks barefoot along a deserted beach at sunset, the camera slowly tracks her from a low angle, warm golden light, cinematic, shallow depth of field"
The difference in output quality between these two prompts is dramatic.
Component 1: Subject and Action
Be specific about who your subject is and what they're doing. Generic descriptions produce generic results.
Generic: "a person walking" Specific: "a young woman in business attire, mid-thirties, confident posture, walking through a glass-walled office corridor"
Specificity in action is equally important:
Generic: "cooking" Specific: "deftly slicing vegetables with a chef's knife, rhythmic and precise"
Component 2: Environment
Environments have enormous impact on visual quality. Describe not just the location, but the time of day, weather, and specific environmental details.
Effective environment descriptors:
- Time of day: "dawn", "golden hour", "midday", "blue hour", "midnight"
- Weather: "clear sky", "overcast", "light rain", "fog", "storm approaching"
- Location specifics: "a cobblestone alley in Paris", "a modern Tokyo skyscraper lobby", "an ancient Greek ruin at dusk"
- Atmosphere: "humid and tropical", "crisp mountain air", "dusty and arid"
Component 3: Camera and Composition
This is where most beginners leave quality on the table. AI video models are trained on real footage, which means cinematographic language works.
Camera Movements
Use these terms to get specific camera behavior:
- Dolly in: Camera physically moves toward the subject (more intimate than zoom)
- Dolly out / Pull back: Camera moves away, revealing context
- Pan: Camera rotates horizontally while staying in place
- Tilt: Camera rotates vertically
- Tracking shot: Camera follows the subject
- Crane/Jib shot: Camera rises up vertically
- Handheld: Subtle natural camera shake suggesting authenticity
- Static / Locked-off: Perfectly still camera
- Arc shot: Camera orbits around the subject
Framing and Composition
- Extreme wide shot (EWS): Tiny subject in vast environment
- Wide shot (WS): Full body of subject with environment
- Medium shot (MS): Subject from waist up
- Close-up (CU): Face and shoulders
- Extreme close-up (ECU): Eyes, hands, small details
- Dutch angle: Tilted camera for unease
- Low angle: Camera below eye level, subject appears powerful
- High angle: Camera above, subject appears vulnerable
- Bird's eye view: Directly above
Lens Characteristics
- Shallow depth of field: Subject sharp, background blurred
- Deep focus: Everything in frame is sharp
- Anamorphic: Wide cinematic look, horizontal lens flares
- Wide angle: More environment visible, slight distortion
- Telephoto: Compressed background, isolates subject
- Macro: Extreme close-up detail
Component 4: Lighting
Lighting dramatically affects mood and quality. Use specific lighting language:
Natural Light
- "Golden hour soft backlight"
- "Overcast diffused daylight"
- "Dramatic midday overhead sun"
- "Window light from the left, soft natural fill"
- "Blue hour, ambient twilight glow"
Artificial Light
- "Warm tungsten practical lights"
- "Neon signs reflecting on wet pavement"
- "Single key light from above-right"
- "Candle-lit, warm and intimate"
- "Fluorescent office lighting, cold and clinical"
Cinematic Lighting Styles
- "Rembrandt lighting" — dramatic shadows with triangle on cheek
- "Chiaroscuro" — extreme contrast, very dramatic
- "High key" — bright, minimal shadows, commercial feel
- "Low key" — predominantly dark with selective light
- "Backlit / contre-jour" — light source behind subject, silhouette effect
Component 5: Style and Mood
Describe the aesthetic and emotional quality of the clip:
Cinematic References
- "Shot on 35mm film"
- "8mm vintage aesthetic, grain"
- "IMAX 70mm quality"
- "Documentary style"
- "Commercial advertising look"
Color Palette
- "Desaturated with a slight blue tint"
- "Warm teal and orange color grade"
- "Vivid saturated, pop art palette"
- "Muted earthy tones"
- "High contrast black and white"
Mood Descriptors
- "Melancholy and introspective"
- "Energetic and kinetic"
- "Serene and peaceful"
- "Tense and suspenseful"
- "Playful and lighthearted"
- "Epic and grandiose"
Model-Specific Tips
Seedance 2
Works well with: Direct action descriptions, cinematic environments, style keywords. Keep under 60 words.
Kling 3.0
Works best with: Highly specific physical descriptions, photorealistic subjects, physics interactions. Responds well to "hyperrealistic" and "photographic".
Veo 3
Strongest with: Technical cinematic language, complex multi-element scenes, camera movement descriptions. Can handle longer, more complex prompts (up to 100 words effectively).
Negative Prompting (Where Supported)
Some models support negative prompts — terms describing what you don't want. Useful negative terms:
- "no blur", "no noise", "no grain" — for sharp output
- "no distortion", "no warping" — for stable subjects
- "no text", "no watermark" — for clean commercial output
- "no artifacts" — general quality improvement
Common Mistakes
Mistake 1: Too vague Don't: "a nice sunset video" Do: "time-lapse of a sunset over the ocean, wide shot, warm orange and pink sky, waves in foreground, cinematic"
Mistake 2: Too long Very long prompts (150+ words) confuse most models. The model may focus on early elements and ignore later ones. Keep it to 40–80 words for most models.
Mistake 3: Conflicting instructions Don't mix conflicting camera or style instructions. "Handheld AND perfectly stable" will confuse the model.
Mistake 4: Ignoring aspect ratio Specify "vertical 9:16 format" for social media vertical content, or "widescreen 16:9" for landscape. Default is usually 16:9.
Mistake 5: Expecting perfection on the first try AI video generation benefits from iteration. Generate 3–5 variations of a concept, pick the best, then refine the prompt based on what worked and what didn't.
Sample Prompts for Common Use Cases
Product launch video: "A sleek smartwatch rotating slowly on a dark reflective surface, studio lighting with a single overhead key light, anamorphic lens, close-up slowly pulling back to reveal the full watch, black and silver color palette, premium commercial aesthetic"
Social media lifestyle reel: "A young woman doing yoga on a rooftop at sunrise, handheld camera, warm backlight, natural and authentic feel, vertical 9:16 format, soft bokeh background"
Brand story video: "A craftsman's hands carefully assembling a wooden chair in a sun-dappled workshop, shallow depth of field, dust particles in the light, slow dolly forward, warm earthy tones, documentary style"
Travel content: "Aerial drone shot slowly descending toward a tropical island, turquoise water, palm trees, golden hour, wide establishing shot, cinematic, lush and vibrant color"
Prompt engineering is a skill that improves with practice. The more you generate, the better you understand how each model interprets language — and the more precisely you can craft prompts that produce exactly what you envision.
Try it yourself on Framiq
20 free credits. Access every model mentioned in this article. No credit card required.
Start generating free