2026-05-25 · 10 min read

The Perfect AI Video Prompt Structure: Shot List vs Ultra-Detailed

Compare two Sora 2 prompt structures — layered Shot List and ultra-detailed parameterized format. Learn when to use each and how to structure prompts for cinematic AI video output.

Soraprompt structureshot listvideoformula

Two prompt structures, two philosophies

Sora 2 offers two distinct prompt structures, each designed for a different production need. Understanding when to use each is the difference between a lucky good output and a repeatable professional workflow.

Structure A — the layered Shot List — treats the prompt like a director's briefing document. You separate Style, Cinematography, Actions, and Background Sound into labeled blocks. Structure B — the ultra-detailed parameterized format — treats the prompt like a DP's tech spec sheet, specifying everything from capture format and lens filtration to grade palette and finishing notes.

Structure A: The layered Shot List

The Shot List is the recommended structure for most production work. It breaks the prompt into four labeled blocks that Sora reads as a sequential directing instruction:

-Style block: visual anchor + aesthetic lineage + era/film texture. Example: '1970s romantic drama, 35mm with natural flares and warm halation, soft focus, slight gate weave.'
-Cinematography block: camera (framing + movement + lens spec), lighting (key + fill + ambient), mood (1–2 words).
-Actions block: ordered beat list — Beat 1, Beat 2, Beat 3 — plus one optional short dialogue line per character.
-Background Sound block: environmental and diegetic only. No post-production score. Let the natural sound design breathe.

Structure B: The parameterized format

The parameterized format is for film-industry-grade control when you need to replicate a specific film stock, lens setup, or color grade across multiple shots. It specifies:

-Format & Look: duration, shutter angle, capture format, grain profile.
-Lenses & Filtration: focal length, filter type, polarization.
-Grade/Palette: highlight/midtone/shadow color mapping.
-Lighting & Atmosphere: natural light direction, bounce boards, haze/fog.
-Location & Framing: foreground/midground/background layering.
-Wardrobe & Props: specific clothing and object details.
-Sound: diegetic-only layers with specific sound sources.
-Camera Notes: optional lens specification details.
-Finishing: optional post-processing reference.

When to use which structure

Use the Shot List (Structure A) for: most commercial and creative work, sequences under 4 shots, situations where speed and repeatability matter more than photographic precision, and when you are iterating rapidly on subject and location. Use the parameterized format (Structure B) for: film-industry replicas, multi-shot continuity where every shot must match the exact same film stock, projects where the photographic look is the product (fashion films, car commercials, branded content), and when you have a DP or colorist providing tech specs.

Most creators use the Shot List for 90% of their work and switch to parameterized only when a specific project demands photographic precision.

Three rules that apply to both structures

Rule 1 — Style is the strongest lever. Do not write 'cinematic, beautiful.' Write a specific film era, stock, and lens texture. Sora locks its entire visual pipeline to the Style field, so make it count.

Rule 2 — Beats beat seconds. Instead of 'from 0 to 2 seconds the robot does X,' write 'Robot taps bulb → flinches, dropping bulb → catches it.' Ordered actions produce more precise timing than timestamps.

Rule 3 — The 80-150 word sweet spot. Fewer than 80 words produces random output. More than 200 words causes visual hallucinations as the model tries to reconcile conflicting instructions.

Building a reusable prompt template

The fastest path to consistent output is a reusable template. Start with the Shot List skeleton: Style, Cinematography, Actions, Background Sound. Fill each block with concrete details once, then for each new scene, swap only the subject, location, and specific action beats. Keep the Style and Cinematography blocks identical across shots to maintain visual continuity.

This template approach turns your prompts from disposable one-liners into a production asset. The same Shot List skeleton can generate dozens of visually consistent clips with minimal rewriting.

FAQ

Which prompt structure is better for beginners?

The layered Shot List. It is easier to learn, produces reliable results faster, and covers 90% of creative needs. Master the Shot List before trying the parameterized format.

Can I mix both structures in one prompt?

No. Pick one structure and commit to it. Mixing Shot List blocks with parameterized fields causes Sora to get confused about which instruction set to follow.

How many action beats per shot?

Three to five beats for a 4–6 second clip. Each beat should be one clear action or camera move. More than five beats creates motion noise and temporal confusion.

Do other AI video models support these structures?

The Shot List structure ports well to Veo and Kling. The parameterized format is Sora-specific due to its deeper control over capture format, lens specs, and grade palette.

Related resources

Browse Sora prompts AI video prompt guide Model comparison Veo prompts Kling prompts Seedance / Dreamina prompts