A high-speed car chase suddenly transforms into a mech battle, opening on a low tracking shot of a car racing through traffic, then snapping into a transformation mid-motion, parts unfolding violently, camera spinning around as the vehicle becomes a humanoid robot, immediately engaging in combat while sliding across asphalt, sparks trailing behind, explosions lighting the scene in dynamic flashes.
A high-speed car chase suddenly transforms into a mech battle, opening on a low tracking shot of a car racing through traffic, then snapping into a transformation mid-motion, parts unfolding violently, camera spinning around as the vehicle becomes a humanoid robot, immediately engaging in combat while sliding across asphalt, sparks trailing behind, explosions lighting the scene in dynamic flashes.
[Cinematographic shooting]: A dynamic low-angle shot that shows rain sliding along a streamlined car from the attached photo, streaming down the lens. As the tension increases, the camera switches to a slightly shaky manual shooting mode, emphasizing the scale and intensity of the transformation. The image was taken with IMAX digital cameras with anamorphic lenses. The high shutter speed allows you to capture the smallest mechanical details, while water droplets stick to the lens, and the shallow depth of field allows you to highlight the transforming structure.
[Subject]: A car from the attached photo, driven by a focused man at the wheel. As the transformation begins, the car transforms into a huge industrial robot made of car-colored steel, the rotating tires become powerful limbs, and the front grille opens up into sharp mechanical jaws.
[Action]: A man rushes along a rain-soaked road, gripping the steering wheel resolutely as water sprays from the tires. Suddenly, the hydraulic system is triggered — the roof slides apart, the wheels turn inward, turning into supports, and the chassis rises vertically. The robot falls to the ground, cracking the wet asphalt. His glowing red eyes flash in the headlights, and he releases a sparkling stream of fire into the stormy air.
[Context]: A winding, rain-soaked road passing through a dense pine forest covered in fog. Tall trees rise on both sides, partially hidden by thick fog. The rain is pouring down like a bucket, and glare appears on the smooth asphalt, reflecting the transformation.
[Style and Atmosphere]: An intense sci-fi action movie inspired by action movies. Realistic, rough CGI textures with dark metallic finish and barely noticeable rust. The stage is illuminated by the cold blue tones of stormy twilight, which contrast sharply with the fiery orange glow of the fur's breath and piercing red eyes, adding to the dramatic tension.
Wide shot of an industrial shipyard where massive construction trucks and cranes transform
simultaneously into heavy-armored bipedal tanks.
The transformation is gritty and mechanical, showing pistons hissing steam.
The camera dollies rapidly toward a central mech as it unleashes a barrage of missiles, creating a chain of orange and teal explosions across the harbor.
Ends with all mechs raising shields in a coordinated phalanx formation.
A high-speed street racer inside a neon-lit cockpit, jaw clenched, hands gripping wheel tightly, eyes reflecting city lights .
Drifts aggressively through tight corners while being chased, narrowly avoiding collisions and accelerating through gaps.
Dense cyberpunk city at night, wet roads reflecting neon signs, traffic moving unpredictably.
Starts with interior close-up then crash zoom outward, transitions into ultra-low tracking shot near wheels, lateral tracking matching speed, speed ramps during drifts, whip pans to obstacles, reflections streaking across frame, intense neon lighting and rain effects
cyberpunk
car
racing
urban
night
driver
tracking
action
colorful
## ANALYSIS PHASE (internal, do not output)
Before writing the prompt, determine:
**Scene archetype** — Classify into one of: `combat` / `dialogue` / `chase` / `intimate` / `discovery` / `performance` / `environmental` / `transformation` / `ensemble` / `montage`. This selects which specialized modules to include.
**Character extraction** — From each image, extract: build, apparent age, wardrobe, distinctive features, posture/attitude. Write these as compact identity anchors (e.g., "tall, lean, mid-30s, black tactical jacket, close-cropped hair, watchful stance") — enough for identity consistency without over-constraining. Target 8–15 words per character.
**Environmental logic** — Infer materials, lighting conditions, spatial scale, interactive elements (what can be touched, moved, broken, climbed). Note time of day and weather if implied.
**Emotional spine** — Identify the scene's core tension or question. Every beat should serve this.
**Natural arc** — Determine the scene's rhythm: does it build, release, oscillate, or sustain? This drives the ESCALATION and HERO MOMENT equivalents.
**Asymmetric intent** — Identify what each character wants *individually*. Dramatic charge comes from mismatched motivations, not shared ones.
---
## OUTPUT STRUCTURE
Assemble the prompt using these blocks. Modules marked *(conditional)* only appear when relevant to the scene archetype.
```
FORMAT: [duration] / [continuity descriptor — e.g., continuous, single-take, interleaved, montage]
CHARACTERS
A: [identity anchor from image analysis + role in scene]
B: [identity anchor from image analysis + role in scene]
[additional characters as needed]
ENVIRONMENT
[2–4 sentence description: space, materials, lighting, atmosphere, interactive elements, time/weather]
SCENE PROFILE
TYPE: [scene archetype]
TONE: [e.g., tender / menacing / bittersweet / reverent / unnerving / euphoric — pick 1–2]
ENERGY: [low / medium / high / explosive / suspended]
STAKES: [what is at risk or being pursued]
INTENT: [what each character wants from the scene — asymmetric where possible]
VISUAL
Cinematic realism, physical lighting, shallow DOF, subtle grain, natural motion blur.
[+ any archetype-specific additions: e.g., "warm practical sources" for intimate, "anamorphic flare" for performance, "handheld intimacy" for dialogue]
DIRECTIVE
No fixed choreography. Scene emerges from character intent, environment, and emotional momentum.
BEHAVIOR
- [3–5 bullets describing what actions/reactions are encouraged, tuned to archetype]
- Use environment expressively (not decoratively)
- Allow micro-behaviors: glances, breath, weight shifts, hesitations
RULES
- Momentum and motivation chain only — no disconnected beats
- Clear spatial positioning at all times
- Continuous, motivated camera ([archetype-appropriate framings: e.g., OTS + push-in for dialogue; wide + low-angle for performance; POV + handheld for discovery])
- Environment reacts appropriately ([archetype-specific: e.g., dust motes + light shifts for intimate; debris + splashes for combat; reflections + condensation for atmospheric])
[CONDITIONAL MODULE — IMPACT] *(combat, chase, transformation)*
Strong contacts/transitions: 2-frame hold + brightness spike + micro shake resume
[CONDITIONAL MODULE — CONNECTION] *(intimate, dialogue)*
Key emotional beats: slight hold on reaction + soft focus pull + breath audible in mix
[CONDITIONAL MODULE — REVELATION] *(discovery, transformation)*
Recognition beats: focus rack + slight push-in + ambient drop
MOTION
Secondary delay (~2 frames) on hair, fabric, environment. Weight and inertia respected throughout.
RHYTHM
[Archetype-tuned: e.g., "Fast exchanges + micro pauses + 1–2 slow-motion beats" for combat; "Long holds + small gestures + one breath-catch beat" for intimate; "Sustained glide + one rupture + recovery" for discovery]
ESCALATION
Intensity rises toward a defined apex. Include:
- one environment-driven or spatially expressive moment
- one decisive beat that shifts the scene's state
HERO MOMENT (CRITICAL)
Generate one standout beat that crystallizes the scene:
- visually dominant, clearly readable
- uses environment or full-body expression
- [archetype-appropriate entry] → defining action → brief aftermath hold
- ends with strong silhouette, composition, or emotional register
GUARDRAILS
- Keep character identity and position consistent (no teleporting, no feature drift)
- Respect weight, inertia, and physical causality
- Ensure clear cause → effect in every beat
- Keep camera readable and motivated (no random spins or cuts)
- Use environment logically (no clipping, no scale mismatch)
- Prioritize clarity and emotional truth over complexity
OUTPUT
Fluid, physical, cinematic scene with clear spatial logic, dynamic camera, environmental interaction, and a memorable defining moment.
```
---
## FEW-SHOT EXAMPLE 1 — COMBAT (original archetype)
**Input:**
- Character images: [two martial figures in dark practical clothing]
- Scene description: "A duel between rival assassins in an abandoned warehouse at dusk. Brutal and tactical — neither will walk away casually."
**Output:**
```
FORMAT: 15s / continuous fight
CHARACTERS
A: tall, lean, late-30s, black tactical jacket, close-cropped hair, controlled stance — the veteran
B: mid-30s, wiry, dark hooded pullover, fingerless gloves, restless footwork — the challenger
ENVIRONMENT
Abandoned warehouse, concrete floor streaked with old oil, steel support columns at regular intervals. Dust motes drift in horizontal shafts of dusk light through high broken windows. Stacked wooden pallets, a chain hoist overhead, loose rebar near the walls. Cold, echoing, scale large enough to allow distance and vertical play.
SCENE PROFILE
TYPE: combat / duel
TONE: brutal, tactical
ENERGY: high, escalating to explosive
STAKES: survival — only one leaves
INTENT: A wants to end it quickly and cleanly; B wants to prove something before the end
VISUAL
Cinematic realism, physical lighting, shallow DOF, subtle grain, natural motion blur. Dust catches the low side-light; shadows long and hard-edged.
DIRECTIVE
No fixed choreography. Fight emerges dynamically from characters, environment, and momentum.
BEHAVIOR
- Use environment freely (walls, columns, pallets, chain hoist)
- Allow verticality (climb, wall-run, elevation shifts)
- Encourage unexpected but physically valid actions
- Micro-behaviors: breath control, weight resets, a wipe of blood from the mouth
RULES
- Momentum chain only, no isolated actions
- Clear positioning at all times
- Continuous, motivated camera (POV / OTS / wide / low-high as needed)
- Environment actively reacts (dust plumes, debris, sparks from rebar, chain swing)
IMPACT
Strong contacts: 2-frame hold + brightness spike + micro shake resume
MOTION
Secondary delay (~2 frames) on hair, fabric, environment. Weight and inertia respected throughout.
RHYTHM
Fast exchanges + micro pauses + 1–2 slow-motion beats (0.5–1s)
ESCALATION
Intensity rises. Include:
- one vertical/environment-driven move (a column used to redirect, a pallet kicked into the space)
- one decisive high-impact action
HERO MOMENT (CRITICAL)
Generate one standout finishing beat:
- visually dominant, clearly readable
- uses environment or full-body motion
- slow-motion entry → sharp impact → brief aftermath hold
- ends with strong silhouette against the broken-window light, dust suspended, one figure standing
GUARDRAILS
- Keep character identity and position consistent (no teleporting)
- Respect weight and inertia (no floaty or impossible motion)
- Ensure clear cause → effect in every action
- Keep camera readable and motivated (no random spins or cuts)
- Use environment logically (no clipping or mismatch)
- Prioritize clarity and physical realism over complexity
OUTPUT
Fluid, physical, cinematic fight with clear spatial logic, dynamic camera, environmental interaction, and a memorable final highlight moment.
```
---
## FEW-SHOT EXAMPLE 2 — INTIMATE / REUNION
**Input:**
- Character images: [two figures, one in worn leather jacket, one in business attire]
- Scene description: "A reunion in a rain-soaked alley at night. They haven't seen each other in ten years. Tension but not hostility — unfinished business."
**Output:**
```
FORMAT: 12s / continuous, single breath-like take
CHARACTERS
A: mid-30s, lean, worn brown leather jacket over dark henley, unshaven, hands in pockets but shoulders tight — arrived first, has been waiting
B: mid-30s, charcoal overcoat over tailored suit, damp hair, briefcase in left hand — arrived second, deliberate pace
ENVIRONMENT
Narrow alley between brick buildings, slick with recent rain. Single sodium streetlight from the far end casts long amber shadows; a neon sign two buildings over bleeds red into puddles. Steam rises from a vent. Low ambient hum of city beyond. Puddles, loose gravel, a metal dumpster, one flickering bulb above a service door.
SCENE PROFILE
TYPE: intimate / reunion
TONE: bittersweet, suspended, quietly charged
ENERGY: low, with one surge
STAKES: acknowledgment — whether ten years can be spoken to or must stay unspoken
INTENT: A wants to know if B remembers; B wants to leave before admitting they do
VISUAL
Cinematic realism, physical lighting, shallow DOF, subtle grain, natural motion blur. Warm sodium + cold neon color contrast. Wet surfaces catch and fracture light. Breath visible in cold air.
DIRECTIVE
No fixed choreography. Scene emerges from character intent, environment, and emotional momentum.
BEHAVIOR
- Micro-behaviors lead: glances held a beat too long, weight shifts, a hand half-raised then dropped
- Physical distance negotiated without language — one step closer, one step back
- Environment expressive: a puddle disturbed, breath fogging, the flickering bulb timing a pause
- Allow silence to carry weight; dialogue optional and sparse
RULES
- Momentum and motivation chain only — no disconnected beats
- Clear spatial positioning at all times
- Continuous, motivated camera: slow push-in on A from behind B's shoulder, drift to profile two-shot, one handheld rack to B's hand on the briefcase
- Environment reacts: rain drips from a gutter, steam curls between them, a distant car passes casting moving shadow
CONNECTION
Key emotional beats: slight hold on reaction + soft focus pull + breath audible in mix
MOTION
Secondary delay (~2 frames) on hair, fabric, environment. Weight and inertia respected throughout.
RHYTHM
Long holds + small gestures + one breath-catch beat. Two moments where time almost stops. One slow exhale that releases the tension without resolving it.
ESCALATION
Intensity rises toward a defined apex. Include:
- one environment-driven moment (the bulb flickers at the exact wrong second, or a passing car's light sweeps across both faces)
- one decisive beat that shifts the scene's state (B sets the briefcase down, or A takes the half-step forward)
HERO MOMENT (CRITICAL)
Generate one standout beat that crystallizes the scene:
- visually dominant, clearly readable
- uses environment or full-body expression
- Slow focus rack from A's eyes to B's hand loosening on the briefcase handle → the hand opens slightly → hold on the small space between them, neon reflected in a puddle at their feet
- ends with two figures in silhouette against amber light, close but not touching
GUARDRAILS
- Keep character identity and position consistent (no teleporting, no feature drift)
- Respect weight, inertia, and physical causality
- Ensure clear cause → effect in every beat
- Keep camera readable and motivated (no random spins or cuts)
- Use environment logically (no clipping, no scale mismatch)
- Prioritize clarity and emotional truth over complexity
OUTPUT
Fluid, physical, cinematic scene with clear spatial logic, dynamic camera, environmental interaction, and a memorable defining moment.
```
Shot 1 (0:00 – 0:02) | Opening Frame
A close-up of a tattooed male arm with a sleek black smartwatch in a futuristic LED-lit corridor. The person raises their wrist and taps it. A faint, minimal holographic UI briefly appears (subtle glow), not the main focus. Camera quickly shifts attention away.
Shot 2 (0:02 – 0:04) | Quick Interaction
Fast swipe gesture. A quick glimpse of a luxury car hologram appears momentarily—no heavy UI detail. Immediate transition begins, emphasizing speed and intent.
Shot 3 (0:04 – 0:06) | Car Reveal (Main Focus Begins)
The hologram transforms rapidly into a real Lamborghini-style sports car. Cinematic low-angle shot. The car materializes with light streaks and reflections. Engine ignition sound implied. Strong emphasis on car design and presence.
Shot 4 (0:06 – 0:09) | Power Build-Up
Close-up shots of the car: headlights turning on, wheels spinning slightly, body reflections, exhaust heat shimmer. Camera moves dynamically around the car, building intensity.
Shot 5 (0:09 – 0:12) | Acceleration
The car launches forward aggressively. Motion blur, reflections on glossy ground, cinematic tracking shot (low angle or side tracking). Environment streaks past, emphasizing speed.
Shot 6 (0:12 – 0:15) | Drift + Final Frame
High-speed drift sequence. Tires screech, smoke spreads, car slides smoothly across a reflective surface. Camera tracks dynamically. Final slow-motion moment as the drift completes. Car stabilizes, slight engine rumble. Fade out.
A driver gripping the wheel, eyes wide, swerving violently at full speed. Dodges collapsing buildings as a giant creature rampages through the city. Urban streets being destroyed, cars flipping, dust clouds rising.
Front tracking shot pulling backward from the car, creature appearing in background, debris crashing around, cinematic destruction synced with motion.
tracking
car
chase
action
destruction
urban
drone
cinematic