Style: Cinematic, romantic tension, realistic handheld
Setting: Night, Italian city (Rome/Milan), heavy rain, warm streetlights reflecting on wet pavement
Mood: Intimate + emotionally charged
0–2s Handheld close-up, slightly unstable camera. Heavy rain pouring onto the pavement. A man and a woman stand under one umbrella, very close. Water drips from the edges. Sound of rain dominates.
2–4s Close-up on the man's face as he looks at her calmly but intensely. He speaks softly: "That was your stop, wasn't it?"
4–6s Cut to the woman, slightly flustered. She shifts subtly but stays under the umbrella. She replies quietly: "…You're too close."
6–8s Camera slowly pushes in, handheld. The man gives a faint half-smile, rain sliding down his jacket. "That's a weak excuse."
8–10s Close-up on their hands near the umbrella handle. His hand is trembling slightly. She notices, looks up at him: "Your hand is shaking."
10–12s Brief silence—only rain and distant traffic. He looks away for a moment, then back at her: "…You noticed."
12–15s Camera slowly circles them, close together under the umbrella. Tension builds—neither steps away. She moves slightly closer, almost unconsciously. Breath visible in the cold air.
Final frame: Close-up of their faces inches apart, rain falling, city lights blurred in the background. Cut to black before anything happens.
Key Notes
No slow motion → natural, grounded movement
Strong emphasis on rain sound + breathing + subtle tension
Handheld camera for realism
Focus on micro-expressions and emotional proximity
Style tags: romantic, rain, cinematic, emotional, dialogue, night, urban
[CINEMATIC SETUP]
Film Style: Ultra-realistic 35mm anamorphic, documentary-style live-action.
Lens: 35mm wide-angle prime.
Color Grade: Teal-and-amber split tone. Amber from wok fire screen-right, cool teal from fluorescent signage washing the background crowd and wet pavement.
Lighting: Practical only — open-flame wok burners cast hard amber uplight on faces; overhead fluorescent tubes spill teal across the lane.
Camera Behavior: One continuous unbroken Steadicam glide, no cuts. Begins medium-wide on the cooking station, drifts slowly left through the crowd, then reverses back to settle on a wok flare.
[IMAGE REFERENCE]
@image1: Night market street — two women cooking at wok station right-of-frame, wet pavement reflecting teal signage, crowd filling the lane. Maintain this color temperature split and camera height throughout.
[TIMELINE — CONTINUOUS SHOT]
0-3s: HOLD. Medium-wide matching @image1. The lead cook — adult woman, dark hair in high ponytail, blue-green t-shirt, green apron — stirs a steel wok with a long spatula. Oil pops, white steam billows upward catching amber firelight. A second cook beside her ladles broth into a bowl. SFX: sizzle, oil pop, metal scrape.
3-7s: DRIFT LEFT. Slow Steadicam glide left at waist height into the crowd lane. A young couple leans toward a vendor stall pointing at a menu board. Three patrons walk toward camera weaving between plastic stools. Wet pavement reflects teal and amber in long vertical streaks. A second vendor — man in white tank top, dark apron — tosses noodles in a flat pan with a sharp wrist snap. SFX: footsteps on wet concrete, Thai conversation, noodle sizzle.
7-11s: REVERSE. Camera drifts back right toward the original station. A patron crosses foreground carrying a styrofoam container, briefly occluding the frame. Teal signage overhead flickers once. Crowd shuffles — hands gesturing, a woman adjusting a shoulder bag. SFX: crowd murmur, fluorescent buzz.
11-15s: WOK FLARE. Camera settles on the lead cook — adult woman, dark hair in high ponytail, blue-green t-shirt, green apron — as she tosses ingredients with a sharp upward flick. A burst of flame erupts from the wok throwing hard amber light across her face. Steam billows into the teal-lit canopy above. SFX: wok whoosh, flame roar, oil crackle.
[QUALITY]
Photorealistic 8K, natural 35mm grain. Realistic steam, smoke, and flame physics — no particle over-application. Wet pavement reflections track light sources accurately. Stable character features on lead cook. No duplicate vendors. No cuts. HDR for simultaneous flame highlights and shadow detail.
10-second cinematic video, nighttime in an empty USA intersection at a red traffic light. A black superbike is stopped at the line. The rider is an Asian female sport model wearing a full assassin-style black riding suit, sleek armored leather, dark visor helmet, intimidating and calm presence. No other cars or pedestrians anywhere on the street except one vintage convertible that slowly pulls up beside her. The convertible has exactly 5 men with different faces, hairstyles, tattoos, and gangster aesthetics, seated clearly in a clean arrangement: 2 in the front seats, 3 in the back seats. The car is blasting loud retro gangster hip-hop music.
One man turns toward the biker and says in English: "Lady, wanna race?" The men laugh loudly. The woman does not smile or speak; she stays completely expressionless, staring straight ahead. The men become slightly uneasy. The same man says in English: "Then when it turns green, the race begins." They laugh again, but more nervously.
Cinematic setup shots build tension: close-ups of the biker's gloved hand gripping the throttle, the convertible engine rumbling, the men smirking, tattoos, chrome details, wheels, exhaust vibration, the woman's still body posture, subtle heat distortion, low-angle hero shots, dramatic side profile shots, dashboard reflections, red traffic light glowing overhead. Rich cinematic color grading, moody contrast, high-end anamorphic look, realistic lighting, sharp composition, premium action-film atmosphere.
Camera focuses intensely on the traffic light. The exact moment the light turns green, the biker launches with an absurd supernatural sonic-speed burst, like a magical shockwave explosion of air and light, disappearing from frame instantly in less than a blink, leaving dust, smoke, light streaks, and a violent pressure wave. The camera remains mostly fixed after her disappearance. Focus shifts back to the 5 men in the convertible, all reacting in stunned silence and disbelief, shocked and confused. The moment is both cinematic and slightly funny. Hyper-realistic, dynamic camera language, dramatic pacing, ultra-detailed, clean background, no traffic, no extra people, no chaos beyond the shockwave.
Style tags: cinematic, action movie, dramatic tension, realistic, anamorphic lens, moody lighting, premium color grading, dynamic close-ups, ultra-detailed, humorous payoff, supernatural speed effect
Use a base identity reference image and preserve the subject's face 100% (no beautification or changes), with Elle Fanning as the main character while strictly maintaining her natural facial features, proportions, and realism. Place her in a modern ultra-high-rise office at night with floor-to-ceiling windows overlooking a vast city skyline. Add realistic investigation-style paper notes on the walls (wrinkled, taped, layered, partially overlapping). Use cinematic corporate lighting (cool blue city light + soft overhead + subtle warm desk lamp), shot on a 35mm lens with shallow depth of field, in a hyper-realistic documentary style, 4K quality. Then create a second image of a tall luxury NYC residential skyscraper at night, viewed from a distance with surrounding buildings, wet streets, atmospheric haze, cool exterior tones, and warm interior penthouse lights, shot on a 135mm telephoto lens with realistic proportions. Finally, generate an 8–10 second 1080p Kling 3.0 video using the skyscraper as the opening frame and the office as the ending frame, with a slow cinematic push-in toward a specific lit window, natural glass reflections, seamless transition into the interior (no cuts or morphing), realistic exposure shift from exterior to interior, and Elle Fanning remaining still throughout with only subtle breathing, no expression change.
Style tags: cinematic, documentary, realistic, night, urban, portrait
{
"format": "15s / free rhythm / ONE CONTINUOUS SHOT / worm's eye rear follow, loopable",
"subjects": [
"A 10cm commuter in an office suit fights through a packed Seoul Metro carriage, trying to stay ahead of shifting feet and reach a narrow lane before it closes again.",
"Full-size Seoul passengers stand packed shoulder to shoulder in the aisle, filling the car from bench to bench, including students, office workers, and everyday commuters in varied attire."
],
"environment": "A clean Seoul Metro carriage with bright Hangul route displays, polished steel poles, pale floor panels, phone straps, canvas totes, backpacks, and cool window reflections. Crisp fluorescent carriage light mixes with soft tunnel flicker, turning shoe edges, swinging hems, and dangling bags into precise moving obstacles.",
"mood": "Tight, fast, and controlled, driven by crowd rhythm, polite compression, and constant foot readjustment.",
"color_logic": "Naturalistic Film Print Emulation",
"scene": "The camera stays in one uninterrupted worm's eye rear follow at ankle height with a stable 24mm spherical feel, trailing the tiny commuter through a Seoul Metro aisle packed with passengers. White sneakers shuffle inward, dark loafers pivot, heels reset beside benches, and tote straps sway overhead. The commuter runs along a floor seam, dodges descending shoes, darts under swinging hems, and threads through closing gaps between footwear. A seated passenger’s shoe slides forward; the commuter steps onto the toes, runs across, drops off the edge, and slips between a loafer and sneaker sole. The crowd compresses again, feet adjust for balance, and reflections of Hangul route lights glide across windows. The commuter skids, grabs a seat support, avoids a heel strike, and bursts into a narrow lane between crossed calves. The loop closes as the corridor geometry resets with the same sprinting path.",
"sfx": [
"train hum",
"sneaker squeak",
"leather creak",
"fabric rustle",
"soft heel taps",
"bag buckle click",
"polite door chime",
"carriage drone"
]
}
FORMAT: 15s / 145 BPM / 15 SHOTS / beat-synced routine
SUBJECT: @[image1] (attach your reference image)
WARDROBE: Sleep tee and lounge shorts at home. Tailored jacket, fitted top, trousers, and lace-up shoes outside.
ENVIRONMENT: Tiny apartment, bright fridge glow, rain-dusted hallway, chrome metro, clean office, then a bedroom in cool window light. Everything feels glossy and lived-in.
MOOD: Late-for-work panic, clipped momentum, breathless urgency, then an exhausted exhale.
MUSIC: Fast percussive electro-pop
COLOR LOGIC: Hyperreal Pop Look
STYLE: Ultra-Realistic.
LOGIC RULE: Keep logical consistency in wardrobe, props, locations, and action continuity across all shots.
SHOT 1: ECU, 85mm push-in / 06:50 on the phone screen as it shakes on rumpled sheets. / SFX: alarm, sheet rustle.
SHOT 2: WS, 35mm handheld jolt / Rhythmic cut into her jolting upright through side light, throwing the blanket aside, and planting her feet on the floor in one rushed motion, still in a soft sleep tee and lounge shorts. / SFX: mattress bounce, blanket whip, sharp breath.
SHOT 3: MCU, 50mm slide / Cut on action into face wash at the sink, droplets catching the top light. / SFX: faucet rush, water slap.
SHOT 4: Insert shot, 85mm rack focus / Match cut into the toothbrush held at a natural forward brushing angle against the front teeth, hand relaxed and upright, mint foam at the lips, her eye visible in the mirror. / SFX: bristle scrape, sink drip.
SHOT 5: Interior fridge view, 24mm wide / Object pass into the camera inside the fridge looking out as the door snaps open and her hand darts in, blue fridge light framing a hurried grab for breakfast ingredients. / SFX: fridge hum, bottle clink, shelf rattle.
SHOT 6: Insert shot, 50mm handheld / Rhythmic cut into eggs and toast hitting the pan under warm practical light. / SFX: butter sizzle, chop tap.
SHOT 7: MCU, centered 50mm push-in / Match cut into one rushed bite, a quick clock glance, and an immediate rise from the chair. / SFX: crunch, ceramic clink, chair scrape.
SHOT 8: Bird's-eye insert, 35mm overhead / Cut on action into striped socks snapping on. / SFX: fabric stretch, heel tap.
SHOT 9: MS, 35mm pivot / Camera wipe into a rushed outfit change as the sleep tee disappears under a fitted top and tailored jacket, then her tote, keys, and transit card get scooped up in one messy grab. / SFX: fabric whip, key jingle, zipper pull, bag rustle.
SHOT 10: Insert shot, 50mm overhead / Match cut into lace-up shoes slamming on as the laces yank tight in one impatient pull. / SFX: sole thump, lace tug, short breath.
SHOT 11: WS, 24mm parallax / Whip pan transition into her, now in the tailored outside outfit, rushing through the apartment door into corridor light without breaking stride. / SFX: latch click, rapid footsteps, hallway air.
SHOT 12: MS to CU, 35mm glide into 85mm push-in / Sound bridge into the metro car interior only as she grips the pole, shifts with the carriage sway, checks the passing station lights, and snaps a tense glance toward the closing doors, reflected chrome streaking around her and the city smearing outside the window. / SFX: rail clatter, carriage screech, door warning chime, tight breath.
SHOT 13: Insert to MCU, 50mm snap zoom / Smash cut to the office entrance as her access card hits the reader, the glass door unlocks, and she slips through fast, then rolls her chair in and opens the laptop. / SFX: badge beep, door click, laptop chime.
SHOT 14: OTS, 35mm handheld / Rhythmic cut into fingers racing across keys, chat windows blinking, coffee by the trackpad, and notifications stacking faster than she clears them. / SFX: keyboard burst, notification ticks, mouse click.
SHOT 15: WS, 50mm pull-out / L-cut with a match from laptop close to apartment re-entry as the jacket drops, work clothes peel away, and she changes back into sleepwear before collapsing into bed in the opening frame shape. / SFX: door shut, bag drop, fabric rustle, blanket rustle, room tone.
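TIMING NOTE: The FORMAT line asks for 15 shots in 15 seconds at 145 BPM. At that tempo one beat lasts 60/145, about 0.414 s, so the clip holds 36.25 beats and an even 1-second shot spans roughly 2.42 beats, which means evenly spaced cuts drift off the beat. The sketch below is one possible way to plan cut points, assuming "beat-synced" means each cut is snapped to the nearest whole beat. That reading is an assumption, not something the prompt states, and the function names are hypothetical.

```python
from typing import List


def beat_seconds(bpm: float) -> float:
    """Length of one beat in seconds."""
    return 60.0 / bpm


def snapped_cuts(duration_s: float, bpm: float, shots: int) -> List[float]:
    """Return shots + 1 cut times (seconds), interior cuts snapped to beats.

    Assumption: evenly spaced ideal cuts, each moved to the nearest whole
    beat. The first cut stays at 0 and the last stays at the clip end.
    """
    beat = beat_seconds(bpm)
    total_beats = duration_s * bpm / 60.0
    cuts = [0.0]
    for i in range(1, shots):
        ideal_beats = i * total_beats / shots  # where an even cut would fall
        cuts.append(round(ideal_beats) * beat)  # snap to nearest beat
    cuts.append(float(duration_s))
    return cuts


if __name__ == "__main__":
    cuts = snapped_cuts(15, 145, 15)
    for shot, (start, end) in enumerate(zip(cuts, cuts[1:]), start=1):
        print(f"SHOT {shot}: {start:.2f}s to {end:.2f}s")
```

Under this assumption the shots alternate between two and three beats long rather than running exactly one second each, and the final shot absorbs the leftover quarter beat.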