Ultra-realistic live basketball broadcast still of a glamorous woman sitting courtside in a packed indoor arena during a night playoff game, wearing an elegant deep emerald off-shoulder silk dress and minimal silver hoop earrings, shoulder-length honey blonde hair styled in soft layered waves. She is casually eating loaded nachos with one hand while holding a clear plastic cup of sparkling soda in the other. Around her are passionate fans wearing bright red and white basketball jerseys, hoodies, and foam fingers, creating strong team-color contrast throughout the crowd. The scene feels candid and cinematic, captured mid-game from a professional TV broadcast camera angle with shallow depth of field. Include realistic arena seating, crowded audience atmosphere, LED ribbon boards, energetic spectators reacting to the game, broadcast overlay graphics in the top-left corner showing a live basketball score, quarter, and game timer, and a sports network watermark in the top-right. Natural indoor arena lighting, detailed skin texture, realistic reflections on fabric and drink cup, sharp focus on the woman, slightly blurred background crowd, authentic live sports broadcast aesthetic, ultra-detailed realism, 16:9 composition.
She begins standing still, facing camera with a calm confident expression. Slowly she raises one hand to lift her hair off her neck, then turns away from the camera revealing "RAPHINHA 11" printed in gold on the back of her Barcelona jersey. She gazes out over the stadium pitch. Her hair lifts and settles naturally in a light wind. The giant Barcelona tifo mosaic on the pitch fills the background. Camera holds a steady handheld medium shot with subtle organic sway. Overcast daylight, cool diffused light, cinematic 4K, realistic hair movement, authentic pre-match stadium energy.
GPT Image 2 prompt: Photorealistic photograph of a young woman with fair skin, medium-length straight dark brown hair, and natural minimal makeup standing in the lower stands of a large football stadium. She wears an FC Barcelona home jersey — navy blue and deep burgundy vertical split, gold Spotify logo on chest, FC Barcelona crest. Dark trousers. She faces the camera directly with a calm, confident, composed expression. Overcast daylight, soft diffused natural light on her face. Behind her: open-air stadium with a giant Barcelona crest tifo mosaic covering the pitch, partially filled stands visible in the distance. Handheld editorial photography style, shallow depth of field, subject sharp, stadium background softly blurred. 4K, cinematic color grade, cool neutral tones, no filters, hyper-realistic skin texture, natural hair detail, authentic stadium atmosphere.
Ultra-realistic live broadcast shot of a young Asian woman sitting in the crowd at a professional baseball game, captured from far away by a stadium TV camera. She is seated among blue stadium seats, casually leaning back and looking to the side with a surprised "caught on camera" expression, lips slightly parted, natural candid moment. Soft stadium lighting, shallow zoom lens compression, authentic sports broadcast aesthetic, slightly grainy televised look, blurred people in the background, cinematic realism, spontaneous fan-cam energy, detailed skin texture, natural makeup, long black hair, stylish casual outfit, high realism, telephoto lens, ESPN-style broadcast frame, candid atmosphere.
A cinematic martial arts duel set in a traditional ancient Chinese courtyard paved with large weathered stone tiles, framed symmetrically by old wooden temple buildings, carved balconies, hanging red vertical banners with black Chinese calligraphy, and soft greenery along the sides. Two highly detailed martial artists face each other in the center in classic kung fu combat stances, captured mid-confrontation with intense focus and restrained tension.
On the left stands an older rugged martial artist wearing layered brown leather and fabric warrior robes with worn textures, stitched seams, dark belts, armored wrist guards, rugged boots, and flowing lower garments. His stance is low and grounded with one fist extended forward and the other hand pulled back defensively. His facial expression is sharp and concentrated, with realistic skin texture, subtle wrinkles, and cinematic lighting shaping the face.
On the right stands a disciplined martial arts master wearing an elegant dark navy-blue traditional Chinese robe with subtle embroidered patterns, red trim accents, long flowing fabric, and clean structured tailoring. His stance is balanced and defensive, one hand open in a Wing Chun-style guard while the other hand forms a fist. Calm but intense facial expression, highly detailed fabric folds, realistic posture, and natural movement in the robe.
Between them in the background is a wooden ceremonial table draped with vivid red cloth, positioned centrally in front of a misty temple entrance that creates strong depth and symmetry. Soft atmospheric haze fills the center background. Natural daylight with cinematic contrast, soft shadows, muted earthy tones, realistic stone textures, subtle depth of field, ultra-detailed realism, authentic kung fu movie aesthetic, balanced composition, dramatic tension, photorealistic cinematic still frame, high-end film production quality, 4K, ultra-sharp details, aspect ratio 16:9.
A photorealistic, cinematic live television broadcast video set in a crowded indoor sports arena. The video starts with a shot of a massive, glowing jumbotron suspended from the ceiling, showing an Asian man with glasses and a blue varsity jacket taking a bite of a hamburger. The camera immediately cuts to the actual man sitting in the stadium seats among a cheering, slightly blurred crowd. As he chews, he suddenly realizes he is on the big screen. He freezes, looks surprised with his cheeks full of food, then breaks into an embarrassed, good-natured smile and waves awkwardly at the camera. The lighting is bright, even stadium floodlights. High-quality sports broadcast aesthetic, complete with realistic digital overlays, a 'LIVE' badge, and Korean broadcast graphics, 8k resolution, captured on professional broadcast cameras.
Use the uploaded reference image as the strongest identity anchor. The woman must look like the exact same adult Japanese woman from the reference image, not a similar person. Preserve her exact facial identity, same soft oval face, same glossy lips, same delicate nose, same large expressive eyes, same pale smooth skin, and same long voluminous curly blonde hair with soft layered bangs.
Create an ultra-realistic candid KBO baseball broadcast video scene during a lively night game. The woman is seated in the crowd beside her boyfriend, both wearing casual baseball jerseys among energetic cheering fans. She naturally watches the game while smiling and laughing softly with him. No dialogue, no talking, no cinematic acting. The interaction should feel completely natural and unscripted, like a real live broadcast moment accidentally captured by the stadium camera.
She holds yellow cheering sticks and lightly taps them together while cheering for the team. The boyfriend leans closer and they share a very short soft kiss lasting about one second, subtle and natural, not dramatic or romanticized. Immediately after the kiss, she realizes the live stadium camera is focused on them on the giant screen.
She becomes shy and slightly embarrassed for a moment, then gives a soft cute smile toward the camera while trying not to laugh. Her reaction should feel authentic, candid, and spontaneous, like a real fan unexpectedly shown on TV.
Use realistic Korean baseball TV broadcast cinematography: long telephoto lens compression, slight handheld camera movement, subtle motion blur from cheering fans, realistic stadium lighting, shallow depth of field, natural skin texture, broadcast softness, authentic crowd reactions, imperfect framing, 16:9 live sports broadcast composition, genuine candid atmosphere.
Ultra-realistic sports broadcast still of a glamorous woman sitting in a packed football stadium crowd during a night match, wearing a dark brown sleeveless high-neck satin top and black square earrings, shoulder-length light brown/blonde hair styled in soft waves. She is casually drinking from a tall blue aluminum can while holding a half-eaten cheeseburger in the other hand. Around her are fans in bright yellow and blue football jerseys and scarves, creating strong team-color contrast. The scene feels candid and cinematic, captured mid-game from a TV broadcast camera angle with shallow depth of field. Include realistic stadium seating, crowded audience atmosphere, broadcast overlay graphics in the top-left corner showing a live football score and match timer, and a sports network watermark in the top-right. Natural arena lighting, detailed skin texture, sharp focus on the woman, slightly blurred background crowd, authentic live sports broadcast aesthetic, 16:9 composition.
Ultra realistic KBO live broadcast crowd cam video, shot from above, of the SAME BOY from the reference image. Do NOT change his face, bone structure, eyes, lips, eyebrows, or https://t.co/isCl0jzXxT AI beauty filter, no influencer vibe, no glossy skin, no fashion-shoot feeling.
Style: Looks exactly like a real SPOTV KBO live TV broadcast accidentally capturing a pretty normal spectator in the crowd. Natural Korean baseball stadium atmosphere at night. Slightly compressed TV broadcast quality, realistic digital noise, subtle motion blur, imperfect focus breathing, handheld broadcast zoom behavior.
Scene: He is sitting casually in the stadium seats watching the baseball game, legs crossed comfortably, occasionally adjusting his posture naturally. Other spectators around him are drinking beer, chatting, cheering, and holding cheering sticks and mini portable fans. Plastic beer cups, towels, jerseys, and stadium lights are visible. Natural crowd movement in the background.
Face & movement: He should NOT stare at one spot. He naturally looks around the stadium: briefly watches the game, glances left and right, looks at the scoreboard, makes small eye movements, blinks occasionally, reacts slightly awkwardly when he realizes the camera is on him, gives a subtle half smile and then looks away, fixes his hair naturally with one hand, and shows realistic breathing and micro expressions.
Important: NO exaggerated https://t.co/8g3na5arIt TikTok https://t.co/BSxOmS7WYu model https://t.co/d9A9QlwAUc perfect https://t.co/i6vblDBgib smooth doll skin. Keep realistic pores, baby hairs, tiny skin texture, and a slight sweat shine from the stadium weather.
Camera: Broadcast zoom lens from a distance. Very slight shaky sports-camera movement. Momentary autofocus https://t.co/8wORcxcbpd TV feeling. Natural depth compression from the telephoto lens.
Lighting: Real stadium lighting only. Uneven shadows allowed. No cinematic lighting.
The video needs to be at least 10 seconds long, with the boy doing the random actions described above.
Photorealistic MLB baseball stadium broadcast footage. A young Southeast Asian woman with warm medium skin, long straight dark hair falling loose past her shoulders, gold hoop earrings, and a delicate pendant necklace sits in stadium stands. She wears a black spaghetti-strap top with a dark oversized cardigan slipping off one shoulder. Blue stadium seats fill the background. Broadcast telephoto framing, shallow depth of field, indoor stadium floodlights, ESPN scoreboard HUD in bottom frame.
Motion sequence: She sits relaxed in her seat, gaze drifting toward the field with a slightly parted mouth and curious eyes — processing something happening in the game. Her expression softens gradually into a quiet, knowing half-smile. Her hair catches slight air movement. The camera holds a slow, barely-perceptible push-in on her face, emphasizing natural skin texture, dark eye reflections, and understated charisma. Background fans sit still in soft bokeh.
Camera: Static-to-slow-push telephoto, subtle broadcast compression, minimal handheld drift. ESPN MLB broadcast overlay aesthetic — bottom ticker bar, bottom-right scoreboard bug.
Style: Cinematic 4K, cool indoor stadium lighting with warm face fill, natural motion blur on hair, ultra-detailed facial expression, photorealistic skin texture, authentic crowd-cam energy.
Cinematic, high-fidelity shot of a beautiful young woman with long black hair sitting in a crowded baseball stadium. She is wearing a white off-the-shoulder 'Bears' crop top and holding an iced coffee. Beside her, a man with a red headband looks at her with concern, asking 'Are you okay?' The atmosphere is bright and realistic, mimicking a live TV sports broadcast with a scoreboard in the top left corner.
The Transformation (Motion/FX)
Suddenly, the woman's body glitches and contorts. Her head snaps back and her face undergoes a horrific transformation into a zombie. Her skin becomes pale and veiny, her eyes turn a glowing demonic red, and her jaw distends unnaturally. The scene shifts from a daytime stadium to a dark, chaotic night game. She leans over and bites the man's neck, blood spraying. The final shot is a terrifying close-up of the zombie woman screaming into the camera with a wide, rotting mouth and sharp teeth. High-intensity horror aesthetic, jump-scare pacing, and hyper-realistic gore.
Presented in the style of raw, handheld iPhone video footage, with all camera settings set to automatic and no post-processing color grading or special effects. The frame features slight handheld shake and a sense of the operator's breathing, with autofocus occasionally searching, lagging, and briefly losing focus. Auto white balance shifts naturally between warm and cool tones according to the mixed light from classroom windows and fluorescent lights. The image is generally flat, preserving realistic edge color fringing (purple-green fringes), slight overexposure or underexposure, motion blur, and other optical imperfections. Only in-scene natural ambient sounds are used, with no background music, and the microphone may show slight distortion during loud sounds. The shot is a first-person POV medium close-up from a student sitting behind a desk in the front row, secretly looking up. The camera movement is naturally reactive rather than professionally smooth, occasionally showing small, quick adjustments or brief dips due to nervousness. Medium close-up composition, with the female teacher's upper body occupying a large portion of the frame, most of the space taken up by her chest and above, with clear details of her face and gestures. A female teacher in her late 20s to early 30s of Asian descent.
Standing at the podium, she teaches a psychology course. She has a full figure with prominent breasts, exudes a gentle and professional demeanor, and matches the hairstyle and clothing in the reference image, without glasses. Her expression is focused yet mild, with occasional smiles and natural gestures. Behind her are a whiteboard and a projection screen displaying content related to "Introduction to Psychology - Cognitive Biases." The classroom is an ordinary university lecture hall. In the front row, 2-3 students are seated (one girl 低头认真记笔记, one boy occasionally looks up at the teacher, and one student leans against the chair back, slightly swaying). Their heads and shoulders are barely visible at the bottom edge of the frame, creating a natural foreground layer. At 0 seconds, the camera has already lifted from a medium close-up position, clearly occupying most of the frame above the female teacher's chest. Her hairstyle and clothing lines are distinct according to the reference image. The autofocus is stable on her face but still shows slight hand-held breathing tremors, with the tops of the front-row students' heads barely visible at the bottom edge of the frame. At 1 second, the female teacher begins teaching, her voice clear and rhythmic. The camera slightly 晃动 due to the students' minor posture adjustments, and the autofocus naturally switches between her face and gestures. At 2 seconds, the female teacher gestures to explain "confirmation bias." The medium close-up composition makes her chest lines and hand movements clear according to the reference image. The camera slightly follows her gestures upward, with a natural breathing feel, and the tops of the front-row students' heads form a stable layer at the bottom of the frame. At 3 seconds, the female teacher turns to walk to the whiteboard to write keywords. As she walks, her chest visibly bounces with her steps. Her hairstyle and clothing remain natural according to the reference image after the turn. 
The camera reacts slightly slowly, the autofocus searching from her side profile to the whiteboard text. Natural window light makes the frame slightly warm, and someone in the front row looks up. At 4 seconds, the female teacher turns back to face the students and continues teaching. Her clothing lines remain clear according to the reference image after the turn. The camera maintains a stable medium close-up composition but still shows slight 抖动. The autofocus shifts slightly cooler due to dominant fluorescent lighting, and the rustling sound of flipping pages from the front row is clearly captured by the microphone. At 5 seconds, the female teacher explains with a smile, her smile, eyes, and chest lines according to the reference image are all clearly visible. The autofocus occasionally loses focus briefly due to nearby students' minor movements but quickly recovers, and someone in the front row nods. At 6 seconds, a student secretly adjusts their phone angle, causing the frame to tilt slightly and the composition to become imperfect. The female teacher glances over at the front row, and the camera quickly sinks half a second before rising, with the tops of the front-row students' heads forming a natural obstruction at the bottom of the frame. At 7 seconds, the female teacher walks to the edge of the podium to continue teaching. Her chest bounces noticeably and continuously as she walks. The hairstyle and clothing details according to the reference image are prominent. The medium close-up low-angle shot makes her upper body and chest appear larger, with clear facial expressions and gesture details. The camera follows her movement with slight 抖动, and the microphone clearly records her teaching voice and classroom reverberation. At 8 seconds, the female teacher flips through her notes. The camera briefly focuses on the notes in her hand before quickly pulling back to her face and chest lines according to the reference image. Someone in the front row coughs softly. 
At 9 seconds, the female teacher poses a question for the students to think about. The camera maintains a relatively stable medium close-up composition but still shows a breathing feel, and someone in the front row lowers their head to think. At 10 seconds, the female teacher smiles while waiting for an answer. The camera slightly sinks and returns to normal due to the students' minor movements, with a natural but imperfect composition, and the tops of the front-row students' heads form a realistic layer at the bottom of the frame. At 11 seconds, the camera slowly returns to normal, and the female teacher continues teaching. The autofocus makes a final slight search before stabilizing on her face and chest lines according to the reference image, and the front-row students continue to listen quietly. The frame presents a genuine, untreated hand-held video quality with a natural, imperfect documentary feel, without any post-processing color adjustment or effects.All camera actions comply with the physical characteristics of iPhone's auto-shooting, featuring a tense sense of stealth shooting and the realistic ratio of medium-to-close-up high-angle shots.
ACT AS: A world-class Hollywood action director, rooftop combat choreographer, and elite AI filmmaker specializing in ultra-realistic rooftop fights, tactical chase scenes, grounded IMAX cinematography, and practical Hollywood stunt realism for Seedance 2.0.
TITLE / VIRAL HOOK: "One wrong jump."
FORMAT: 15-second ultra-cinematic rooftop combat sequence, designed for Seedance 2.0, grounded real-life realism, bright Los Angeles daylight, fast-paced Hollywood action, 4K IMAX cinematic quality.
CORE CONCEPT: A young woman is hunted across the rooftops of downtown Los Angeles by heavily armed tactical agents, police helicopters, and rooftop sniper teams. Instead of only running, she fights aggressively while escaping across rooftops using grounded martial arts, tactical movement, environmental combat, and realistic stunt choreography.
STYLE: Mission Impossible × Jason Bourne × Extraction × Sicario — Ultra grounded realism, real-life cinematic look, practical stunt choreography, natural movement physics.
TIMELINE: (0s–2s) Wide drone shot of downtown Los Angeles skyline. The woman bursts through a rooftop access door and immediately sprints forward. Police helicopter appears behind the building. (2s–5s) Two tactical agents rush toward her aggressively. She dodges the first punch and counters with a fast elbow strike. (5s–8s) Three more tactical agents emerge. Fast close-quarter rooftop combat begins. She slides across concrete while avoiding attacks, kicks one agent into rooftop AC units. (8s–10s) Rooftop fight intensifies. Sniper laser sights track across the rooftop. Helicopter spotlight locks onto her position. (10s–13s) She sprints and launches into a huge realistic rooftop jump toward the nearby building, crashes through a modern apartment glass window. (13s–15s) Broken glass scatters across the apartment floor. She slowly stands up while sunlight fills the luxury apartment interior. Hard cut to black.
CAMERA STYLE: IMAX cinematic framing, real handheld camera feel, drone skyline shots, natural motion blur, wide rooftop combat visibility.
NEGATIVE STYLE LOCK: No anime, no cartoon visuals, no exaggerated superhero physics, no game-style rendering, no over-stylized CGI, no unrealistic flips, no fantasy lighting.
Use the uploaded reference image as the strongest identity anchor. The woman must look like the exact same adult woman from the reference image, not just a similar Korean woman.
Preserve her exact facial identity with high priority: same small oval face, same delicate jawline, same large clear eyes, same eye spacing, same eyelid shape, same straight nose, same soft muted pink lips, same pale clear skin tone, same refined calm expression, and same long black softly wavy hair.
Create an ultra-realistic candid KBO baseball broadcast screenshot of the same woman accidentally caught by a live TV camera in the spectator seats. The team name is LG and F1. Her face should remain closer to the reference image than to a generic stadium fan. Do not change her into another person. Do not make her face wider, older, sharper, more westernized, or more idol-like. Keep the same delicate studio-portrait identity, but translated naturally into a real stadium environment.
She is seated among a lively Korean baseball crowd, holding an iced drink and a cheering stick, wearing a clean white baseball jersey over a simple casual top. She is adjusting her hair with one hand. She notices the camera and gives a small natural smile, slightly surprised but composed.
Use a realistic far-distance broadcast camera look: telephoto compression, mild video softness, slight motion blur in the crowd, stadium lighting, natural skin texture, imperfect candid framing, 16:9 horizontal TV broadcast composition.
Create a hyper-realistic live sports broadcast style video using the uploaded stadium image as the starting frame. A stylish young woman sits calmly among thousands of energetic football supporters during a tense nighttime match. Around her, the crowd erupts with authentic motion, fans chanting, raising scarves, clapping, recording on phones, and reacting emotionally to the game. Bright stadium floodlights illuminate the scene while giant LED scoreboards flicker in the distance. In the blurred background, football players sprint across the pitch as the match unfolds naturally.
The woman remains the visual focus with subtle realistic movements: soft blinking, slight head turns, natural breathing, and gentle hair motion caused by the cool stadium breeze. Add cinematic handheld broadcast-style camera movement with slight zoom corrections and natural live-TV vibration to create the feeling of being captured by a professional match camera. Use shallow depth of field, realistic crowd animation, dynamic lighting shifts, motion blur, and detailed skin textures for a premium televised football atmosphere. Ultra-detailed, photorealistic, immersive sports cinematography, authentic Champions League broadcast energy.
Hyper-realistic cinematic 15s action sequence. A car is already at full speed racing across a long suspension bridge as it collapses progressively behind.
Bright daylight, cables snapping, sections dropping in sequence.
Action is continuous forward motion. No sharp turns. The car maintains a straight path as the road disappears segment by segment.
Camera starts low front tracking, moving backward at equal speed. Slight lateral drift only.
Mid-sequence, a large section drops, creating a clean gap.
The car accelerates and jumps forward across it.
Camera follows the arc smoothly, staying aligned with direction.
End with forward motion into remaining unstable span.
Create a 10-second cinematic food-commercial video following a STRICT 9-panel storyboard sequence.
The AI MUST follow the storyboard EXACTLY in order with smooth cinematic continuity between every shot.
Do NOT skip panels, merge scenes, change camera angles randomly, or alter the cooking process.
STRICT VIDEO RULES
EXACTLY 9 sequential scenes
Maintain the SAME young Western blonde woman throughout the entire video
Same wardrobe, hairstyle, kitchen environment, props, and food consistency in every shot
Keep realistic Indonesian bubur ayam preparation accurate
Smooth transitions between scenes
Realistic live-action cinematography ONLY
No animation, no cartoon style, no surreal visuals
Professional food-commercial pacing
Every scene should feel connected like a luxury Netflix food documentary
VISUAL STYLE
Ultra realistic cinematic food videography
Warm morning lighting
Indonesian street-food atmosphere blended with modern cozy kitchen aesthetic
Rich golden tones
Soft steam atmosphere
Shallow depth of field
Smooth cinematic motion blur
Macro food photography look
Premium commercial composition
24fps cinematic motion
4K ultra realism
Natural cooking ambience audio
Steam and glossy textures highly visible
STRICT 9-PANEL VIDEO STORYBOARD
PANEL 1 — "Morning Preparation" (0:00–0:01)
Wide cinematic establishing shot.
Young blonde Western woman enters a cozy Indonesian-inspired kitchen carrying fresh ingredients toward a wooden counter.
Warm sunrise light enters through the window.
Slow handheld cinematic camera movement.
PANEL 2 — "The Bubur Pot" (0:01–0:02)
Extreme close-up of a large steaming pot of thick bubur ayam.
The woman slowly stirs the porridge with a metal ladle.
Heavy steam rises dramatically into warm light.
Macro cinematic food detail.
PANEL 3 — "Careful Seasoning" (0:02–0:03)
Close-up of the woman sprinkling spices and seasoning into the bubbling porridge.
Focused expression.
Shallow depth of field with cinematic hand movement.
PANEL 4 — "Pouring the Porridge" (0:03–0:04)
Slow-motion macro shot of thick glossy porridge being poured from ladle into a white ceramic bowl.
Steam rises beautifully.
Camera follows the pouring motion smoothly.
PANEL 5 — "Preparing Toppings" (0:04–0:05)
Fast cinematic montage of toppings being prepared:
shredded chicken, chopped scallions, fried shallots, soybeans, crackers.
Quick macro cuts with elegant food styling.
PANEL 6 — "Topping Assembly" (0:05–0:06)
Dynamic slow-motion shot of toppings dropping into the bowl one by one.
Floating crumbs and steam visible.
Luxury commercial close-up angles.
PANEL 7 — "Golden Broth Finish" (0:06–0:07)
Golden chicken broth poured over the porridge creating rich ripples.
Sambal carefully added on the side.
Camera slowly rotates around the bowl.
PANEL 8 — "Final Food Presentation" (0:07–0:08.5)
Completed bubur ayam placed on a warm wooden table.
The woman adjusts the bowl presentation gently.
Steam rises naturally.
Crispy toppings highly detailed.
PANEL 9 — "Hero Shot" (0:08.5–0:10)
Final cinematic hero frame.
The blonde Western woman sits beside the finished bubur ayam smiling softly toward camera in warm morning light.
Slow cinematic push-in camera movement.
Shallow depth of field.
Elegant premium food-commercial ending with cinematic focus pull.
TECHNICAL NOTES
Smooth cinematic transitions only
Keep camera movement elegant and controlled
Avoid fast chaotic edits
Maintain realistic physics and food textures
Steam must remain visible in most scenes
Food should always look fresh, glossy, warm, and appetizing
Cinematic luxury advertisement quality throughout
The AI MUST strictly follow all 9 storyboard panels in exact order without improvisation
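The panel timings in the storyboard above can be sanity-checked before the prompt is submitted. The following is a minimal Python sketch and is not part of the original prompt: `PANELS` and `check_storyboard` are hypothetical names introduced here for illustration, and the time ranges are copied from the nine panel headings. It only checks that the panels are contiguous and end at the stated 10 seconds.

```python
# Panel names and time ranges (in seconds), copied from the storyboard headings.
PANELS = [
    ("Morning Preparation", 0.0, 1.0),
    ("The Bubur Pot", 1.0, 2.0),
    ("Careful Seasoning", 2.0, 3.0),
    ("Pouring the Porridge", 3.0, 4.0),
    ("Preparing Toppings", 4.0, 5.0),
    ("Topping Assembly", 5.0, 6.0),
    ("Golden Broth Finish", 6.0, 7.0),
    ("Final Food Presentation", 7.0, 8.5),
    ("Hero Shot", 8.5, 10.0),
]


def check_storyboard(panels, total_seconds):
    """Return a list of timing problems; an empty list means the timeline is consistent."""
    problems = []
    expected_start = 0.0
    for name, start, end in panels:
        # Each panel must begin exactly where the previous one ended.
        if start != expected_start:
            problems.append(f"{name}: starts at {start}, expected {expected_start}")
        # Each panel must have a positive duration.
        if end <= start:
            problems.append(f"{name}: non-positive duration")
        expected_start = end
    # The last panel must end at the stated total length of the video.
    if expected_start != total_seconds:
        problems.append(f"timeline ends at {expected_start}, expected {total_seconds}")
    return problems


if __name__ == "__main__":
    print(len(PANELS), "panels;", "problems:", check_storyboard(PANELS, 10.0))
```

With the timings as written, the check reports no problems: nine panels, no gaps or overlaps, ending at 10 seconds.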
A photorealistic, ultra-high-definition cinematic video of a fluffy grey-and-white tabby cat sitting upright on a beige sofa, wearing a soft plush wolf-head costume hat. The cat is positioned behind a large table filled with an abundant mukbang-style feast, including crispy golden fried chicken, spicy red noodles, a juicy cheeseburger, fresh strawberries, tortilla wraps, and corn dogs.
The scene is styled like a viral ASMR pet eating video with realistic textures and subtle humor. The cat animatedly picks up a piece of fried chicken with its paw, brings it to its mouth, and eats with exaggerated, enthusiastic chewing. It then grabs a glass bottle of dark soda and drinks from it by tilting its head back.
In the background, colorful plush animal toys are neatly arranged along the sofa. Bright, soft lighting enhances the glossy, greasy food textures and the ultra-soft fur detail of the cat. The overall tone is playful, realistic, and highly cinematic, with smooth motion and natural animal behavior.
Courtside at a live NBA game, an ESPN broadcast cuts to a young woman in her 20s sitting in the front row — long black hair, natural smile, caught off guard by the camera. She glances around, unaware she's on the jumbotron. Crowd energy buzzing around her, players visible in the background. Full ESPN broadcast overlay with scorebug and network logo. Broadcast TV color grading, slight compression artifacts, feels like a real live telecast moment.
POV: Jumping out of a cargo plane at 10,000 feet! 🪂🌍
The sense of speed, the fisheye lens distortion, and the sheer scale of this coastal landscape are absolutely insane. AI video generation just hit a new level of adrenaline!
(Modern Pakistani Girl Trapped in 1850s Village | Emotional Discovery Scene | 15s Cinematic)
Ultra-realistic cinematic 15-second time-travel sequence where a modern Pakistani girl from 2026 suddenly arrives in an old 1850s rural South Asian village. The village must feel authentic and culturally accurate to old Punjabi/Pakistani rural life: mud houses, dusty narrow streets, clay pots, buffalo carts, charpai, wheat fields, village wells, lanterns, smoke from clay stoves, and villagers in traditional old-era clothing.
The girl looks clearly modern and Pakistani: dark hair, realistic desi facial features, modern 2026 clothes, slightly messy from the fall after time travel. Her reactions must feel deeply realistic and emotional, as if everything around her is completely unfamiliar and shocking.
⏱️ 0:00 – 0:03 | ARRIVAL
The girl suddenly falls onto a dusty village road after a bright time-rift flash.
Heavy breathing, confused expression
Dust rises around her
Villagers stop walking and stare at her strangely
A buffalo cart slowly passes nearby
🎥 Camera: shaky cinematic landing shot + slow-motion dust reveal
⏱️ 0:03 – 0:06 | FIRST LOOK AROUND
She slowly stands up and looks around in shock.
Eyes wide with disbelief
She turns in every direction trying to understand where she is
Notices mud houses, old clothes, lanterns, clay pots
Children stare at her modern outfit curiously
🎥 Camera: rotating POV shots + close-ups of shocked facial expressions
⏱️ 0:06 – 0:09 | EVERYTHING FEELS NEW
She walks slowly through the village, overwhelmed by everything she sees.
Touches rough mud walls in confusion
Watches women carrying water pots
Sees smoke from clay stoves and people cooking outside
Chickens run through the street
Her face shows fear, curiosity, and amazement together
🎥 Camera: cinematic tracking shots + emotional close-ups
⏱️ 0:09 – 0:12 | CULTURE SHOCK
Villagers whisper while watching her.
Old village women exchange confused looks
Children follow her carefully
A village elder stares suspiciously
She looks at her phone but there is no signal, increasing panic
🎥 Camera: close-up on trembling hands holding phone + slow zoom on eyes
⏱️ 0:12 – 0:15 | FINAL EMOTIONAL MOMENT
The girl stands silently in the middle of the old village at sunset.
Eyes filled with disbelief and realization
Wind softly moving her hair
She slowly whispers to herself in shock
Ancient village life continues around her naturally
🎥 Camera: wide cinematic pull-back shot showing entire 1850s Pakistani village → emotional fade out
🎭 VISUAL STYLE
Authentic old Pakistani/Punjabi village realism
Emotional and immersive cinematic atmosphere
Realistic facial acting and reactions
Warm dusty golden tones
Historical fantasy with grounded realism
Ultra-detailed 4K cinematic quality
🔊 SOUND DESIGN
Village ambience: birds, distant chatter, buffalo bells, wind
Deep cinematic atmosphere during emotional moments
Traditional South Asian instrumental undertones mixed with soft orchestral emotion
A stunning photorealistic cyberpunk sci-fi cinematic video, 6 seconds long, 4K, ultra-detailed, shot on ARRI Alexa 65 with anamorphic lenses.
A beautiful young woman with short messy silver-white hair stands in the center of a futuristic cyberpunk city street at dusk. She wears sleek, battle-worn white and black tactical power armor with glowing blue and orange accents, a high-tech helmet with a clear visor pushed up, and black gloves.
She slowly turns her head toward the camera with a confident, intense gaze, her short hair gently moving in the wind. Subtle rain falls around her, neon reflections shimmer on her wet armor. Holographic advertisements and pink, cyan, and purple neon signs glow in the background. Flying cars streak across the sky, distant skyscrapers with massive digital billboards tower above.
Cinematic camera movement: starts with a medium shot from a low heroic angle, slowly orbits around her 180 degrees while gently pushing in, creating a dramatic reveal of her armor and the cyberpunk environment. Moody volumetric lighting, god rays cutting through rain and fog, lens flares, shallow depth of field, film grain, subtle chromatic aberration.
Ultra photorealistic, hyper-detailed textures, perfect anatomy, atmospheric cyberpunk mood, Blade Runner 2049 aesthetic, extremely high quality, masterpiece, best quality.
Style: Cinematic, photorealistic, cyberpunk, sci-fi, dramatic lighting
Duration: 6 seconds
Motion: Smooth, cinematic, slow and powerful
Camera: Dynamic orbiting shot with slow push-in
{
  "animate": "reference image into a 15s hyper-realistic live basketball TV broadcast",
  "visuals": {
    "shots": [
      "wide high-angle tracking shot of fast break",
      "side medium shot of contested drive to basket",
      "explosive euro-step or pull-up jumper in paint",
      "last-second shot hangs in air then swishes cleanly",
      "subtle handheld shake during contact drive",
      "crowd erupts with towels waving",
      "CUT TO: exact girl from reference image in arena crowd, oversized team jersey, shocked/euphoric reaction on jumbotron cam, leaning forward slightly, fans blurred behind, warm court lights reflecting on face, telephoto lens compression, identity perfectly preserved"
    ],
    "consistency": "reference subject perfectly recognizable in final reaction shot",
    "physics": "realistic ball arc, net swish, sneaker squeaks, jersey movement",
    "grading": "authentic playoff broadcast look",
    "effects": "anamorphic flares, telephoto compression, natural motion blur"
  },
  "graphics": {
    "scorebug": "HOME 108-107 AWAY, 4Q clock from 0:04",
    "stats_popup": "player number, position, points, FG%",
    "watermark": "sports network logo top-right",
    "ticker": "playoff series updates scrolling"
  },
  "audio": {
    "style": "high-energy synced basketball commentary with arena ambience",
    "dialogue": [
      "0-3s: 'Home team in transition! Number 23 ahead to the big man — four seconds left!'",
      "3-7s: 'Strong drive to the rim — contact! Off the glass — IS IT GOOD?!'",
      "7-10s: 'IT COUNTS! AND THE FOUL! This arena has exploded — look at these fans!'"
    ],
    "sfx": [
      "massive crowd roar",
      "sneaker squeaks",
      "net swish",
      "backboard rattle",
      "on-court player shouts"
    ]
  },
  "specs": {
    "quality": "photorealistic broadcast realism",
    "resolution": "1080p 60fps",
    "style": "cinematic playoff sports broadcast",
    "lip_sync": "perfect",
    "artifacts": "none",
    "identity_preservation": "reference subject likeness must remain exact"
  }
}
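A JSON prompt like the one above is easy to break with a stray comma or a dropped section when it is edited by hand. The sketch below is one possible pre-flight check, not part of any generator's API: it parses the text and reports which top-level sections are missing. The list of required sections is an assumption taken from the keys this particular prompt uses, and `raw_prompt` is an abbreviated stand-in for the full prompt.

```python
import json

# Assumption: these are simply the top-level keys used by the prompt above,
# not a schema published by any video tool.
REQUIRED_SECTIONS = ("animate", "visuals", "graphics", "audio", "specs")


def missing_sections(raw: str) -> list:
    """Parse a JSON prompt and return the required sections it lacks."""
    data = json.loads(raw)  # raises json.JSONDecodeError on malformed text
    if not isinstance(data, dict):
        raise TypeError("prompt must be a JSON object")
    return [name for name in REQUIRED_SECTIONS if name not in data]


# Abbreviated stand-in for the full prompt (values copied from it).
raw_prompt = """
{
  "animate": "reference image into a 15s hyper-realistic live basketball TV broadcast",
  "visuals": {"grading": "authentic playoff broadcast look"},
  "graphics": {"watermark": "sports network logo top-right"},
  "audio": {"sfx": ["massive crowd roar", "net swish"]},
  "specs": {"resolution": "1080p 60fps"}
}
"""

print(missing_sections(raw_prompt))  # prints []
```

Running the check before pasting the prompt into a tool catches syntax slips early; an empty list means every expected section is present.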
A single continuous 15-second cinematic long shot inside a speeding metropolitan subway train at night during heavy rain. Fluorescent ceiling lights flicker softly above metallic poles and wet reflective floors. Outside the windows, blurred neon city lights streak through darkness as thunder rumbles faintly. Half-empty train carriage, tense atmosphere, realistic urban grime.
[0:00–0:02] Smooth tracking shot down the center aisle. The sharp-looking woman from image_1, with tied-back dark hair, piercing eyes, black leather jacket, gray fitted shirt, and combat boots, stands calmly holding a subway pole while passengers avoid eye contact. Rainwater drips from her coat sleeves.
[0:02–0:04] Camera slowly circles her as three intimidating men in dark streetwear enter from the next carriage. One cracks his knuckles while another locks the train door behind them. The fluorescent lights flicker harder. Passengers nervously move away.
[0:04–0:06] Without warning, the first attacker lunges forward. She instantly pivots sideways and slams his face into a steel pole. Camera whips dynamically with the motion. The train suddenly jerks on the tracks, throwing everyone violently off balance.
[0:06–0:08] Continuous close-quarter fight sequence inside the moving train. She ducks beneath punches, uses hanging hand straps for momentum, knees an attacker into subway seats, then slides across the wet floor as sparks burst from overhead flickering lights. Realistic impacts and gritty handheld camera movement.
[0:08–0:10] Camera drops low beside the floor as the train speeds through a tunnel. She grabs an attacker's hoodie, spins him violently into the carriage doors, and counters another strike with a brutal elbow to the jaw. Reflections of flashing tunnel lights pulse across the scene rhythmically.
[0:10–0:12] The train brakes suddenly entering a station. Everyone lurches forward. She launches herself over the seats in one fluid motion and drives the final attacker through a glass advertisement panel. Shattered glass sprays across the aisle in dramatic slow motion.
[0:12–0:14] Alarm lights flash red inside the carriage. The unconscious attackers lie scattered across the train floor while terrified passengers stare silently. She adjusts her leather jacket calmly, breathing heavily, neon station lights glowing behind her through the rain-covered windows.
[0:14–0:15] Extreme close-up. Train doors slide open with a loud hiss. She steps out onto the rain-soaked platform without looking back as the camera remains inside the carriage watching her disappear into the crowded neon station. Fade to black.
Style: Original photorealistic urban action thriller, cinematic 4K realism, grounded practical fight choreography, continuous one-shot camera movement, gritty subway atmosphere, realistic train motion physics, shallow depth of field, flickering fluorescent lighting, intense handheld energy, atmospheric rain reflections, immersive sound-driven cinematography.
Cinematic hyper-realistic 14-second night launch of a NASA Space Shuttle at Kennedy Space Center, photorealistic 8K, dramatic lighting, wet reflective concrete pad, epic scale, filmic color grading, no text, no logos.
0-2 seconds: Wide static establishing shot of the full Space Shuttle stack (white orbiter with black heat tiles, massive orange external tank, two white SRBs) standing vertically on the launch pad next to the tall metal tower under a deep dark blue night sky. Subtle ambient lights, calm before ignition.
2-4 seconds: Sudden violent ignition — three main engines and two SRBs fire simultaneously with blinding orange-white flames and explosive clouds of thick white smoke + steam from the water deluge system erupting from the base. Camera cuts to intense low-angle close-up of the roaring engines and orbiter belly, fire and dense smoke rapidly filling the frame, ground shaking, dramatic orange glow reflecting on wet pad.
4-6 seconds: Extreme low-angle close-up on the engines and lower orbiter/external tank as the flames intensify and massive billowing smoke clouds swirl and expand violently upward, thick white vapor pouring out, intense heat distortion, cinematic orange illumination lighting the entire structure from below.
6-8 seconds: Camera pulls back to medium-wide low-angle shot as the shuttle begins to slowly lift off the pad; enormous golden-orange fire and expanding smoke clouds engulf the launch tower base, brilliant reflections on the wet ground, raw power visible.
8-10 seconds: Dramatic wide shot of the Space Shuttle clearing the tower and rising majestically into the night sky; massive expanding clouds of fire and thick white/orange smoke billow outward across the entire pad, tower lit dramatically by the engine glow, epic ascent beginning.
10-12 seconds: Dynamic low-angle tracking shot from below the ascending shuttle, focusing on the three blazing engine nozzles and SRBs producing powerful blue-white exhaust plumes, smoke continuing to surge upward, shuttle gaining altitude against the dark sky.
12-14 seconds: Final wide heroic shot of the fully ascending Space Shuttle climbing higher into the night, surrounded by enormous glowing clouds of smoke and fire that light up the launch complex, dramatic reflections, sense of immense power and scale as it continues its powerful vertical climb.
Style: Hyper-realistic, photorealistic details, intense contrast between dark night and blazing exhaust, cinematic camera movement with slight Dutch angles and smooth tracking, epic and emotionally powerful, no voiceover.
"ROCKET VS MEG"
Shot 1 (0s–2s)
TOP-DOWN AERIAL SHOT.
A military speedboat tears across bright blue ocean water at insane speed in broad daylight.
Behind it:
a gigantic dorsal fin slices through the sea, gaining rapidly.
Sunlight reveals the ENORMOUS shadow of the Megalodon moving beneath the surface directly toward the boat.
White water explodes everywhere.
Shot 2 (2s–5s)
Inside the violently bouncing speedboat.
Bright sunlight. Heavy ocean spray blasting faces.
A terrified soldier struggles to load a rocket launcher while another screams:
"MOVE! MOVE!"
The boat suddenly jolts upward violently as something gigantic brushes underneath the hull.
Everyone nearly flies overboard.
Shot 3 (5s–8s)
Wide cinematic ocean shot.
The Megalodon ERUPTS fully out of the water behind the speeding boat.
Massive jaws wide open.
Water cascades off its body in sunlight.
Its sheer size blocks the sky as it launches directly toward the boat mid-air.
Shot 4 (8s–11s)
High-budget slow-motion chaos.
The soldier braces against the railing while the airborne Meg hangs overhead.
People screaming.
Boat tilted sideways from wave impact.
The soldier fires the rocket launcher directly into the shark's open mouth at point-blank range.
Shot 5 (11s–13s)
MASSIVE OCEAN EXPLOSION.
The Meg detonates internally underwater.
Blood, fire, and water blast upward into the air.
Shockwave throws the speedboat sideways across the ocean surface.
Crew starts cheering in disbelief.
Shot 6 (13s–15s) — PAYOFF
Suddenly, the burning Megalodon corpse crashes directly DOWN onto the boat from above.
Boat folds apart violently under the impact.
Debris and water erupt skyward.
Final frame:
the shattered remains of the military boat sinking in daylight beside the gigantic smoking Meg carcass floating upside down in the ocean.
Key Visual Hook
Bright daytime aerial shot of the enormous Meg shadow rapidly chasing the military speedboat through crystal-blue water.
Notes
The scale should feel real and expensive — viewers can clearly see the gigantic shark, airborne attack, rocket hit, and final boat-crushing payoff in full cinematic detail.
Ultra-realistic 15-second wildlife sequence in a dense forest at dawn, cold mist between trees, wet leaves and soft earth underfoot.
0–3s: Low tracking shot — a wolf moves silently through ferns and tree roots, body low, ears forward, eyes locked ahead, breath faint in the cold air.
3–5s: Cut to a deer grazing near a clearing, head suddenly lifting, ears twitching, sensing danger.
5–7s: The wolf bursts from cover, accelerating through brush, paws kicking up leaves and dirt.
7–10s: Fast side tracking shot — the deer sprints away between trees, muscles flexing, hooves striking mud, branches shaking as it passes.
10–12s: The wolf closes distance, weaving around trunks with powerful strides, focused and controlled.
12–14s: Near-contact moment — the wolf lunges forward, jaws close near the deer's hind leg, but the deer sharply changes direction.
14–15s: Final shot — the deer escapes deeper into the forest as the wolf skids slightly on wet leaves, chase unresolved.
Camera: wildlife documentary style, low angles, fast but readable tracking, slight handheld realism.
Environment: dense forest, moss, roots, mist, falling leaves, natural morning light.
Style: ultra-realistic wildlife behavior, grounded physics, natural motion, no graphic detail, no text, no overlays, stable proportions.
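Timed prompts like this one depend on the beats lining up: each should start where the previous one ends, and the last should land on the stated clip length. The sketch below is an illustrative helper (the function names are hypothetical) that checks the beat labels copied from the sequence above for gaps, overlaps, and a mismatched total.

```python
import re

# Beat labels copied from the wolf-and-deer sequence above.
BEATS = ["0–3s", "3–5s", "5–7s", "7–10s", "10–12s", "12–14s", "14–15s"]


def parse_beat(label: str) -> tuple:
    """Turn a label such as '7–10s' (en dash or hyphen) into (7, 10)."""
    match = re.fullmatch(r"(\d+)\s*[–-]\s*(\d+)s", label.strip())
    if match is None:
        raise ValueError(f"unrecognized beat label: {label!r}")
    return int(match.group(1)), int(match.group(2))


def timeline_problems(labels, total_seconds):
    """List gaps, overlaps, and length mismatches; an empty list means clean."""
    problems = []
    cursor = 0  # second at which the previous beat ended
    for label in labels:
        start, end = parse_beat(label)
        if start > cursor:
            problems.append(f"gap before {label}: {cursor}s to {start}s")
        elif start < cursor:
            problems.append(f"overlap at {label}: previous beat ended at {cursor}s")
        if end <= start:
            problems.append(f"empty or reversed beat: {label}")
        cursor = max(cursor, end)
    if cursor != total_seconds:
        problems.append(f"beats end at {cursor}s, expected {total_seconds}s")
    return problems


print(timeline_problems(BEATS, 15))  # prints []
```

The same check works for the other second-by-second prompts in this collection once their labels are listed in the same form.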
A screenshot from a live NBA game TV broadcast on ESPN. The camera cuts to the audience — a gorgeous Asian woman in her 20s with long black hair, perfect features, and a stunning figure in a tight low-cut top, sitting courtside. She smiles naturally, unaware she's on camera. Full ESPN broadcast overlay: scorebug, network logo watermark, 16:9 aspect ratio. The image looks exactly like a real TV screenshot — broadcast color grading, slight compression artifacts, interlacing grain.
Create a premium cinematic travel film featuring the entire city of London with 15 visually distinct scenes.
The video should feel like a Netflix-quality urban documentary mixed with luxury tourism cinematography and emotional storytelling.
Style: ultra realistic, cinematic, photorealistic, high dynamic range, dramatic lighting, rich atmosphere, smooth camera movement, realistic city scale, premium color grading, IMAX-style composition, detailed architecture, subtle film grain, volumetric lighting, realistic reflections, atmospheric weather transitions.
Aspect Ratio: 16:9
Resolution: 4K HDR
Frame Rate: 24fps cinematic motion
Camera Style: drone shots, FPV flythroughs, crane shots, slow-motion tracking, aerial panoramas, dynamic timelapses, stabilized cinematic movement.
Music Direction:
Epic orchestral-electronic hybrid soundtrack with emotional build-up, deep cinematic percussion, elegant strings, atmospheric synths, subtle British cultural influence, powerful drops during skyline reveals, emotional piano during sunset scenes, seamless transitions synced with visuals.
Scene Breakdown:
Sunrise aerial over the River Thames with golden morning fog rolling through London skyline.
Cinematic drone reveal of Tower Bridge with traffic lights reflecting on wet roads after rain.
Hyper-detailed FPV flythrough between skyscrapers in Canary Wharf at blue hour.
Luxury street-level cinematic shot of classic red double-decker buses moving through central London.
Slow-motion cinematic crowd movement around Piccadilly Circus with giant neon screens glowing at night.
Royal cinematic reveal of Buckingham Palace with dramatic cloudy skies and elegant camera crane movement.
Atmospheric rainy-night sequence in Soho with reflections, umbrellas, cafes, and cinematic neon lighting.
Massive aerial establishing shot of Big Ben and the Houses of Parliament during sunset.
Cinematic timelapse of London Underground trains arriving and departing with dynamic motion blur.
Emotional golden-hour sequence inside and around Lord's Cricket Ground, packed crowd atmosphere, cinematic cricket action, cheering fans, dramatic stadium lights turning on.
Wide drone orbit around The Shard piercing through clouds at dusk.
Elegant evening sequence of luxury boats moving along the Thames with city reflections shimmering on water.
Winter-style cinematic fog drifting through historic London streets with vintage architecture and warm street lamps.
Massive night skyline reveal showing the full illuminated London cityscape from above with cinematic cloud movement.
Final emotional ending shot: slow aerial pullback over London at dawn transitioning from night lights into sunrise, ending with a majestic cinematic atmosphere.
Overall Tone:
Grand, emotional, modern, immersive, sophisticated, globally iconic, visually breathtaking, emotionally powerful.
Negative Prompt:
low quality, cartoon, oversaturated colors, unrealistic buildings, shaky camera, distorted faces, bad lighting, low detail, flickering, poor motion interpolation, flat composition, cheap CGI look, text artifacts, blurry skyline, noisy footage.
Create a colorful cinematic video. Feature a realistic young Japanese woman running her cozy modern coffee café throughout the day with natural human movements and realistic environments. Follow the exact sequence of scenes: opening the café, grinding coffee beans, brewing espresso, steaming milk, making latte art, serving customers, taking orders, decorating desserts, cleaning counters, washing cups, restocking ingredients, managing the cash register, decorating the café, evening cleanup, closing the café, and finally relaxing with coffee at night. Use warm cinematic lighting, soft sunlight, café steam effects, shallow depth of field, smooth camera pans, close-up shots, realistic reflections, cozy ambience, seamless transitions, and premium commercial-style cinematography. The girl must look like a real human actress, not anime, not cartoon, not CGI animation. Make the atmosphere aesthetic, relaxing, emotional, and luxurious like a high-end coffee commercial in ultra-realistic 4K quality.
Generate a 3-second ultra-realistic 4K video using start and end frames.
Single continuous handheld push-in.
Begin on her partially zoomed face.
The camera smoothly pushes toward her mouth as it opens wider naturally.
Realistic lip mechanics, natural moisture highlights.
End on an extreme close-up inside the open mouth.
No distortion.
No exaggerated anatomy.
Preserve realism and texture accuracy.
Use reference image as the primary identity lock and keep my face consistent throughout the full video. Create a 15-second ultra-realistic cinematic celebrity arrival scene.
I exit a modern international airport like a famous star. I am wearing a stylish black leather jacket, a fitted dark shirt, dark blue jeans, and elegant coordinated shoes, all highly fashionable, masculine, and charismatic. My outfit must feel premium, balanced, and visually cohesive.
0–4s: Inside the airport exit area, automatic glass doors open and I walk out with calm confidence. Two professional bodyguards notice me immediately and move into position beside me. Medium-wide cinematic shot, realistic airport lighting, subtle crowd motion in the background.
4–8s: As I step outside, people recognize me. Fans and bystanders lift phones and cameras, photographers start shooting, bright camera flashes go off, people turn their heads toward me. I notice the crowd, keep walking, then make a calm respectful gesture: I briefly place one hand on my chest like saying "thank you / I appreciate you," give a small confident nod toward the cameras, then lower my hand naturally. Security keeps space around me. Slow forward tracking shot with strong celebrity energy.
8–12s: I continue walking toward my car with relaxed but important body language. I make slight eye contact with cameras, subtle cool expression, composed smile for one second, then return to serious charismatic focus. My bodyguards escort me on both sides, creating a VIP corridor through the crowd. Dynamic but smooth camera movement, cinematic depth of field, realistic motion.
12–15s: I arrive at a sleek black luxury car parked at the curb. A bodyguard opens the rear door for me. I pause for one final star moment as cameras flash intensely, then get into the car with effortless confidence. End on a polished cinematic hero shot.
Style: photorealistic 8K, premium celebrity documentary realism, ultra-detailed skin and clothing textures, realistic airport exterior, natural daylight, paparazzi flashes, clean sound design with crowd murmur, camera shutter clicks, footsteps, bodyguard movement, car door sound. No subtitles, no text, no logos.
Create a 15-second cinematic short video with a unique emotional storyline.
Scene starts inside a softly lit modern grocery store. A young woman (mid-20s, natural look, casual outfit) walks slowly through the aisles, picking up everyday items like milk, bread, and fruits. Camera follows her in smooth tracking shots, focusing on small details—her hands brushing over products, her thoughtful expressions.
Mid-scene (5–10 sec): She pauses while holding a chocolate bar, a subtle flashback overlay appears—quick soft-focus memory of her laughing with someone special (suggesting nostalgia or a past relationship).
Final scene (10–15 sec): She gently puts the chocolate back, gives a soft, emotional smile, and walks toward the checkout. Camera lingers as she exits the store alone, but calm and stronger.
Style: cinematic, shallow depth of field, warm lighting, soft background music, emotional tone
Camera: slow motion, close-ups + smooth tracking shots
Mood: nostalgic, peaceful, slightly emotional
Quality: ultra-realistic, 4K, film-like color grading
A hand slowly enters the frame naturally and gently taps the smartwatch screen once. The display softly illuminates with a subtle ripple-style activation animation spreading across the screen surface. Warm sunlight shifts slightly through the curtains, creating delicate moving shadows across the wooden nightstand. Camera begins as a static overhead composition, then slowly pushes into a smooth 2x zoom toward the watch face after the tap interaction. Preserve original composition, watch design, lighting, wood textures, colors, reflections, and Scandinavian aesthetic. Smooth natural motion, calm premium lifestyle realism, soft cinematic atmosphere, aspect ratio 16:9.
A hyper-realistic female athlete in a modern, high-end gym environment, captured during a low-energy warm-up moment. She is standing near a squat rack, slightly bent forward, adjusting her wrist wraps while taking a deep breath. Her expression shows focus and calm determination, with subtle fatigue in her eyes as she prepares for an intense workout.
Appearance: athletic, toned physique, natural skin texture with visible pores and slight sheen of early sweat, minimal makeup, realistic facial features. Hair tied in a practical high ponytail with a few loose strands falling naturally.
Outfit: fitted dark sports bra and high-waisted leggings, breathable performance fabric with slight texture, paired with modern training shoes.
Lighting: bright, soft gym lighting with natural highlights, slightly diffused overhead lights creating gentle shadows on muscles. Subtle rim lighting to separate subject from background.
Environment: vibrant, premium gym interior with blurred background (shallow depth of field), visible gym equipment like barbells, plates, and benches. Clean, modern aesthetic with energetic color accents (reds, blues, neon hints).
Camera: medium shot (waist-up or 3/4 framing), eye-level angle, shallow depth of field (f/1.8 look), sharp focus on subject, softly blurred background.
Details: visible breath, slight sweat forming on forehead and collarbone, hands gripping wrist wrap tightly, veins slightly visible, realistic muscle tension.
Style Keywords: ultra-realistic, cinematic, 4K, HDR, shallow depth of field, natural skin texture, fitness photography, Nike campaign style, dramatic yet soft lighting.
A young woman in a weathered linen dress rows a small wooden boat through a misty river at golden hour, her dark hair loosely falling over her shoulders, expression calm and distant. Soft amber and rose light filters through dense mangrove trees, reflecting in the gently rippling water. Camera slowly pushes forward at low angle, just above the water surface, revealing her silhouette against the glowing fog. Hyper-realistic, cinematic grain, anamorphic lens flare, shallow depth of field, 4K, documentary-style lighting. Mood: quiet, ethereal, melancholic.
Extremely fast-paced, realistic, cinematic FPV flight through Disneyland. Low altitude over Sleeping Beauty Castle, parade streets, and fantasy villages. Sharp dives past rollercoasters, spinning teacups, and fireworks launch zones. Gliding above rivers with boats, glowing lights, and crowded themed lands. Realistic textures, reflections, dynamic shadows, steam, and smooth fluid movement. Close passes through tunnels, animatronic sets, and neon-lit rides.
Create a 15-second ultra-realistic cinematic vertical (9:16) commercial video.
Scene Style: Modern skincare advertisement, clean minimal bathroom setting, soft morning natural light, premium commercial look, macro detailing of water, foam, and skin texture.
Sequence:
0–3s: Extreme close-up shot of a young man's hands. He squeezes face wash into his palm—thick gel drops in slow motion, highly detailed texture. Soft light reflects on the product.
3–6s: He rubs the face wash between his palms, forming rich creamy foam. Camera focuses on lather buildup with cinematic macro shots.
6–10s: He applies the foam to his face and gently massages in circular motion. Voiceover begins: "This face wash removes dirt, oil, and impurities…"
10–13s: Slow-motion rinse shot—water flows across his face, washing away foam. Skin appears fresh, clean, and glowing. Subtle cinematic zoom-in.
13–15s: He looks into the mirror with a refreshed, confident expression and says: "…for clear, smooth, and energized skin every day." Final product pack shot appears with soft glow and clean white background.
A hyper-realistic cinematic food preparation scene in a modern ice cream shop. A perfectly chilled stainless steel cold plate sits center frame, surrounded by small metal bowls filled with chocolate chunks, cocoa chips, sauces, and colorful candy-coated chocolates (like Skittles). Camera locked in a slightly low, front-facing angle with shallow depth of field.
From above, a thick, glossy stream of creamy white ice cream base slowly pours down in a smooth continuous ribbon. The liquid stretches elastically and folds onto itself as it lands directly on a pile of vibrant rainbow candies at the center of the cold plate. The cream spreads slightly but keeps a soft mound shape, forming layered folds.
Bright studio lighting with soft reflections on the steel surface, clean white tiled background, professional dessert kitchen aesthetic. Subtle motion blur on the flowing cream, highly detailed textures (glossy liquid, matte candies, metallic reflections).
Background slightly out of focus: a red candy box visible on a glass shelf, minimal depth distraction. No hands visible, only the pouring action. No text, no subtitles.
Sound design (optional): soft pouring sound, light ambient kitchen noise.
Camera remains steady with micro cinematic focus breathing. Ultra HD, 4K, commercial food ad style, macro detail, realistic physics, smooth motion.
A cinematic ultra-realistic scene of [subject description], captured in [environment/location]. The subject performs [action/movement] with smooth, natural motion. Dramatic lighting with soft shadows and highlights, creating a moody atmosphere. Camera uses [camera angle, e.g., low-angle / drone shot / close-up] with slow motion effect and shallow depth of field. Background features [details like city lights, nature, fog, neon glow, etc.]. Color grading is [warm/cool tones], highly detailed textures, 4K resolution, realistic physics, cinematic composition, film grain, and smooth transitions.
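The bracketed slots in the template above can be filled programmatically when generating many variations. The following is a minimal sketch under stated assumptions: `fill_template` is a hypothetical helper, `TEMPLATE` is only the first two sentences of the template, and the example values are illustrative, borrowed from the rowing-boat prompt earlier in this collection.

```python
import re

# First two sentences of the template above, with its bracket placeholders.
TEMPLATE = (
    "A cinematic ultra-realistic scene of [subject description], "
    "captured in [environment/location]. "
    "The subject performs [action/movement] with smooth, natural motion."
)


def fill_template(template: str, values: dict) -> str:
    """Replace each [placeholder] with its value; fail loudly if one is missing."""

    def replace(match):
        key = match.group(1)
        if key not in values:
            raise KeyError(f"no value supplied for [{key}]")
        return values[key]

    return re.sub(r"\[([^\]]+)\]", replace, template)


# Illustrative values only.
filled = fill_template(
    TEMPLATE,
    {
        "subject description": "a young woman in a weathered linen dress",
        "environment/location": "a misty river at golden hour",
        "action/movement": "slow rowing strokes",
    },
)
print(filled)
```

Raising on a missing key, rather than leaving the bracket in place, keeps a half-filled template from being submitted as a prompt.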
Raw 35mm handheld cinematic footage, high-altitude sun haze, intense lens flare and atmospheric glow, one single unbroken continuous tracking shot, no cuts, no edits, all in real time, 15-second duration. Photorealistic 8K, natural physics, correct fabric motion blur from 350 mph wind, realistic skin and hair movement, zero uncanny valley, zero artifacts, hyper-detailed.
The main subject is the exact person from @[yourimage]: same face, same build, same skin tone, same casual expression. He is wearing baggy cargo shorts and flip-flops exactly as shown in @ Image1. He stands perfectly relaxed, casually balancing on top of the wing of a speeding F-16 fighter jet flying at 350 mph at 10,000 feet. The entire audio track is nothing but constant full-throttle jet engine roar mixed with powerful wind blast: no music, no dialogue, no other sounds.
At the 3-second mark the pilot leans out of the open canopy and gives a clear thumbs-up toward the guy on the wing. The guy from @[yourimage] leans forward slightly, smiles, and casually returns the thumbs-up.
At the 7-second mark he performs one completely casual, perfectly clean full backflip (no hands, no grabbing the jet, no assistance), rotating naturally in the air with perfect form and landing exactly on the same spot on the wing without a single stumble or shift in balance. All motion and fabric physics must perfectly match the body and clothing from @[yourimage].
At the 12-second mark he casually brushes a tiny speck of dust off his shorts with one hand, then gives a bored, almost lazy little thumbs up directly to camera. Hard cut on the final frame.
Use the exact appearance, face, body proportions, and clothing from @[yourimage] for the man throughout the entire video. Ultra photorealistic, raw documentary handheld feel, extreme detail on fabric flapping in the wind, correct motion blur, natural lighting, impossible but believable physics, cinematic yet gritty 35mm texture.
A cinematic vertical 9:16 video set in a vibrant pixel-art RPG version of New York City during warm daylight. The environment is richly detailed in 16-bit/32-bit pixel style with animated elements: water shimmering with soft reflections, clouds slowly drifting, birds flying across the skyline, and subtle NPC movement in the background.
At the center, a fully photorealistic real woman (identical to reference image, unchanged facial features, same hairstyle, same outfit) is seamlessly integrated into the pixel world. She is scaled naturally like a game character (around 25–30% of frame height), walking slowly forward with smooth, realistic motion. Her body movement includes subtle arm sway, natural posture shifts, and slight head turns as if observing the world. Her expression remains soft and neutral.
Camera Motion (Highly Important for Virality)
Start with a slow cinematic push-in (dolly forward) toward the character
→ slight parallax effect between foreground (bench, lamp post), midground (character), and background (city skyline)
→ add a gentle handheld micro-motion for realism
Midway: → smooth side tracking shot as she walks
→ brief focus pull from pixel background to her face
Final moment: → slight orbit camera movement (5–10° arc) around her for depth and immersion
Environmental Animation
Water: subtle wave animation + light reflections
Trees/plants: gentle wind sway
NPCs: minimal looping animations (walking, talking)
Boat slowly moving in background
Floating dust/light particles for atmosphere
Pixel signboards flicker slightly
UI Animation (Game Feel = Viral Hook)
Top-left avatar: subtle bounce-in + health bar pulse
Mini-map: blinking location marker
Quest panel: text types in with soft pop effect
Bottom UI buttons: idle glow + slight hover pulse
Coin counter: small increase animation (+10 flash)
Cinematic Effects
Soft sunlight rays with warm tone
Dynamic shadows matching movement
Depth of field (background slightly blurred during focus moments)
Subtle motion blur during camera movement
Light bloom on highlights
Gentle lens flare when camera shifts
Viral Hook Moment (CRITICAL)
At 2–3 seconds: → a pixel ripple/glitch transition briefly passes through the scene
→ for a split second, the world “reacts” to her presence
→ UI elements pulse + slight sound sync moment
This creates a “wait… was that real?” effect
Suggested Audio Direction
Soft lo-fi RPG background music
Light ambient city sounds (water, footsteps, distant chatter)
UI click sounds synced with animations
Subtle “level-up” or sparkle sound during hook moment
Style Keywords (important for Seedance)
cinematic, ultra smooth animation, parallax depth, photorealistic human in stylized pixel world, seamless integration, warm lighting, cozy aesthetic, immersive, game-like UI, subtle motion, viral aesthetic, high detail
Negative Prompt (to avoid breaking realism)
no face distortion, no stylized face, no anime face, no exaggerated proportions, no oversized character, no floating feet, no mismatched lighting, no blur on subject face, no jittery motion
Create a 15-second ultra-realistic vertical (9:16) cinematic video of a young woman shopping in a modern grocery store.
Scene Style: Bright, clean, and aesthetically pleasing supermarket with soft natural lighting, slightly warm tones, shallow depth of field, and smooth cinematic camera movement.
Sequence:
0–3s: Wide establishing shot of a modern grocery store aisle. Shelves neatly stocked with fresh fruits, vegetables, and packaged goods. Soft ambient store sounds.
3–7s: Medium tracking shot of a young woman wearing a casual, stylish outfit (white shirt, light denim jeans, minimal makeup). She pushes a shopping cart slowly while scanning shelves thoughtfully.
7–11s: Close-up shots:
• Her hand picking fresh apples and checking quality
• Slow-motion of fruits being placed into cart
• Subtle smile as she compares items on a list
11–15s: Cinematic side profile shot as she walks down the aisle. Soft sunlight beams through store windows, creating a dreamy glow. Camera slowly pulls back as she continues shopping calmly.
Mood: Peaceful, everyday lifestyle elegance, slightly cinematic commercial feel.
Visual Quality: Ultra-realistic, 4K detail, smooth motion, natural skin tones, shallow focus, soft bokeh background.
POV of a young office girl running on a crowded city road, checking her phone — she's late for work. Fast cuts — she dodges pedestrians, jumps over a puddle, squeezes through traffic, almost drops her files but keeps running. Background sounds: traffic, footsteps, heartbeat increasing. She sees the bus arriving, sprints at full speed, reaches just in time, grabs the handle and gets in. Ends with her breathing heavily, slight relieved smile. Ultra-realistic, cinematic, motion blur, fast-paced, 4K.
Create a 15-second ultra-realistic cinematic vertical (9:16) wrestling sequence. Intense sports drama with gritty, high-energy atmosphere. Dimly lit underground wrestling arena with harsh overhead spotlights, dust particles in the air, and a roaring crowd blurred in the background. Wet mat reflecting light, sweat and motion emphasized with slow-motion detail. Two powerful male wrestlers with athletic, muscular builds. One in red gear, the other in black gear. Both highly focused, aggressive, and determined.
cinematic
action
sports
realistic
drama
slow motion
A cinematic scene in the style of a historical epic war film set 500 years ago, on a massive ancient fortress wall inspired by old imperial-era architecture. The wall is extremely wide (around 15 feet) and stretches endlessly into the horizon, disappearing into mist and mountains. The environment is cold, dramatic, and filled with tension.
All soldiers are dressed in traditional ancient war armor from a 500-year-old era — heavy metallic chest plates, leather straps, cloth layers, helmets with engraved designs, and battle-worn textures. The armor looks realistic, aged, and authoritative.
On top of the giant wall, a heavily guarded military convoy is moving forward. Five war prisoners are being forcefully escorted by soldiers. The prisoners are struggling and resisting, trying to break free, creating chaos and resistance during the movement.
The escorting soldiers hold sharp swords and tightly grip the prisoners, forcing them forward with strength and discipline. Their expressions are strict, focused, and emotionless, trained for war and control. Every movement shows tension and authority.
Around them, high-security guards stand at regular intervals along the massive wall. They carry long spears (halberds) that are visible even from a distance due to the wide camera shots. The spears reflect faint light, adding to the cinematic atmosphere.
At the far end of the wall, a large ancient war gate or fortress entrance is visible — heavily fortified, made of stone and wood, leading deeper into a military stronghold where the prisoners are being taken.
The prisoners continue to struggle while being dragged forward, creating dynamic motion and tension in the scene. Guards maintain strict formation, pushing them forward without stopping.
The camera slowly pans and zooms to reveal the scale of the fortress wall — emphasizing its massive length, height, and historical power. Mist and wind move across the structure, adding dramatic cinematic depth.
Style: ultra cinematic, historical epic war film, 500-year-old ancient empire aesthetic, realistic textures, dramatic lighting, wide-angle shots, slow camera movement, intense atmosphere, high detail.
Mood: tense, powerful, dramatic, historical realism.
A stunt rider in a matte-black helmet and armored racing suit accelerates a superbike along the narrow arm of a construction crane high above Shanghai's skyline.
At the 2-second mark the crane begins to collapse, cables snapping and steel beams twisting.
The rider hits the end of the crane arm and launches the motorcycle across open air toward a nearby rooftop.
Camera on an adjacent tower captures the full arc of the jump as the collapsing crane falls behind him.
The bike lands on the rooftop helipad and skids through scattered equipment.
Shanghai skyline, collapsing crane stunt jump, rooftop landing momentum, cinematic aerial scale, 4K.
Create a 15-second cinematic vertical (9:16) ultra-realistic fitness video showing a high-intensity gym training montage inspired by a collage-style workout sequence.
Scene Style: Premium modern industrial gym with dark metallic interiors, rubber flooring, and dramatic cinematic lighting. Strong contrast between shadows and highlights with subtle red and blue neon accents. Floating dust particles visible in light beams for depth and realism.
Character: A strong, athletic male/female fitness model wearing sleek performance gym wear (compression top, shorts, training shoes). Visible muscle definition, sweat detail, and natural fatigue expressions showing effort and discipline.
Video Flow (fast-paced montage):
0–3s: Warm-up stretches and mobility drills
3–6s: Heavy barbell squats and controlled breathing close-up
6–9s: Deadlifts and explosive power lifts with floor impact shots
9–12s: Dumbbell curls, cable pulls, and boxing bag strikes (quick cuts)
12–15s: Treadmill sprint finish → slow-motion cool down, deep breathing, head up, victorious look
Cinematic Effects: Smooth motion transitions, whip cuts between exercises, slight slow-motion on key lifts, dynamic camera angles (low angle power shots, side tracking, close-up sweat detail).
Mood: Intense, motivational, discipline-driven transformation energy. Emphasize a "no excuses, only progress" feeling without showing text, unless subtly on a background gym screen.
CRITICAL INSTRUCTION: The reference image contains a 9-step chronological cooking storyboard. Animate the chef seamlessly through these exact 9 steps in order. Start at Step 1 (Flour Well), flow into Step 2 (Crack Eggs), then Step 3 (Mix). Continue the chronological progression through Kneading, Resting, Rolling, Cutting, and Boiling, finishing perfectly on the final plated dish (Step 9). Prioritize the strict sequence of actions.
15 seconds, 16:9, realistic, cinematic, tasty, natural camera movement.
How to use this system:
1. Generate the reference sheet in ChatGPT 2.0
2. Upload the reference image to Seedance
3. Create an animation prompt like the sample above
4. Set motion strength medium-high + cinematic style
Style: Ultra-realistic mass celebrity arrival scene. Single continuous shot. Handheld camera from crowd perspective. Natural micro-shake. No cuts. Documentary-level realism.
Audio: Natural environment sound only. Loud crowd cheering, overlapping voices shouting, rapid camera shutter clicks, phones recording audio, distant airport announcements echoing, footsteps, fabric movement, subtle engine idle and revving.
Lighting: Natural daylight filtering through large airport glass panels. Mixed reflections on polished surfaces. Soft but realistic shadows. Slight atmospheric haze for depth.
Main Character: @IMG
Calm, controlled presence. Subtle confident smile. Face identity must remain perfectly consistent across all frames.
Outfit (STRICT LOCK):
Black zip-up jacket (matte, soft fleece texture) worn open
Clean light grey/white t-shirt underneath
Beige/cream tailored pants
Accessories:
Transparent beige square-frame luxury-style sunglasses with gradient lenses
(No branding, no logo, no variation)
Scene Flow
0–3s:
Camera starts from inside a dense crowd behind barricades. Handheld, slightly unstable. View partially blocked by people in front. Multiple phones raised, some screens visible recording. Crowd energy is loud, chaotic, restless.
3–6s:
Camera lifts slightly above shoulder level, still handheld. Focus shifts naturally between heads, waving hands, and glimpses of the arrival gate. Occasional bright flashes from media cameras. Anticipation rises as crowd leans forward.
6–10s:
Security personnel step in, pushing the crowd back slightly. Camera reacts with natural shake. Through shifting gaps, the main character appears in the distance—initially soft and partially obscured, gradually becoming clearer while walking forward with a small escort team.
10–13s:
Subtle handheld push-in (natural movement, not digital zoom). The main character is now clearly visible, walking confidently at center frame. Path is being cleared. He raises one hand and gives a calm, controlled wave with a slight smile. Camera struggles slightly to keep framing due to crowd movement.
13–15s:
Camera shifts and tilts trying to follow. A luxury convoy becomes partially visible: a low-profile angular sports car in front, followed by two large premium SUVs. A security member opens a vertical-style door. The main character enters quickly. Engine revs naturally. Vehicles begin moving forward. Camera lifts slightly as people jump and try to capture the moment.
SCENE 1 (0–4s) — Entry & Discovery
A girl enters her bedroom after a long day. The camera follows her from behind as she opens the door. She pauses and looks at a messy, slightly chaotic room with scattered clothes and objects. Soft natural light enters through the window, creating a realistic, slightly dramatic mood.
SCENE 2 (4–8s) — Decision Moment
Close-up shot of her face as she sighs slightly and ties her hair into a neat bun. The camera slowly zooms in. Her expression changes from tired to determined. Subtle cinematic lighting highlights her focus and calm energy.
SCENE 3 (8–12s) — Cleaning Sequence (Speed Montage Style)
Fast-paced cinematic montage of her cleaning the room efficiently. Clothes are folded, items are arranged, bed is straightened. Smooth motion blur transitions, satisfying organization visuals, time-lapse style with soft aesthetic color grading.
SCENE 4 (12–16s) — Peaceful Ending
The room is now perfectly clean and minimalistic. She lies down gently on her bed, relaxed and peaceful, staring at the ceiling with a calm smile. Soft golden lighting, slow camera pull-back, emotional closure, serene atmosphere.
A highly cinematic 12-second video of a lone man walking through a scorching desert under intense sun. His clothes are torn and worn out from the harsh journey, and he carries a wooden stick for support. He looks exhausted from thirst and hunger, but continues walking with determination and silent resilience. The desert is vast, empty, and unforgiving, with heat waves rising from the sand.
As he slowly climbs a sand dune, the scene dramatically transforms on the other side: he discovers a lush green oasis filled with fresh flowing water, fruit-bearing trees, and vibrant greenery. The contrast is breathtaking — from dry desert to paradise.
His face instantly changes from exhaustion to overwhelming joy and relief. He looks up at the sky in gratitude, raising his hands in thankfulness, emotionally overwhelmed. Then he runs joyfully towards the water and greenery, full of hope and happiness.
Cinematic lighting, ultra-realistic style, emotional storytelling, dramatic contrast, smooth camera movement, high detail, 4K quality, film-like color grading.
You are in a real-life war zone captured on a handheld combat camera. Yapper is on the battlefield, engaged in an intense gunfight with trained soldiers. Continuous gunfire echoes loudly as bullets hit the ground, walls, and vehicles. Fighter jets fly low overhead at high speed, dropping powerful bombs that create massive shockwaves and dust clouds. Heavy military tanks move across rough terrain, firing shells and causing large-scale destruction.
Everything looks raw and realistic—natural lighting, real human movement, practical explosions, dust, smoke, debris, and camera shake as if filmed by a war journalist. No CGI or cartoon style—pure live-action realism. Sweat, dirt, and tension visible on faces. Sound design includes gunshots, distant explosions, jet engines, and battlefield chaos.
Yapper moves tactically, taking cover, reloading, and surviving in the middle of the chaos.
Add Yapper watermark/logo in the corner (subtle but visible).
Style: Live-action, ultra-realistic, cinematic war footage, handheld camera, motion blur, natural colors, documentary.
GPT Image 2 + Seedance 2.0
The dance animation was created using a movement sheet as the reference.
Prompt:
A photorealistic video sequence captures a young boy with messy orange hair and thick-framed glasses, as seen in image_0.png, image_1.png, and other source frames. He is dressed in a black basketball jersey and matching shorts with purple and blue trim, featuring the text "WIZZGEN 23" on the front and "CHICAGO 23" on the back (image_4.png). The setting is an outdoor asphalt city basketball court with green trees and a visible basketball hoop. The action begins with the boy in a low stance, dribbling the ball between his legs (image_0.png through image_3.png), then transitions to him standing taller and performing crossovers (image_5.png through image_7.png), followed by him successfully spinning the ball on his finger (image_8.png), and finally posing with a peace sign while holding the ball (image_9.png). The lighting is soft daylight under an overcast sky.
Generate a high-quality cinematic 15-second vertical video (9:16) of a teenage female street basketball player performing a smooth freestyle routine on an outdoor court. She has a slim athletic build, light tan skin, soft freckles, and wavy dark brown hair tied in a loose ponytail. She wears a cropped oversized jersey with "NEXORA" clearly printed on the front, loose high-waist shorts, crew socks, and stylish high-top sneakers with pastel accents (peach & mint).
Her vibe is confident, effortless, slightly playful — calm but skilled street energy.
0–2s:
She stands relaxed, spinning the basketball lightly in her hand.
Drops into a low stance and starts a controlled dribble, eyes focused.
2–4s:
Smooth in-and-out dribble into crossover, shifting her weight naturally.
Hair and jersey move subtly with motion.
4–6s:
Clean between-the-legs combo → behind-the-back transition.
Footwork tight, rhythm controlled.
6–8s:
She performs a hesitation + quick burst step, as if beating an invisible defender.
Confident expression.
8–10s:
A fluid spin move into step-back dribble, sneakers pivot realistically on asphalt.
Logo "NEXORA" stays visible.
10–12s:
Fast low dribble sequence side-to-side, keeping the ball tight and stylish.
Energy builds slightly.
12–13.5s:
She casually spins the ball on one finger, straightens up, slight smirk.
13.5–15s:
Final pose:
She catches the ball, rests it on her hip, gives a relaxed confident look.
Text fades in:
"Play Smart. Move Different."
🎨 STYLE:
realistic basketball freestyle, smooth street flow, confident female athlete energy, modern sports commercial vibe
🎥 CAMERA:
full-body framing, stable cinematic shot, slight push-in, smooth continuous motion, no cuts, fluid transitions
🌇 ENVIRONMENT:
outdoor street court, asphalt texture, faded court lines, chain-link fence, visible hoop, warm sunset lighting, soft shadows
🧠 QUALITY:
ultra-detailed, realistic ball physics, natural motion, clean composition, readable "NEXORA" text, 4K resolution
Ultra-realistic arctic wasteland at night, blizzard winds, frozen mountains barely visible through whiteout snow. Scientific expedition placing thermal charges across an ancient glacier. Ice begins cracking in glowing lines beneath their feet. Camera pulls backward fast through snow as an enormous humanoid machine rises from beneath the ice, launching frozen slabs into the air. Helicopters struggle in violent wind overhead. One colossal blue eye ignites through the storm. Final frame: titan fully standing, shadow swallowing the camp.
Image1 is the main character; maintain consistent facial features and body type throughout. The main character appears only once in every frame: no duplicates, no red-haired people in the crowd. Cinematic time-freeze short film, 15 seconds, ultra-realistic, ARRI Alexa Mini shooting texture, 50mm lens, natural daylight with hard shadows, shallow depth of field.
[0:00-0:03] Busy cobblestone street in an Italian old town, normal time flow. Steadicam front-facing medium shot tracking: the main character wearing a loose linen shirt tucked into high-waisted jeans and white sneakers walks confidently through the crowd. Pedestrians walk, check phones, chat; a flock of pigeons flies across the bright sunny sky in the distance. As she walks, she raises her right hand and snaps her fingers.
[0:03-0:06] At the instant of the snap, a powerful white spherical shockwave bursts from her fingertips, carrying visible air distortion and light refraction, spreading rapidly in all directions...
A battle-hardened space marine in armored exosuit charges across the red dunes of a hostile alien planet under twin suns, sandstorms whipping up around jagged rock formations. The landscape shifts as buried ruins erupt from the ground and biomechanical creatures burrow through the earth. At the 1-second mark, he jet-boosts from a crumbling ledge toward a crashed escape pod. Camera orbits him dynamically as distant explosions light the horizon. He latches onto the pod's thruster, pries open the hatch, and activates shields just as a horde of insectoid aliens swarms the dune behind him. Desert planet skirmish, jet-assisted leap, armored suit sprint, epic sci-fi lighting, 4K.
Cinematic 15-second desert safari experience in the Dubai desert at sunset, composed of 15 rapid 1-second shots, each cut cleanly with smooth visual continuity, ultra-realistic golden sand dunes stretching across the horizon, warm sunset lighting with rich orange and amber tones, soft wind shaping fine sand textures, high-end travel and adventure cinematography style, consistent across all shots.
Shot List Sequence:
1. Aerial establishing shot of vast golden dunes under a glowing sunset sky
2. Smooth drone glide over rolling dunes creating depth and motion
3. Wide shot of a 4x4 vehicle driving across the sand leaving trails
4. Dynamic close-up of dune bashing with sand spraying into the air
5. Low-angle shot of wheels cutting through soft sand
6. Side tracking shot of the vehicle drifting along a dune ridge
7. Slow-motion shot of sand particles blowing in the wind
8. Silhouette of a camel caravan moving across the horizon
9. Close-up of a person riding a camel at sunset
10. Wide shot of a desert camp with traditional tents
11. Action shot of sandboarding down a steep dune
12. Medium shot of people relaxing at the camp
13. Close-up of traditional lanterns glowing in warm light
14. Transition shot as the sky deepens into orange twilight
15. Final hero pull-back aerial showing endless dunes fading into the horizon
Visual and Motion Style:
Fast cinematic cuts, smooth micro camera movements per shot including push, pan, slide, tilt, and orbit, physically accurate sunset lighting with warm tones, ultra-realistic sand textures with wind patterns, dynamic motion for vehicles and sand, soft shadows, no flicker, stable geometry, real-world motion blur, shallow depth of field where appropriate, HDR, ultra high definition, film-quality travel and adventure cinematography.
A photorealistic 16:9 in-game screenshot of a fictional next-gen open-world RPG titled "BULK: A Members-Only Adventure". Third-person over-the-shoulder camera following the player character — a tired suburban mom in yoga pants pushing an oversized flatbed cart down the aisle of a Costco warehouse store. Scene captures hyper-realistic warehouse lighting, towering pallet stacks, a free sample station ahead with an NPC in a hairnet glowing with a yellow exclamation mark above her head. Game HUD overlay: top-left mini-map showing aisle layout with quest markers; top-right stamina bar labeled "PATIENCE" three-quarters full; bottom-left compass with objective text "PRIMARY: Locate Kirkland Almond Butter (Aisle 11)" and below "SECONDARY: Sample 3/5 cocktail meatballs"; bottom-right item quick-slots showing membership card, car keys, snack bar; center crosshair with subtle interaction prompt "[E] Take Sample". Cinematic depth of field, slight chromatic aberration, photo-mode quality. All UI text crisp and legible. Realistic Costco signage in background spelled correctly. No watermark, no real Costco logo (use generic warehouse-club aesthetic).
A seamless, extreme FPV hyper-zoom starting from a wide view of Earth in space, rapidly plunging through the atmosphere and clouds. The camera dives into an aerial hyper-lapse of St. Petersburg, sweeping past the golden dome of St. Isaac's Cathedral. It descends smoothly to skim just above the water of a canal, accelerating towards the Palace Bridge. The camera flies directly through the raised, open spans of the drawbridge.
As it exits the bridge, the camera smoothly pans right and decelerates, seamlessly transitioning into a medium portrait shot of a young man sitting on the granite river embankment. The man has short textured hair with subtle highlights, light stubble, and sharp facial features. He is wearing a relaxed white button-down shirt, dark blue denim jeans, clean white sneakers, and a minimal silver chain bracelet.
He initially looks away toward the horizon, then slowly turns his gaze toward the camera with a calm, confident expression. The background features the open drawbridge against a soft, pastel twilight sky, with reflections shimmering on the water. Cinematic, hyper-realistic, continuous single-take, 8K resolution, photorealistic, smooth motion, natural lighting, ultra-detailed textures.
fpv
cinematic
city
st-petersburg
continuous-shot
realistic
✅Key Visual Prompt
Genre: XX
Brand Name: XX
Using this image as a base, generate a photorealistic poster image that exists nowhere in the world, with ideas that completely deviate from common sense.
Not just an extension, but leap the imagination to a level where "the meaning gets through, but the interpretation is utterly mad."
【Absolute Requirements】
・Photorealistic expression (reproducing the texture like live-action, sense of air, even light particles)
・All language in Japanese
【Elements to Exaggerate/Boost】
■Font Design
・The letters themselves materialize or become phenomena
・Fonts physically interact with the theme
■Text Placement
・Ignore normal layouts and place text into the space itself
・Placement with abnormally strong gaze guidance
■Composition
・Unrealistic perspective, extreme wide-angle, distortion, scale destruction
・Clear layers of foreground, midground, and background, with abnormally high information density
・A central element that grabs the eye in an instant + countless subtle dissonances in the details
■Lighting
・Movie-level cinematic lighting
・Intense backlighting, rim light, neon, particle light, volumetric light
・Light itself carries meaning (functions as leading lines or emphasis)
■Catchphrase
・Short, intense Japanese that makes sense yet comprehension can't keep up
・A copy that feels oddly fitting despite being out of sync with the situation
【Additional Staging】
・A sense of discomfort where reality and unreality coexist simultaneously
・Parts that ignore the laws of physics
・Quality that works as an advertisement (professional level)
Ultimately, make it a poster that is "understandable yet comprehension lags behind" and "demands a double-take."
If the reference image is a character, apply actions like wearing/using the brand, about to eat it, etc.
Do not use any real brand logos, company names, personal names, etc. at all—make it completely original. At final finishing, confirm nothing real is included.
✅Storyboard Prompt
Using this image as a base, create storyboard images for a 15-second video. This image serves as the end frame, in the style of a TV commercial (CM). Use diverse camera angles such as frontal, side, and diagonal overhead, and avoid duplicating the same framing. 9:16
A highly realistic cinematic scene of a calm indoor environment with soft neutral tones and natural lighting. The camera remains steady with a slow, subtle push-in movement. The atmosphere is calm and minimalistic, with gentle shadows and balanced composition. Slight ambient motion is visible: soft light flickering, faint environmental movement, and natural depth of field. The color grading is warm and slightly desaturated, giving a modern cinematic look. Ultra-detailed textures, realistic lighting, 4K quality, shallow depth of field, smooth motion, film-style grain.
A hyper-realistic cinematic product photography shot of a sleek black smartwatch placed on a wet reflective surface, covered with water droplets, during heavy rain. The environment shows a blurred cityscape in the background through a rain-covered glass window, with soft bokeh lights and moody overcast lighting.
The watch is positioned slightly angled, with detailed reflections visible on the wet surface below. Water droplets are visible on both the watch body and strap, enhancing realism. The screen is on, showing a modern minimal watch face with bold numbers and subtle UI elements.
Lighting is dramatic and soft, with cool tones, natural reflections, and high contrast. Depth of field is shallow, focusing sharply on the watch while the background remains blurred. Rain droplets on the glass add texture and atmosphere.
Ultra-detailed, 8K, professional product photography, studio lighting mixed with natural rainy ambiance, sharp focus, realistic reflections, cinematic composition.
Camera Settings: 85mm lens, f/1.8 aperture, ISO 100, shallow depth of field.
Style Keywords: photorealistic, luxury product shot, cinematic lighting, moody, high contrast, water splash, reflections, premium advertisement style.
**Environment:**
A frozen tundra under aurora-lit night skies. Pale green northern lights reflecting across a wide snowfield with icy winds sweeping snow particles across the ground.
**Action:**
15.0s sequence. A giant silver arctic wolf charges across the snow while a rival black wolf emerges from the drifting snowstorm. The two wolves collide in a powerful clash, sliding across the icy surface.
Velocity Ramp choreography: the moment their bodies collide freezes briefly as snow explodes around them before snapping back to full speed as they tumble across the frozen ground.
**Camera:**
Low tracking shot racing through blowing snow alongside the wolves, occasionally capturing the action reflected across icy surfaces.
**Style & Constraints:**
Photorealistic fur simulation, volumetric snow particles, aurora sky lighting, cinematic cold atmosphere, 35mm film grain, 8K.
A premium fast-food commercial product photograph of a gourmet cheeseburger centered against a warm golden-yellow seamless studio background. The burger features a glossy sesame seed bun, fresh lettuce, tomato slices, onion rings, melted cheese, a juicy grilled beef patty, and rich sauce. Soft studio lighting, subtle shadows, mouthwatering texture, sharp focus, ultra realistic food advertisement photography, clean composition, 8K.
Sneaker hands‑on hook – an energetic man holds neon‑green sneakers close to the camera in a skatepark, rotates them, slides his foot in and stomps; shot at golden hour with a handheld iPhone.
Kitchen discovery reaction – woman in sunlit kitchen opens a jar of chilli crisp, sniffs and tastes it; her eyes widen and she laughs while describing the flavor; handheld and natural.
Mirror try‑on – young woman stands before a mirror trying on a cream linen shirt and jeans; she turns to show angles and tells viewers the size and brand; soft window light and no music.
Coffee shop recommendation – a man at an outdoor café sips a flat white, speaks to his phone about life hacks and points to his notebook; blurred greenery and warm morning light make it cozy.
Street interview – multi‑shot prompt where different people on a busy sidewalk shout quick testimonials about a platform; quick cuts, handheld iPhone footage and bright daylight create energy.
ugc
social-media
multi-shot
realistic
advertisement
A cinematic 15-second time-lapse video of a house being built from an empty plot. The scene begins with a clear, vacant plot of land under daylight. Construction starts quickly: workers arrive, laying the foundation with concrete and steel. The structure rises rapidly: walls form, bricks are placed, and scaffolding appears. The roof is installed, followed by windows and doors. Exterior finishing and painting happen smoothly. The surrounding area becomes neat and landscaped. The final scene reveals a fully completed modern house standing beautifully on the plot. Smooth time-lapse transitions, dynamic camera movement, realistic construction details, bright natural lighting, high detail, 4K quality.
Ultra-realistic Indian classroom street fight. Single continuous shot, no cuts. Raw handheld mobile footage with natural micro-jitters, slight rolling shutter, no stabilization. Documentary realism.
Audio: No music. Only raw ambient sound: footsteps, desk friction, cloth movement, punches, fan hum, breathing, distant classroom noise.
Lighting: Mixed warm + cool cinematic tones. Practical classroom lighting. Moving ceiling fan shadows. Visible dust particles in light shafts.
MAIN CHARACTER (STRICT IDENTITY LOCK): Indian male (17-18), cold emotionless face matching reference image exactly. Black straight side-swept hair, brown almond eyes, sharp jawline, natural Indian skin tone, slim face.
OUTFIT LOCK (NO CHANGE): Black school blazer, white shirt, black tie, black pants. No hoodie/layers. Slightly fitted blazer, loose tie. Must remain identical throughout.
OPPONENT RULES: All opponents have unique faces. No resemblance to protagonist.
ACTION: 0-2s: Protagonist grabs Opponent A from behind, one-hand head slam onto desk. Books/pencils burst outward. Desk drags with friction. A collapses limp.
2-5s: Grabs Opponent B, punch to abdomen, B folds, immediate head slam into desk. Head pressed briefly to tiles. Camera stabilizes briefly. B falls face-down. Heavy breathing, visible sweat.
5-6s: Blazer shifts slightly, shirt + tie visible (tie displaced). C & D enter from back, split and flank. Only footsteps, fan hum.
6-8s: Chaotic fight. D attacks with rapid punches. Protagonist stumbles slightly. C grabs blazer, pulls back. Knee to abdomen. Body bends forward. Heavy breathing. Fan shadows moving.
8-10s: Headbutt to D; D staggers. Protagonist grabs C by the collar efficiently.
10-12s (BOARD IMPACT): Protagonist drives C backward into green chalkboard. Back hits first, head snaps slightly. Chalk dust bursts outward, fully visible in warm light. Chalk tray rattles, pieces fall. Micro slow-motion (0.2-0.3s) at impact, then back to real-time. C loses tension, slides down leaving chalk smears, collapses. Dust drifts.
12-13s: D charges; protagonist sidesteps, rotational punch to cheekbone. Sweat arcs mid-air. D crashes into desks. Massive collapse, books scatter. D motionless, hand drops. Protagonist barely standing, heavy breathing.
13-14s: Medium-close. Chest rising. Slow turn to camera. Cold eye contact through glasses. Wipes blood from nose (visible smear). Removes glasses, throws them aside, spinning impact.
14-15s (FINAL): Ground-level wide shot. Slight edge blur, warm tone, golden dust. Protagonist without glasses removes blazer → throws onto opponent. White shirt + black tie only. Tie loose, slight movement. Walks to exit. Blazer settles. Silent classroom. Dust floating. Fade to black.
Vertical 9:16 | Handheld | Natural Lighting | Soft Luxury Aesthetic
0 to 2s — Product Reveal (Close-Up)
Extreme close-up of a sleek serum bottle held loosely in a woman's hand. The "Kling 3.0" label faces the camera, fully legible. Warm morning sunlight softly catches the glass, creating a gentle gleam. Camera is slightly shaky, casual handheld feel. Background is softly blurred — a bright bathroom or bedroom vanity.
2 to 4s — Dropper Application
She tilts the dropper and squeezes 2 to 3 drops onto her fingertips. The serum catches the light as it drops — slightly golden, slightly translucent. Her fingers come together, spreading the product slowly. A faint relaxed smile begins to form. Mirror is partially visible in the soft background.
4 to 6s — Face Application
She brings her fingers to her cheekbones and begins pressing the serum gently into her skin using light upward motions. Her skin looks clean, bare, naturally glowing. The texture absorbs visibly. She looks calm, unhurried, at peace. Dewy finish begins to appear on her cheeks and nose bridge.
6 to 8s — Mirror Moment
Camera pulls back slightly to reveal her standing in front of a mirror. She looks at her reflection — not posing, just observing. Soft approval on her face. The mirror doubles the warmth of the scene. Natural light pours in from the side.
8 to 10s — Skin Close-Up (Face)
Tight close-up of her cheek and jawline. Skin looks plump, luminous, and hydrated. No filter-like perfection — real texture, real glow. Light bounces naturally off the high points of her face. Camera holds still for just a moment, almost like admiring the result.
10 to 12s — Bottle Detail Revisit
She picks up the bottle again, this time more gently — almost fondly. Fingers wrap around it. The Kling 3.0 label is visible again, facing camera at a slight angle. She tilts it slightly as if out of habit. Warm bokeh background. No words, no voiceover — the visual does the talking.
12 to 13s — Confident Mirror Look
She looks back into the mirror and gives a slow, subtle approving nod. Not dramatic — quiet confidence. Her eyes soften. A small natural smile. She gently tucks her hair behind her ear. Feels like a private morning ritual, not a performance.
13 to 15s — Fade with Product in Frame
Camera slowly drifts back. She sets the bottle on the vanity counter, label still visible. Morning light fills the frame. Scene feels unhurried and complete. Gentle handheld micro-movement keeps it authentic. Slow natural fade to soft brightness.
A young woman sits alone on a park bench at sunset, looking upset while scrolling her phone. A friend joins her and offers a Cadbury Dairy Milk with a light joke, turning the moment playful. She unwraps and breaks the chocolate, sharing a relaxed, warm exchange. After tasting it, her mood lifts as the scene brightens and they laugh together. The final shot focuses on the chocolate bar with a soft cinematic glow, highlighting a simple message of comfort and shared sweetness.
STYLE: photorealistic, high-end cinema, ultra-detailed, 35mm film look, shallow depth of field, dramatic lighting, motion blur, dynamic camera, seamless transitions, surreal continuity, high contrast, rich reflections
0:00 – 0:02 INT. CASINO – ROULETTE TABLE – NIGHT Low-angle cinematic shot. A sharply dressed man in a tailored suit sits at a roulette table, surrounded by scattered chips and beautiful women. Warm golden casino lighting flickers across his face. He bursts into uncontrollable laughter, head thrown back, eyes wild. The camera pushes in. He falls backwards, laughter echoing unnaturally.
0:02 – 0:04 VOID – GOLD BARS He plummets headfirst at extreme speed through a dark void filled with massive rotating gold bars.
0:04 – 0:06 INT. LUXURY JEWELRY STORE – NIGHT He continues falling headfirst at high speed down a pristine jewelry store aisle. Glass cases on both sides explode outward as he passes, diamonds and watches suspended in slow motion.
0:06 – 0:08 EXT. POOL PARTY – NIGHT He falls headfirst at high speed through a glamorous pool party scene. Beautiful women in luxurious dresses dance and laugh on both sides, champagne splashing.
0:08 – 0:10 VOID – CASINO OBJECTS He falls head first at high speed into another void. Floating casino jetons, poker cards, and dice swirl around him in zero gravity. Cards slice past the lens, chips collide in slow motion.
0:10 – 0:12 He falls headfirst at high speed through a void of men in suits fighting over money, boxing each other in angry rage.
0:12 – 0:14 He continues falling headfirst at high speed past a perfectly aligned row of identical versions of himself in suits. They stand in formation: some throw dollar bills, others cry, some are angry, some raise their fists.
0:14 – 0:15 EXT. DIRTY STREET – NIGHT Abrupt impact. He lands onto wet asphalt beside overflowing dumpsters. The lighting shifts to harsh, flickering streetlight. Silence. He now wears torn, filthy clothes—transformed into a beggar. A whiskey bottle loosely hangs from his hand. His laughter is gone.
ENVIRONMENT: Home → bedroom → kitchen → school gate
MOOD: Calm → rush → chaos → composed finish
⚡ SHOTS
Soft morning wake-up
Gentle call to child
Child resisting, sleepy chaos
Blanket pull + playful struggle
Checking time → sudden urgency
Quick bathroom routine rush
Brushing hair while walking
Uniform fixing on the go
Breakfast multitasking
Packing school bag fast
Searching missing item panic
Shoes + socks scramble
Out the door rush
Walking fast / slight run
Child distraction moments
Final adjustments before entry
Quick hug + goodbye
Watch child enter school
Relieved, composed pause
Cinematic, high-quality video of a beautiful young woman with auburn hair in a messy romantic updo, wearing an elegant, flowy, off-the-shoulder white dress. She is sitting gracefully on the cobblestone ground of a sunlit, grand European piazza, gently strumming a white acoustic guitar and singing a song. A fluffy white Ragdoll cat with a bushy tail is walking around her and affectionately nuzzling her leg. The background features classical baroque architecture, a large ornate fountain, and softly blurred pedestrians strolling by in the warm afternoon sunlight. Golden hour lighting, photorealistic, serene, and peaceful atmosphere with a shallow depth of field
**Environment:**
A vast stormy ocean at night. Thunderclouds swirl overhead while lightning flashes across towering waves. A naval fleet spreads across the water below.
**Action:**
15.0s sequence from the POV of a colossal sea monster rising beneath the waves. The viewer moves through dark ocean water with bioluminescent currents swirling past.
At the 2-second mark the monster breaches the surface.
Warships appear above as massive tentacles crash across the decks.
Ships fire weapons while waves explode around the creature.
Velocity Ramp choreography: a tentacle strike hitting a destroyer slows dramatically — water droplets and sparks suspended in the air — before snapping back as the ship is hurled sideways.
**Camera:**
Fluid aquatic POV transitioning from deep ocean darkness to chaotic surface battle.
**Style & Constraints:**
Photorealistic water simulation, volumetric ocean spray, cinematic lightning illumination, realistic ship destruction physics, 8K.
A man struggling to walk forward against extreme wind, holding onto a pole for stability. Objects fly through the air as buildings begin to break apart. Coastal city under hurricane, heavy rain, flooding streets, debris everywhere. Handheld shot pushing into the wind with him, rain hitting lens, strong motion blur from flying debris, relentless environmental pressure.
A green sea turtle lifts its head above crystal-clear water, exhaling a misty plume that catches sunrise light; ripples race outward.
200 mm telephoto realism, 1/4000 s freeze, warm golden backlight.
audio: soft exhale + gentle wave slap
negative: no divers, no boats
playing the Spanish guitar on top of a moving flying drone inside a weathered Spanish apartment, cold light coming through the windows. Sound of the drone's propellers
Static locked off UGC frame on a girl at a table making matcha, with the exact same camera position and framing throughout, perfectly steady, with no shake, no drift, and no micro-jitter, and a clean, crisp image. The clip opens exactly on the start frame, with her holding the metal sifter over the bowl as the last of the matcha falls through, fine powder drifting down naturally in tiny bursts. She speaks in a natural female American accent, around 27 to 28 years old, calm and confident, with a relaxed conversational rhythm, slightly deeper than average, smooth and mature but still soft and feminine. She starts speaking immediately at the beginning, with her lips clearly moving on-camera through every word: “Okay, um…” As she says “Okay,” she instantly lowers her gaze down toward the white bowl and shifts her focus to what she is already doing, while her mouth continues into “um” without interruption. When she says “um,” it is barely audible, almost to herself, quiet, low, and absent minded, like a whisper. That small pause on “um” feels like a thought catching up to her hand, and her lips barely move. Her gaze drifts slightly to the left for a second, her eyes briefly flicking toward the camera and then back to the bowl, as her hand gives one last gentle tap to finish the sifting, her mouth still moving through the line without missing a beat: “I wanna show you.” The final powder stops, the mesh is visibly clean, and she lowers the sifter a little closer to the bowl as if checking that she got it all, finishing the last words with a quiet, confident ease: “how simple AI UGC is.” Her expression stays natural and unperformed, like she is just talking while doing the routine. Keep the identity, skin texture, and environment perfectly stable, with no warping, no morphing, no smoothing, no smearing or blending, no pixel mixing, and minimal motion blur. Preserve realistic powder behavior, metal reflections, and shadows. End exactly on the provided end frame.
Video 2
Prompt:
Static locked off UGC shot on the same table matcha setup, with the camera perfectly steady and the same framing throughout, with no handheld shake, no drift, and no micro-jitter. Clean, crisp image. The clip opens exactly on the start frame, with the empty sifter held near the bowl. In one slow, natural continuation, she sets the sifter down out of the main action area and reaches for a glass electric kettle, then begins pouring hot water into the bowl in a physically believable stream, with realistic weight in her grip, an accurate pouring angle, natural water flow, and subtle steam cues, while the environment, background, and object positions remain consistent with the start frame. The shot settles exactly into the end frame, with the water clearly pouring into the bowl. She speaks in a natural female American accent, around 27 to 28 years old, calm and confident, with a relaxed conversational rhythm, slightly deeper than average, smooth and mature but still soft and feminine. The clip begins with no introduction at all, she is already mid sentence, and her lips clearly move on-camera through every word. While reaching for the kettle and starting the pour, she says naturally: “But honestly”. When she says the word “honestly,” it ends with a slight upward tone, and then, as she picks up the glass electric kettle and just before she starts pouring the hot water into the bowl, she finishes the last words: “…it’s really not as hard as it looks”. She feels completely relaxed and unbothered. No identity drift. No skin warping or morphing. No texture invention, no smoothing, no smearing or blending, no pixel mixing, and minimal motion blur. Keep skin pores, hair, fabric, reflections on the kettle and bowl, and matcha surface behavior stable and realistic. End exactly on the provided end frame.
[BROADCAST SETUP] Live TV sports broadcast signal, 1080i HD resolution, 50fps. Shot on professional ENG stadium cameras. Standard TV color balance with natural daylight. No color grading, no cinematic filters, no artificial post-processing. Real-world stadium acoustics and ambient light. Audio Style: Immersive spatial sound design. Loud stadium atmosphere, crowd roar, metallic clink of the hammer chain, stadium announcer muffled in the background, wind noise on the microphone.
[TIMELINE SECOND BY SECOND]
0-4s: [Medium-wide broadcast shot] Swedish male athlete with long blonde hair and beard (viking-like) spinning rapidly in the hammer throw circle. Authentic Olympic uniform. Real-life physics and momentum.
4-7s: [Broadcast tracking shot] The athlete releases the giant hammer. The camera pans fast to follow the arc of the heavy metal ball flying through the air against the stadium sky. Realistic high-speed physics.
7-12s: [Wide stadium view] The hammer heads towards the lower stands at high speed. A grandmotherly woman in the front row reaches out and catches the giant hammer firmly with both hands. No slow motion, real-time broadcast speed.
12-15s: [Reaction shot] The surrounding crowd jumps up and applauds. The woman holds the hammer, looking at the camera. Authentic live TV cut to the crowd's genuine reaction.
[QUALITY BOOSTERS] Photorealistic live footage, 1:1 sports broadcast physics, authentic Olympic stadium environment, sharp details on clothing and skin, natural motion blur from high-speed movement, stable facial features throughout the clip.
@Image1 is the main character — maintain consistent facial features and body type throughout. Appears only once per frame, no duplicates. Cinematic time-freeze short film, 15 seconds, ultra-realistic, Arri Alexa Mini shooting texture, 50mm lens, natural daylight hard shadows, shallow depth of field.
[0:00–0:03] Busy Alpine village street — wooden chalet shopfronts, cobblestone ground, dramatic mountain peaks visible at the end of the lane, clear blue sky. Steadicam front-facing medium shot tracking: the main character — a young man with dark hair, black-framed glasses, light stubble, silver chain necklace, wearing a loose printed short-sleeve shirt — walks confidently through the tourist crowd. Hikers with backpacks, shop signs, a postcard rack outside a souvenir store. A pigeon cuts across the sky above him. He raises both hands, interlaces his fingers, and snaps.
[0:03–0:06] A powerful white spherical shockwave bursts outward from his hands, carrying visible air distortion and light refraction, spreading in all directions. Pedestrians freeze mid-stride. The pigeon locks mid-flight overhead. Most strikingly — a street juggler beside the postcard shop freezes with three apples suspended mid-arc in the air above him, hanging perfectly still against the blue sky. Postcards fly off the rack and hang frozen in the air. Absolute silence falls over the village.
[0:06–0:09] Only his footsteps echo off the cobblestones. He strolls casually along the frozen street, glancing around with calm satisfaction. He passes the frozen pigeon overhead, pays it no mind. He slows as he approaches the postcard rack — reaches up and plucks one of the floating, suspended postcards out of mid-air, turns it over in his hand, looks at it briefly with a raised eyebrow. Sets it back. Keeps walking.
[0:09–0:11] He stops directly in front of the frozen juggler — three apples hanging perfectly mid-arc above the man's outstretched hands. He tilts his head, studies the composition. A slow smile. He looks at the camera, gives a small nod, and whispers "perfect."
[0:11–0:15] He turns back to face the street, interlaces his hands and snaps again — a second shockwave, stronger, bursts outward in reverse. Everything unfreezes instantly: the juggler catches his apples and continues seamlessly, postcards flutter back to the rack, the pigeon flaps away, hikers resume mid-conversation. City noise and mountain ambience rush back in. He calmly turns and walks toward the camera. Camera slowly rises and pulls back — his silhouette moves down the Alpine lane between the chalets. Fade to black.
Sound design: Alpine village ambience → double snap → shockwave rumble radiating outward → absolute silence → lone footsteps on cobblestone → whispered "perfect" → second snap → reverse shockwave burst → village sounds and mountain wind naturally restored.
Cinematic 4K 60fps realistic live-action spec advertisement, high-production film look, shot on film with subtle grain, warm natural indoor lighting.
PART 1 — Establishing Atmosphere (0:00-0:08)
Wide cinematic establishing shot of a cozy college library with tall beige bookshelves, warm sunlight streaming through large windows.
PART 2 — Character Introduction (0:08-0:18)
Medium shot of a young Western female student with shoulder-length light brown hair, wearing a red-and-white varsity jacket.
PART 3 — Detail & Research (0:18-0:28)
Close-up of hands flipping through books and design research papers.
PART 4 — Spark of Inspiration (0:28-0:40)
Medium close-up of the student pausing, then suddenly looking up with realization.
PART 5 — Creative Flow (0:40-0:52)
Montage sequence of fast writing, flipping pages, sketching concepts.
PART 6 — Resolution & Brand Feel (0:52-1:00)
Hero medium shot of the student confidently reviewing her work.
15-second, no-dialogue, fully immersive high-speed brutal fight short film.
Two characters @ image1 and @ image2 .
Live-action realistic style. Close-quarters, high-speed chaotic melee combat with no rules, fully out-of-control brawling. No flashy choreography—pure raw. Continuous exchange of punches, kicks, blocks, body checks, throws, and grappling entanglements. Ultra-realistic impact. Exaggerated yet physically grounded motion speed. Entire sequence in 120 FPS with no slow motion. Rapid dashes, evasions, and high-speed limb swings. Extreme motion blur and speed trails amplify intensity while preserving realistic weight and physics.
Master-level cinematic camera movement:
A professional film camera stays tightly locked to the action, flying dynamically with the fighters—high-speed dives, sudden stops, whip pans, and full 360° orbital tracking. The camera moves in sync with punches, kicks, and evasions, with responsive shake and displacement. One continuous take, no cuts. Sharp push-pull tracking, sudden directional shifts, fully integrated into the fight for a face-to-face immersive combat perspective. Smooth, zero-latency motion.
Master-level precision editing:
0–3 seconds: ultra-fast cuts to enter combat.
3–10 seconds: seamless action continuity with zero stutter or breaks.
10–15 seconds: dense, violent rapid-cut climax.
Editing rhythm escalates with combat intensity. Fast intercutting enhances chaos. Precise beat timing, no redundant shots. Constant suffocating high-speed pacing.
Environment Setting
Modern open-plan office with desks, computers, cabinets, printers, and swivel chairs; realistic layout with cool overhead lighting and side backlight for strong contrast. Slightly cramped space for added tension; no glass-breaking.
Environmental Interaction & Constraints:
Combat affects the space only through natural, physically justified collisions—papers scatter from air movement (not attacks), chairs slide or topple on impact, desks shift slightly without deliberate damage, and screens remain intact with minor vibration. No unnecessary or intentional destruction of equipment.
Body-driven combat: grappling, pinning, counter-throws, and realistic slams with subtle impact cracks/vibrations. Natural airflow—light paper/dust movement only, no exaggeration.
Final Action Design
Final 2–3s: both throw simultaneous close-range punches, freezing inches before impact—fully tensed, heavy breathing, locked in perfect deadlock.
Ultra-realistic cinematic magic realism, 4K, golden hour lighting, vibrant chic city street, shallow depth of field, smooth tracking motion, upbeat French house / nu-disco vibe.
A confident young woman walks through a sunlit cobblestone avenue in a flowing white summer dress and flats. The city feels lively with boutiques, reflections, and warm lens flares.
⏱️ TIMELINE (15s FLOW)
0:00–0:03
She walks casually down the street. Camera tracks smoothly. She pauses at a luxury boutique window showing a cherry-red dress and gold heels reflection.
SFX: light footsteps, soft city ambience
0:03–0:06
Close-up — she smirks and lifts her hand. Magical energy builds subtly. The red dress and gold heels visually transform into glowing ribbons of light moving toward her.
SFX: rising magical hum, soft shimmer
0:06–0:08
Quick transformation — the light wraps around her and her white outfit seamlessly turns into a fitted cherry-red dress with gold stiletto heels.
SFX: fabric snap, magical chime, heel click
0:08–0:11
Low-angle tracking shot — she walks confidently down the street, dress flowing, heels clicking, golden sunlight reflecting off fabric.
SFX: rhythmic footsteps, music drop
0:11–0:13
She passes a vintage ice cream cart. A cone is offered — she playfully flicks her hand mid-stride, and the cone glides into her hand smoothly.
SFX: whoosh, small bell ding
0:13–0:15
She catches it perfectly, looks into camera with a cheeky confident smile, and takes a bite while walking forward.
SFX: upbeat music peak, light crowd ambience
FORMAT: 15s / slow cinematic pacing / 1 continuous shot with subtle camera movement
SUBJECT: A young woman matching the exact appearance of the reference image (same face, hairstyle, and features). She is wearing a comfortable, modest outfit — a simple half-sleeve dress that is relaxed-fit, non-revealing, and suitable for a calm evening outdoors.
SCENE: She is sitting on a quiet park bench at night, surrounded by tall trees. The environment is peaceful and still, with soft ambient sounds implied. Warm streetlights glow faintly in the background while a clear sky full of stars stretches above.
ACTION: The shot begins from behind her shoulder, slowly pushing in as she tilts her head upward. She gently looks at the stars, her expression calm and thoughtful. A light breeze moves her hair slightly. She takes a soft breath, eyes reflecting the night sky, fully absorbed in the moment.
CAMERA: Smooth dolly-in from a medium-wide over-the-shoulder shot to a soft close-up profile. Shallow depth of field, subtle bokeh from distant lights, no abrupt cuts.
LIGHTING: Natural moonlight mixed with dim, warm park lights. Soft highlights on her face, gentle shadows, realistic night exposure.
STYLE: Ultra-realistic, cinematic photography, soft film grain, muted tones, serene and introspective mood, 4K detail.
SUBJECT: A tired woman in @ image1 in a loose tank top and sleep shorts, slow habitual movement. Slightly smeared eyeliner, bare feet, heavy posture, detached face.
ENVIRONMENT: A cramped cluttered apartment with an unmade mattress, scattered clothes, a narrow hallway, a damp bathroom with dim tile reflections, and a tiny kitchen crowded with dirty dishes and empty bottles. Warm practical lamps mix with sickly green neon leaking through blinds and door glass, turning the rooms into a humid late-night maze.
MOOD: Detached routine turns quietly uncanny, as if an unseen presence is floating above her and waiting for her to notice.
COLOR LOGIC: Matrix Green Look
CAMERA: POV overhead follow in a strict bird's eye view, locked directly above the top of her head at all times, perfectly centered over her body from start to finish, floating smoothly with no shake, tilt, angle drift, or side offset, passing through ceilings and door frames as one uninterrupted camera event. 24mm wide, digital clean look, locked overhead tracking package throughout.
SCENE:
She wakes on the mattress. Sits up under the lens.
Still centered under the lens, reaches to the floor. Picks up the cigarette and lighter. Places the cigarette between her lips. Lights it. Drops the lighter on the mattress. Stands up with the cigarette still with her.
Under the same overhead lock, crosses the cluttered room.
The lens tracks directly above her into the narrow hallway. Enters the bathroom.
Still pinned overhead, keeps the cigarette in her right hand. Extends that arm away from the running water. Leans on the sink. Turns on the tap with her left hand. Splashes water onto her face with her left hand.
The overhead follow carries her back out of the bathroom into the hallway. Moves along the hallway past the bathroom door. Turns into the tiny kitchen.
Still centered under the camera, keeps the cigarette with her. Reaches across the dirty dishes with her free hand. Picks up a glass from the counter.
Stops exactly under the lens. Holds the cigarette in her right hand and the glass in her left hand.
The lens stays fixed above her. Freezes. Looks right. Looks left. Takes a drag from the cigarette. Snaps her head straight up into the lens. Blows smoke toward the camera. Locks eye contact.
SFX: lighter flick, inhale, faint city hum, refrigerator buzz, soft bare footsteps, water run. Sodium amber particles, toxic green neon reflect off tile, smoke, bottles, and damp surface.
FORMAT: 12–15s / 24fps / smooth handheld-stabilized camera / soft rhythmic pacing
SUBJECT: A young woman (use uploaded reference for exact facial identity), expressive and calm, just waking up. Facial fidelity must remain consistent throughout.
WARDROBE: Fitted high-waisted trousers + minimal crop t-shirt. Clean, modern, non-revealing styling. Neutral tones.
ENVIRONMENT: Cozy bedroom interior at night (early evening transitioning into night). Warm ambient lighting mixed with soft practicals (desk lamp, LED strips, window spill). Clean aesthetic, minimal clutter.
LIGHTING: Soft key light focused on face (front-left), gentle falloff. Warm highlights with subtle cool contrast from window. Skin tones accurate and natural. Slight glow on face.
CAMERA: Begins with a medium-wide bedside shot → slow push-in to chest-up framing. Subtle lateral drift for natural movement. No cuts until final moment.
ACTION:
Starts with her lying on the bed, eyes closed.
She slowly wakes up, blinking naturally, adjusting to the light.
Subtle stretch or shift in posture, relaxed and unhurried.
Sits up on the bed, calm expression, soft neutral mood.
Brief pause as she gathers herself.
EXPRESSIONS: Peaceful, slightly drowsy transitioning into calm awareness. Minimal movement, grounded presence.
AUDIO SYNC (implicit): Soft, ambient evening tone — low, gentle background music or room tone.
FINAL MOMENT (transition):
She stands or leans toward the window.
Camera gently follows from behind/side.
She looks outside.
ENDING VISUAL: Through the window: a calm, cinematic night cityscape — soft lights, distant buildings, slight atmospheric haze. Cool blue tones contrast with warm interior.
STYLE NOTES: Ultra-realistic, no over-stylization. Natural skin texture, accurate anatomy, no distortion. Maintain temporal consistency and identity lock from reference image.
A cinematic 15-second video of a young woman in a modern kitchen making fresh orange juice. The scene starts with her opening the refrigerator and taking out bright, fresh oranges. She places them on the counter and begins peeling them smoothly. Next, she puts the orange slices into a juicer machine and presses it, showing the juice being freshly extracted. She then pours the juice into a clear glass, adds a few ice cubes, and gently stirs it. Finally, she lifts the glass, takes a refreshing sip, and smiles. Soft natural lighting, clean aesthetic kitchen, smooth transitions, realistic motion, high detail, 4K quality.
A teenage boy, 17, athletic build, messy dark hair, wearing faded yellow shorts, a worn white tank top, and scuffed trainers, sprints across the rooftops of São Paulo's dense favela skyline as the sun burns deep orange behind the city. This time, he's not just running, he's being chased.
[0s–1.5s] Wide panoramic shot of São Paulo at sunset. Endless stacked concrete buildings. A lone figure runs across a rooftop. Two more figures appear behind him, gaining ground.
[1.5s–3s] Close tracking shot from behind. He sprints across corrugated metal roofing, footsteps echoing. A pursuer lunges. Without slowing, he sidesteps and elbows him mid-run, sending him crashing into a clothesline. He clears a rooftop gap in one fluid motion.
[3s–5s] He slides under a water tank, rolls, then pops up into a spinning back kick as another attacker drops in front of him. The hit lands clean. He vaults over a wall onto a lower rooftop, silhouette sharp against the orange sky.
[5s–7s] Running along a narrow ledge, 15 stories up. One chaser grabs his shoulder from behind. He twists, breaks free, and shoves him off balance. A chunk of concrete falls into the street below. He keeps moving, never looking back.
[7s–9s] A wide gap ahead. He builds speed. Just before the jump, another attacker cuts him off. A quick exchange of punches, one dodge, one body shot. He uses the momentum to push off the opponent and launches across the gap. Time slows as the camera circles him mid-air.
[9s–11s] He catches a fire escape railing. A pursuer follows and grabs the same structure. Hanging mid-air, they struggle. He kicks the attacker away, swings through, and climbs up to the next level in one motion.
[11s–13s] Climbing rapidly along window frames and AC units, he pulls himself higher. Below, the city transitions from sunset orange into glowing purple nightlife. Sirens faint in the distance. The chase fades.
[13s–15s] He reaches the rooftop edge. Stands still. Breathing heavy. Arms slightly out as wind hits him. The city stretches endlessly beneath him, lights flickering alive like stars. Cut to black.
Keywords: São Paulo rooftops, parkour chase fight, rooftop combat, cinematic action, sunset to night transition, dynamic camera, 4K.
A man in his late 20s, casual white t-shirt and jeans, holds up a green smoothie and takes a sip, then smiles at the camera. Bright kitchen, morning sunlight from behind. Handheld, slight natural shake. Warm tones, authentic, documentary style.
Cinematic short film, photorealistic VFX. 15-second desert hunt at high noon. Blinding light, zero shade, heat shimmer on everything.
[0–2s] Overhead drone — red desert canyon system. Sandstone towers, wind-carved arches, dry riverbeds. One figure: a woman on a sand-skiff — a flat wooden board with a triangular sail, gliding across compacted sand on bone runners. She wears wrapped linen armor, copper goggles, a weighted net coiled at her waist. She watches the sand.
[2–4s] ECU the sand surface — it's BREATHING. Rippling outward from a point 200 yards ahead. The sand rises like a slow dome. The sail-woman cuts the skiff hard right. The dome ERUPTS — a sandworm. 40 feet, segmented, chitinous plates the color of rust. Its mouth is a vertical iris lined with grinding teeth like a rock crusher. Sand cascades off its body. It roars — the sound is DEEP, below human hearing, but you feel it shake the frame.
[4–7s] Chase through the canyon — the skiff banks between sandstone pillars. The worm follows UNDERGROUND — you see its path as a moving ridge of sand, smashing through the earth, pillars cracking as it passes beneath their foundations. She throws the weighted net — it wraps around a sandstone tower, rope trailing to her skiff.
[7–10s] The worm surfaces directly in the skiff's path. She leans the board, catches wind, and the skiff JUMPS — airborne over the worm's back. In mid-air she drives a barbed stake into the top of its head plate. Lands the skiff on the other side. Rope connects stake to the sandstone tower.
[10–13s] The worm charges away — the rope goes taut against the tower. The tower HOLDS. The worm's momentum whiplashes it sideways. It crashes into a canyon wall. Sandstone debris cascades. Dust cloud erupts like a bomb. Silence inside the cloud.
[13–15s] Dust settles. The worm lies against the canyon wall, breathing but pinned. She brings the skiff alongside, pulls her goggles up. Beneath the goggles — calm, focused eyes. She draws a curved knife and begins cutting a single iridescent scale from the worm's flank — the harvest. That's all she came for. One scale. The worm hisses but doesn't fight. She pats its flank once, takes the scale, pushes off. The skiff catches wind. She vanishes between the sandstone towers. Cut to black. Ultra-realistic.
FORMAT: 15s / free rhythm / 1 seamless match cut / uninterrupted camera movement until the cut + action beginning immediately from frame one
SUBJECTS:
A solitary woman armed with a sword, dressed in worn fur and leather survival gear, struggles against a huge polar bear using raw, two-handed defensive movement. Later it is revealed that the same woman is inside her home wearing relaxed indoor clothing, where a VR headset appears only after the match cut and is removed in one clear motion.
ENVIRONMENT:
Open frozen tundra beneath harsh winter daylight, wind dragging powder snow across pale blue ice. The sequence transitions into a modest, lived-in interior through a carefully aligned visual match. The biting cold, visible breath, and glare of the wilderness give way to warm clutter, window light, and a faint glow from the game.
MOOD:
Immediate life-or-death intensity that abruptly resolves into everyday reality while preserving physical continuity of motion.
COLOR LOGIC:
Naturalistic cinematic film print emulation.
TIMELINE
0:00–0:07
The shot begins instantly in motion. A handheld wide shot collapses toward a medium-close framing as the woman retreats across frozen ground while the polar bear charges through blowing snow. The camera runs beside the action at roughly eye level, beginning near 28mm and gradually tightening toward 35mm, slightly unstable but close enough to keep both figures physically grounded in frame.
The bear rapidly closes distance while she plants her feet, recoils, and keeps the sword positioned defensively between them.
SFX: howling wind, boots grinding against ice, deep animal roar, fabric strain, blade slicing through air, snow scraping across the surface.
Hard winter sunlight side-lights the terrain, casting long blue shadows across the ice.
0:07–0:11
The movement continues without a cut, pushing into a tight close-up as the bear lunges into the final distance. Claws reach toward her shoulders and its jaws dominate the edge of frame.
At the height of the attack, a man's voice calls out: "Karla…" then louder: "KARLA."
She answers with a tired "Off."
At that exact response, time collapses into slow motion. Snow particles hang nearly still, the bear suspended mid-strike, while she alone continues moving at normal speed. The camera slowly arcs around her face in a clockwise drift.
Unimpressed rather than frightened, she lets the sword fall and raises both empty hands toward her temples in one smooth interruption gesture. No headset or device exists in this frozen world.
The camera maintains identical face scale, hand height, head tilt, lens distance, and rotational drift until the match cut.
SFX: fabric tension approaching impact, the distant voice calling Karla… KARLA, her quiet Off, wind stretching and fading toward silence.
Bright winter light catches suspended snow crystals around her face.
0:11–0:15
MATCH CUT.
The close-up aligns perfectly with the new setting. As her hands pass through the same screen position, the frozen tundra becomes a small home interior. The camera keeps the same clockwise drift and framing as the motion continues uninterrupted.
For the first time, a VR headset is visible over her eyes. She grips both sides and pulls it upward in a single smooth action. The camera widens into a medium shot as the headset lifts above her forehead.
She steps into a compact living room wearing loose indoor clothing. The handheld orbit reveals couch edges, scattered blankets, and cool daylight from a window. Her body language relaxes into mild irritation.
She looks toward the unseen voice, rolls her eyes slightly upward, and says:
"What is it."
Lens: 35mm, natural spherical look.
SFX: headset strap stretching, plastic shifting, quiet room ambience, soft footstep on the floor, faint game audio fading out, her breathing settling, her dry voice asking "What is it."
Indoor daylight replaces the stark winter contrast.
been testing a different workflow lately using tapnow. what makes it interesting is how it structures the entire process from idea → visuals → final video. instead of jumping between tools, you can actually build everything in one flow and refine it step by step like a real production pipeline. for the visuals, i'm using seedance 2.0 which is currently one of the strongest models for photoreal, human-centered video. but quick note — seedance 2.0 is currently only available in selected regions and requires a verified corporate email to access. still, the direction is clear: AI video is moving from "generation" → into "directing". also, they just launched a global challenge called "10,000 Parallel Universes" with a $200K prize pool. if you're exploring cinematic AI workflows, this is actually a good place to test ideas and push concepts further.
Food that feels alive when you try @yapper_so
Prompt
Hyper-realistic 4K food video, Thai street food cooking, Pad Kra Pao Moo in black wok, extreme close-up cinematic style, professional food cinematography, natural kitchen lighting with steam highlights,
Scene 1: pouring dark glossy sauce into screaming hot wok, instant sizzle explosion, aromatic steam burst, oil dancing, dynamic camera push-in
Scene 2: cracking soft egg directly into the mixture, yolk dramatically bursting and flowing like liquid gold over minced meat, slow-motion detail
Scene 3: adding fresh red chilies and holy basil leaves, high-heat stir-fry, leaves wilting instantly, glossy caramelized pork, intense motion and sizzle
Scene 4: final plating over steaming white rice, perfect composition, shallow depth of field, mouth-watering close-up, freeze frame at the end, photorealistic textures, no text, no watermark, ultra realistic, shot on iPhone 16 Pro food reel style
--ar 9:16 --stylize 250 --v 6 --q 2
Ultra realistic cinematic night scene, shallow depth of field, neon bokeh lights, a stylish young man (same face as reference image, sharp jawline, short styled hair, light beard, confident personality) walking confidently on a city street, holding a white coffee cup in one hand and scrolling his phone with the other, natural expression, slight tongue movement as if he has just finished eating, casual relaxed vibe, restaurant street ambiance
No camera cuts, no zoom, smooth steady tracking shot, same framing, cinematic lighting, orange and teal color grade
Suddenly, a loud metallic vibration sound, intense rumble, a heavy circular metal tunnel cap bursts out from the road and flies into the sky, sparks and dust particles, people around panic and look shocked, chaotic environment, but the man remains calm and looks up with curiosity
The metal cap crashes down violently, ground cracks open with extreme pressure, debris flying, from beneath emerges a massive alien creature, elephant-sized, biomechanical texture, glowing veins, aggressive stance, tearing through the underground tunnel
Extreme close-up of alien face, hyper-detailed skin, breathing heavily, then extreme close-up of the man's face, calm and fearless
The man casually puts his phone into his pocket, throws the coffee cup aside, instant transformation begins — futuristic white ice combat suit assembling over his body with micro mechanical details, glowing frost energy, ultra detailed armor formation, cinematic closeups (same suit as reference image, white glossy armor, glowing blue lines, helmet visor dark reflective)
The creature charges aggressively, pushing cars and taxis aside with force, destruction on street
At the last moment, the man teleports behind the creature in a burst of icy energy
Creature turns — instantly a powerful punch lands on its face
Extreme hyper slow motion close-up: face distortion, shockwaves, skin rippling, massive impact force, cinematic motion blur
NEW HERO MOMENT:
After the punch impact, the man is suspended mid-air in a powerful superhero pose, perfectly stable, surrounded by flying frozen debris and ice shards as the creature explodes into pieces, ultra cinematic lighting, particles glowing, frozen fragments spinning in slow motion
Camera slightly orbits him while maintaining cinematic framing, ice particles reflecting neon lights, epic hero aura
Then he slowly descends back to the ground with controlled motion, calm and dominant presence
As he lands, the ice combat suit begins to disassemble smoothly into glowing particles, transitioning back to his normal human look (same face consistency as beginning), seamless transformation
He adjusts his posture casually, starts walking again like nothing happened
People around stand frozen in shock, watching him silently
Cinematic ending, soft ambient sound, hero walks away into neon-lit street
This 21-second sequence in Seedance 2.0 started as one idea, but the first generation basically asked for a different story direction. I ended up with a video stitched from three separate generations. The first one was pure text-to-video (no references at all); then I extended the scene twice by 8 seconds each time, using the previous output as a video reference. Even after those two extensions, the consistency stayed rock-solid — same characters, location, colors, and overall audio mood across the whole sequence.
Highly detailed full-body shot of a premium life-size Iron Man Mark XLII-inspired robot statue made of polished mirror chrome and silver metal with intricate mechanical details, exposed pistons, rivets, panel lines, and articulated armor segments. The suit has a glowing bright blue circular arc reactor in the chest with energy rings, matching glowing blue eyes in the helmet, and open knee compartments revealing complex glowing blue circuitry and batteries inside. Dramatic studio lighting with strong reflections on the shiny metallic surfaces, subtle wear and battle damage on the armor. Dynamic three-quarter pose turning slowly on a display stand, cinematic volumetric lighting, hyper-realistic textures, 8k photorealistic, sharp focus, masterpiece, best quality.
Use a base identity reference image and preserve the subject's face 100% (no beautification or changes), with Elle Fanning as the main character while strictly maintaining her natural facial features, proportions, and realism. Place her in a modern ultra-high-rise office at night with floor-to-ceiling windows overlooking a vast city skyline. Add realistic investigation-style paper notes on the walls (wrinkled, taped, layered, partially overlapping). Use cinematic corporate lighting (cool blue city light + soft overhead + subtle warm desk lamp), shot on a 35mm lens with shallow depth of field, in a hyper-realistic documentary style, 4K quality. Then create a second image of a tall luxury NYC residential skyscraper at night, viewed from a distance with surrounding buildings, wet streets, atmospheric haze, cool exterior tones, and warm interior penthouse lights, shot on a 135mm telephoto lens with realistic proportions. Finally, generate an 8–10 second 1080p Kling 3.0 video using the skyscraper as the opening frame and the office as the ending frame, with a slow cinematic push-in toward a specific lit window, natural glass reflections, seamless transition into the interior (no cuts or morphing), realistic exposure shift from exterior to interior, and Elle Fanning remaining still throughout with only subtle breathing, no expression change.
cinematic
documentary
realistic
night
urban
portrait
Subject: A high-action cinematic shot of a blonde, bearded man in a black tactical combat suit with carbon-fiber-textured padding, engaging in a physical fight with a large polar bear.
Details:
The Man: Caucasian, muscular build, mid-length wind-swept blonde hair, thick blonde beard. He is wearing a matte black tactical suit with integrated chest and back armor plates.
The Polar Bear: Hyper-realistic, large adult polar bear with thick, off-white fur. The bear is wearing large, black leather-textured boxing gloves on its paws.
Composition & Pose: Low-angle, dynamic action shot. The man is caught mid-motion, leaning back as he narrowly dodges a massive punch from the bear. The bear's glove is inches from the man's face.
Setting: A flat, concrete rooftop of a modern building at sunset. In the background, a hazy city skyline (resembling Los Angeles) with skyscrapers, construction cranes, and glowing orange sunlight.
Lighting: Strong golden hour lighting coming from behind the city, creating long, dramatic shadows across the rooftop and a bright rim-light effect on the hair of the man and the fur of the bear.
Camera Specs: Cinematic wide-angle lens, sharp focus on the man's face and the bear's glove, shallow depth of field with the background city slightly blurred.
Style: Photorealistic CGI, high-fidelity textures, 8k resolution, motion blur on the man's hair to indicate rapid movement.
Aspect Ratio: 9:16 (Vertical)
FORMAT: 15s / 145 BPM / 15 SHOTS / beat-synced routine
SUBJECT: @[image1] < ATTACH YOUR IMAGE.
WARDROBE: Sleep tee and lounge shorts at home. Tailored jacket, fitted top, trousers, and lace-up shoes outside.
ENVIRONMENT: Tiny apartment, bright fridge glow, rain-dusted hallway, chrome metro, clean office, then a bedroom in cool window light. Everything feels glossy and lived-in.
MOOD: Late-for-work panic, clipped momentum, breathless urgency, then an exhausted exhale.
MUSIC: Fast percussive electro-pop
COLOR LOGIC: Hyperreal Pop Look
STYLE: Ultra-Realistic.
LOGIC RULE: Keep logical consistency in wardrobe, props, locations, and action continuity across all shots.
SHOT 1: ECU, 85mm push-in / 06:50 on the phone screen as it shakes on rumpled sheets. / SFX: alarm, sheet rustle.
SHOT 2: WS, 35mm handheld jolt / Rhythmic cut into her jolting upright through side light, throwing the blanket aside, and planting her feet on the floor in one rushed motion, still in a soft sleep tee and lounge shorts. / SFX: mattress bounce, blanket whip, sharp breath.
SHOT 3: MCU, 50mm slide / Cut on action into face wash at the sink, droplets catching the top light. / SFX: faucet rush, water slap.
SHOT 4: Insert shot, 85mm rack focus / Match cut into the toothbrush held at a natural forward brushing angle against the front teeth, hand relaxed and upright, mint foam and mirror eye. / SFX: bristle scrape, sink drip.
SHOT 5: Interior fridge view, 24mm wide / Object pass into the camera inside the fridge looking out as the door snaps open and her hand darts in, blue fridge light framing a hurried grab for breakfast ingredients. / SFX: fridge hum, bottle clink, shelf rattle.
SHOT 6: Insert shot, 50mm handheld / Rhythmic cut into eggs and toast hitting the pan under warm practical light. / SFX: butter sizzle, chop tap.
SHOT 7: MCU, centered 50mm push-in / Match cut into one rushed bite, a quick clock glance, and an immediate rise from the chair. / SFX: crunch, ceramic clink, chair scrape.
SHOT 8: Bird's-eye insert, 35mm overhead / Cut on action into striped socks snapping on. / SFX: fabric stretch, heel tap.
SHOT 9: MS, 35mm pivot / Camera wipe into a rushed outfit change as the sleep tee disappears under a fitted top and tailored jacket, then her tote, keys, and transit card get scooped up in one messy grab. / SFX: fabric whip, key jingle, zipper pull, bag rustle.
SHOT 10: Insert shot, 50mm overhead / Match cut into lace-up shoes slamming on as the laces yank tight in one impatient pull. / SFX: sole thump, lace tug, short breath.
SHOT 11: WS, 24mm parallax / Whip pan transition into her, now in the tailored outside outfit, rushing through the apartment door into corridor light without breaking stride. / SFX: latch click, rapid footsteps, hallway air.
SHOT 12: MS to CU, 35mm glide into 85mm push-in / Sound bridge into the metro car interior only as she grips the pole, shifts with the carriage sway, checks the passing station lights, and snaps a tense glance toward the closing doors, reflected chrome streaking around her and the city smearing outside the window. / SFX: rail clatter, carriage screech, door warning chime, tight breath.
SHOT 13: Insert to MCU, 50mm snap zoom / Smash cut to the office entrance as her access card hits the reader, the glass door unlocks, and she slips through fast before the chair roll and laptop open. / SFX: badge beep, door click, laptop chime.
SHOT 14: OTS, 35mm handheld / Rhythmic cut into fingers racing across keys, chat windows blinking, coffee by the trackpad, and notifications stacking faster than she clears them. / SFX: keyboard burst, notification ticks, mouse click.
SHOT 15: WS, 50mm pull-out / L-cut with a match from laptop close to apartment re-entry as the jacket drops, work clothes peel away, and she changes back into sleepwear before collapsing into bed in the opening frame shape. / SFX: door shut, bag drop, fabric rustle, blanket rustle, room tone.
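The FORMAT line above pairs a 15s runtime with 145 BPM and 15 shots, but the prompt does not say how shots map onto beats. The following is a minimal Python sketch of one possible mapping, assuming every cut should land on a whole beat and the shots should be as even as possible. The function name `beat_cut_times` and the rounding rule are illustrative assumptions, not part of the original prompt or of any video model's API.

```python
def beat_cut_times(duration_s: int, bpm: int, shots: int) -> list[float]:
    """Return shots + 1 cut times in seconds, each landing on a whole beat.

    Hypothetical helper: the even-spread rule is an assumption for illustration.
    """
    total_beats = (duration_s * bpm) // 60  # whole beats that fit in the clip
    if shots < 1 or total_beats < shots:
        raise ValueError("need at least one beat per shot")
    seconds_per_beat = 60.0 / bpm
    # Spread the beats as evenly as possible; integer math rounds half up.
    beat_marks = [
        (2 * i * total_beats + shots) // (2 * shots) for i in range(shots + 1)
    ]
    return [mark * seconds_per_beat for mark in beat_marks]


if __name__ == "__main__":
    cuts = beat_cut_times(15, 145, 15)
    for n, (start, end) in enumerate(zip(cuts, cuts[1:]), start=1):
        print(f"SHOT {n}: {start:.2f}s to {end:.2f}s")
```

Under these assumptions, 36 whole beats fit into 15 seconds at 145 BPM, so each shot lasts either 2 or 3 beats (roughly 0.83s or 1.24s). The resulting times could be used as a planning aid when trimming or re-timing a generated clip.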
FORMAT: 15s / 135 BPM / 13 SHOTS / beat-synced
SUBJECT: @[image1]
WARDROBE: Neutral streetwear, long coat
ENVIRONMENT: Busy city street → everything frozen mid-motion
MOOD: Confusion → curiosity → quiet control
MUSIC: Pulsing ambient electronic
COLOR LOGIC: Muted tones with sharp highlights
STYLE: Ultra-real cinematic
SHOT FLOW:
CU phone glitching time (08:12 → stuck)
Street crossing — people suddenly freeze mid-step
Coffee splash frozen in air
Paper flying — static mid-air
SUBJECT slowly walking through frozen crowd
Hand passing through suspended raindrops
Eye-level tracking through still chaos
Close-up: realization expression
Camera orbit — subject only moving element
Subtle smile / calm shift
Clock ticks again
Everything snaps back into motion
SUBJECT standing still as world rushes past
A colossal futuristic white-and-gray mecha robot named "Xeno Leviathan Terraformer", massive scale, intricate mechanical details, glowing cyan/blue energy accents on joints and eyes, hovering powerfully above Earth's atmosphere with clouds swirling around its legs, dramatic low-angle cinematic view.
In the foreground below: Godzilla with glowing blue atomic spines roaring aggressively on the left, and a massive King Kong (Gorilla) on the right, both tiny compared to the giant robot, standing in a coastal city being destroyed.
The robot is smashing into the ocean, creating enormous tsunamis, white water splashes, huge clouds of dust and smoke, flying debris, cracked earth. Epic scale, god-like perspective, planet curvature visible in the background, dramatic sky with stars and atmosphere.
Ultra-detailed, cinematic lighting, volumetric fog, dynamic motion, epic destruction scene, best quality, 8k, photorealistic yet stylized, in the style of high-end sci-fi concept art --ar 9:16 --stylize 250 --v 6
Top-down fixed camera on a clean white marble surface with soft, bright natural lighting shows a clear glass bowl of pale yellow egg yolk mixture being gently whisked by a realistic hand while another hand adds a thin stream of vanilla extract, creating soft swirls that blend smoothly without splashing; the scene then transitions seamlessly to a second glass bowl where egg whites are whisked more rapidly, transforming from transparent liquid into a light, airy foam with visible motion blur, surrounded by neatly arranged ramekins and eggs, maintaining a minimal aesthetic, consistent lighting, natural hand movement, and an ultra-realistic cinematic food photography style.
Cinematic, apocalypse, post-apocalyptic, photorealistic, 15 seconds, 16:9
[00:00-00:05] Wide shot. A dark, red-tinged sky. Flaming asteroids crash into a coastal city. Buildings collapse. Clouds of dust and fire. Hard, chiaroscuro lighting with deep shadows.
[00:05-00:10] Medium back shot. A lone figure, a man in tattered clothes, stands on a rocky cliff edge, overlooking the destruction. Wind blows his hair. Camera does a slow orbit around the figure.
[00:10-00:15] Close-up shot. The man's face, covered in dust. He slowly looks up towards the sky. His eyes reflect loss and determination.
Final wide shot of the burning city with the lone figure in the foreground. Atmospheric haze and dust particles.
Massive cinematic sumo wrestling match inside a traditional Japanese sumo arena, two enormous sumo wrestlers collide with incredible force in the center of the clay dohyō ring, their bodies slamming together as they push and grapple intensely, sand and dust erupting under their feet.
A packed arena surrounds them with thousands of spectators shouting and cheering loudly, banners waving, dramatic arena lighting illuminating the fighters while the crowd fades into shadow.
Camera: low-angle cinematic hero shot, slow push-in toward the wrestlers as they collide, brief slow-motion during powerful impacts, subtle camera shake for realism.
Style: ultra-realistic cinematic sports film, dramatic lighting, high detail sweat and skin textures, epic atmosphere, movie-quality color grading, shallow depth of field, 4K realism.
A lone man struggles to steady himself on a small boat in the middle of a violent ocean storm. Thunder cracks and heavy rain lashes down as towering waves crash around him. Suddenly, a sea monster bursts from the dark water, its massive jaws opening wide. It clamps its teeth onto the boat, splintering the wood, and violently drags it beneath the churning ocean as the man fights for his life. Dramatic lighting, cinematic camera angles, hyper-realistic, intense atmosphere.
Cinematic ultra-realistic 15-second short film, western girl in a dark jacket walking through a busy city sidewalk, natural daylight, hard shadows, shallow depth of field, shot on Arri Alexa Mini, 50mm lens. She walks confidently through moving pedestrians, phones, conversations, and pigeons flying in a bright sky. She snaps her fingers — a white spherical shockwave expands outward, freezing everything instantly: people mid-step, dust and leaves suspended, pigeons frozen mid-flight. Silence. She calmly walks through the frozen world, observing everything, gently touching a suspended pigeon. She stops in front of a fully clothed woman in a flowing red dress frozen mid-motion, studies her with a calm expression. She then snaps again — a stronger reverse shockwave restores motion across the city. Pedestrians resume walking, pigeons scatter into flight, leaves fall naturally. She turns and walks away as the camera pulls back into a wide cinematic aerial shot. Fade out.
Attached the 4 references and used this prompt:
[CINEMATIC SETUP]
Film stock: 35mm Kodak Vision3, anamorphic lens, f/2.8.
Color Grade: High-contrast "Bleach Bypass" look with desaturated earth tones and deep shadows.
Lighting: Dim, volumetric moonlight filtering through thick fog; dramatic rim lighting on characters.
Atmosphere: Heavy lingering fog, swirling dust particles, and organic debris.
The four characters [ @ image 1, @ image 2....] are in a defiant and ominous pose.
[STYLE & QUALITY BOOSTERS]
Photorealistic 8K, ultra-detailed textures, cinematic lighting, perfect motion blur, high dynamic range, no artifacts, coherent multi-character interaction.
A 90s era home video, she is street dancing on a warm city street at dusk in baggy 90s clothes to an early 90s hip-hop track, a group of people are around her cheering her moves, especially when she pulls out a massive move
A cinematic cyberpunk portrait of a beautiful young East Asian woman with long flowing ash-blonde hair with dark roots, standing on a high-rise balcony overlooking a futuristic neon-lit megacity at night. She is a cyborg with a sleek black and silver mechanical right arm and shoulder, intricate metallic joints and exposed wiring visible. She wears a stylish cropped rust-red leather jacket with silver buttons over a black leather outfit with cutouts on the thighs. A large futuristic mechanical sword rests in her mechanical hand.
She has striking facial features, sharp eyes with subtle eyeliner, and a cool, confident expression. Her long hair dramatically blows and flows in the wind throughout the shot. She slowly turns her head and upper body from a three-quarter view facing the camera toward her right side, then slightly away, ending in a profile and back view as her hair whips across her face. The camera performs a slow, smooth orbiting movement around her while subtly tilting up and down, creating dynamic angles.
Moody cinematic lighting with strong rim lights from the city glow, soft bokeh lights from background skyscrapers, subtle lens flares, and atmospheric depth. Highly detailed, photorealistic, 8K, dramatic color grading with teal and orange tones, cyberpunk aesthetic inspired by Blade Runner 2049 and Ghost in the Shell. Slow motion feel, elegant and powerful atmosphere, 10 seconds duration, 16:9 aspect ratio.
Cinematic realistic animation, static locked wide camera. Cozy kitchen, morning light through window. Orange tabby cat in striped apron stands upright at wooden counter, flour dust on fur, all ingredients visible: milk bottle, flour bowl, fresh eggs carton, vanilla extract, sugar, whisk, mixing bowl, chocolate chips, blueberries, butter, stack of finished pancakes already visible on right side of counter. Pancake cooking on stovetop pan in background.
Cat grabs egg carton with both paws, cracks egg firmly on bowl edge — yolk drops in, shell tossed aside casually. Scoops flour with small paw into bowl — white dust cloud puffs up, coats cat's nose, cat blinks. Pours milk from bottle — steady glug. Drops vanilla extract carefully. Both paws grip whisk, beats batter vigorously in bowl — rhythmic clinking, batter splashes slightly, cat's whole body moves with effort. Cat peers into bowl, satisfied. Ladle scoops batter, carries it to stovetop pan — pours perfect circle, gentle sizzle. Cat watches bubbles form on surface, spatula ready in paw. Confident flip — golden underside revealed, fresh sizzle. Pancake added to growing stack. Handful of blueberries and chocolate chips scattered on top. Butter slice placed — melts slowly. Maple syrup poured in thick amber stream.
Cat picks up finished plate with both paws, turns to face camera directly, extends plate forward toward lens — single clear "MEOW." Whiskers twitch. Static camera throughout, never moves. Ambient kitchen sounds only: whisking, sizzling, butter melting. 15 seconds.
{
"prompt": "Cinematic scene on the Mongolian steppe. A young Asian woman with long black braided hair and a white headband stands in the middle of a vast grassland. She is wearing a thick white fur coat and gently holding a small brown lamb in her arms. Behind her are two people dressed in traditional brown fur nomadic clothing standing near white yurts. The wind softly moves the woman's hair and fur coat. The woman looks down at the lamb with a calm, emotional expression. In the background are large green mountains and an endless steppe under soft daylight. The camera slowly pushes in toward the woman creating a dramatic cinematic feeling. Natural lighting, ultra realistic, shallow depth of field, filmic color grading, epic cinematic composition.",
"style": "cinematic film",
"camera": "slow dolly in, shallow depth of field",
"lighting": "soft natural daylight",
"quality": "4K ultra realistic",
"duration": "5-8s"}
NASA APOLLO SPACESUIT A7L
"WORN ON ANOTHER WORLD"
DNA: July 20, 1969. 102 hours, 45 minutes, 40 seconds into the mission. One small step.
Arri Alexa 65. Ultra-wide. Grain progressive — starts clinical-clean, ends at maximum as the lunar surface appears. Key light: cold fluorescent Mission Control white. Secondary: harsh unfiltered solar light — no atmosphere to diffuse it. Flares: none in the lab sequence. One single overwhelming flare as the visor catches unfiltered sunlight on the lunar surface. Background: white clean room → vacuum of space → lunar surface at Tranquility Base.
00:00–00:02 · THE LAYERS
A single thread of nylon being measured under a magnifying glass by a white-gloved technician's hand. Then: cut to the 21 layers of the A7L suit being assembled simultaneously — each layer materializing and wrapping the suit form: the liquid cooling garment first, its tubes threading through the fabric like a vascular system. Then the pressure bladder. Then the restraint layer. Then the thermal micrometeorite garment. Each layer distinct, each critical. Camera cross-section view through all 21 layers simultaneously.
00:02–00:04.5 · THE HELMET
Speed 30%. The polycarbonate helmet shell forms — perfectly spherical, flawless. The visor assembly drops in: the gold-coated visor — 24-karat gold, 0.0002 inches thick — pressing over the outer shell. Camera macro on the gold surface: it reflects everything in warm gold — including us. The neck ring locks with a quarter-turn — the mechanism designed to never, under any circumstances, fail.
00:04.5–00:07 · PRESSURIZATION
Speed 8%. The most important moment. Air flowing into the suit — the pressure building to 3.75 psi. Camera inside the suit as it pressurizes: the fabric stiffening, the gloves expanding slightly. Every seal tested by the pressure itself. The suit becomes a world — a personal atmosphere, the only thing between a human being and the void.
00:07–00:10 · THE CLEAN ROOM
Speed 15%. The suited figure — complete — standing in the white clean room under brutal fluorescent light. Technicians moving around it in blurred background. The suit reads in extraordinary detail: the layers visible at the seams, the connector ports, the PLSS backpack life support system mounting points. A visor drops over the helmet. The astronaut disappears inside.
00:10–00:13 · TRANQUILITY BASE
Speed 3%. The surface of the Moon. The suit boot pressing into lunar regolith in ultra-slow motion — the print forming in dust that has not been disturbed in 4.5 billion years. Camera at surface level — the boot print in sharp focus, the horizon of the Moon in the background, the Earth hanging above it, the size of a marble. The suit in the background, the gold visor catching unfiltered solar light. The single most overwhelming flare of any film in this collection — total, white, sacred.
00:13–00:14.5 · THE REVEAL
Speed 1% — absolute stillness. The A7L floating in the void — the Earth behind it at distance. The gold visor reflects the Earth. The Earth reflected in the suit designed to walk on the Moon. Everything is contained in this one reflection.
00:14.5–00:15 · END CARD
Silence — total. Then a single radio crackle. NASA worm logo. "Worn once. Changed everything." No grain — this moment is too clear, too real. Hold. Fade.
Character tone:
high-end romantic comedy, deadpan flirtation, over-serious male lead, quick-witted female lead, cinematic realism, sweet-chaotic chemistry, every frame like a poster
Male lead:
bespoke black suit, white shirt collar slightly open, handsome and severe, powerful aura, trying very hard to look cold and dominant, but secretly nervous and flustered, tiny tells betray him: slightly crooked tie, tight jaw, faintly trembling fingertips
Female lead [@ Image1]:
fitted slip dress / refined Chanel-inspired set, long hair slightly messy, elegant and soft-looking but emotionally sharper than him, stubborn, dryly funny, outwardly cornered for a moment, then visibly unimpressed, holding back laughter
Action + expression changes:
the male lead forcefully steps in for a dramatic wall-pin pose, one hand braced on the wall, closing the distance too seriously, trying to look intense; his expression starts cold but gradually cracks into restrained embarrassment
the female lead [@ Image1] steps back once, eyes widening, then notices his crooked tie and trembling hand; her expression changes from guarded resistance to deadpan disbelief and almost-laughing annoyance
their noses nearly touch, breathing overlaps, the tension becomes playful and absurd instead of painful
the male lead lightly lifts her chin, trying to recover his cool image; the female lead stares at him like she is watching someone forget his own script
Dialogue:
Male lead: Are you done yet?
Female lead [@ Image1]: Fix your tie first.
Male lead: I am being serious.
Female lead [@ Image1]: Then stop stepping on my heel.
Male lead: ...That was deliberate.
Female lead [@ Image1]: Your shaking hand says otherwise.
Show me a never-before-seen, brand-new film still shot on ARRI Alexa, hyper-realistic film grain, LUT preset, key fill lighting, anamorphic lensing, shallow depth of field, 24mm
A young woman with a neutral expression walks slowly through a crowded train station. People move quickly around her, creating motion blur, while she remains in sharp focus. She wears a black hoodie, minimal makeup, natural lighting. The environment feels busy and slightly desaturated.
Camera shots:
Front tracking shot (camera moving backward as she walks forward)
Side profile shot with crowd passing in foreground
Overhead drone-like shot showing her surrounded by moving people
Close-up on her face with shallow depth of field
Rear follow shot as she walks into the crowd
Cinematic lighting, soft natural daylight, realistic color grading, 4K, shallow depth of field, motion blur, emotional tone, urban realism
PROMPT TEMPLATE:
Cinematic close up shot of [SUBJECT], naturalistic film lighting, soft diffusion, restrained earthy color grading with warm highlights and cool shadows, layered depth composition with foreground interest and vast backgrounds, realistic material surfaces and micro-detail textures, subtle film grain, balanced cinematic contrast, moody atmospheric perspective and haze.
SUBJECT EXAMPLES:
→ A sheriff in a long corndog costume on a classic white two-story farmhouse porch with pickup truck, vast cornfield
→ A burning classic white two-story farmhouse with porch and parked truck surrounded by vast cornfield
→ A woman with windswept reddish-brown hair trying to move forward through a long cornfield
→ Or write your own subject and be creative!
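One way to use the template above is to drop each subject into the [SUBJECT] slot programmatically. The following is only a minimal sketch: the names `TEMPLATE`, `SUBJECTS`, and `fill_template` are hypothetical helpers invented for illustration and do not belong to any image or video tool.

```python
# Hypothetical helper for illustration: fills the [SUBJECT] slot of the template.
TEMPLATE = (
    "Cinematic close up shot of [SUBJECT], naturalistic film lighting, "
    "soft diffusion, restrained earthy color grading with warm highlights "
    "and cool shadows, layered depth composition with foreground interest "
    "and vast backgrounds, realistic material surfaces and micro-detail "
    "textures, subtle film grain, balanced cinematic contrast, moody "
    "atmospheric perspective and haze."
)

# Subjects taken from the examples listed above.
SUBJECTS = [
    "a sheriff in a long corndog costume on a classic white two-story "
    "farmhouse porch with pickup truck, vast cornfield",
    "a burning classic white two-story farmhouse with porch and parked "
    "truck surrounded by vast cornfield",
    "a woman with windswept reddish-brown hair trying to move forward "
    "through a long cornfield",
]


def fill_template(subject: str, template: str = TEMPLATE) -> str:
    """Return the template with its single [SUBJECT] slot replaced."""
    subject = subject.strip()
    if not subject:
        raise ValueError("subject must not be empty")
    if template.count("[SUBJECT]") != 1:
        raise ValueError("template must contain exactly one [SUBJECT] slot")
    return template.replace("[SUBJECT]", subject)


prompts = [fill_template(s) for s in SUBJECTS]
for p in prompts:
    print(p)
```

Checking for exactly one slot guards against pasting a template whose placeholder was already filled or accidentally duplicated.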
High-end commercial photography of a (black angus burger) labeled "(NYC BURGER HOUSE)", centered on a (warm golden gradient background) with a premium fast-casual aesthetic. The burger features (perfect grill marks, melted cheese, perfectly cooked beef patty, toasted brioche buns) with realistic steam and oil shine.
Cinematic lighting, shallow focus, ultra-realistic textures.
[{"lang":"en","prompt":"Style & Mood: Gritty, Rough, Raw documentary realism. Handheld 16mm grain, blown-out cockpit glass flare, muted military greens and grays against a pale high-altitude sky. Authentic cockpit instrumentation, no stylization. Dynamic Description: Tight handheld close-up inside the cockpit — the pilot's gloved hand tightens on the stick, head snapping left toward threat, oxygen mask pulling with her breath. Smash cut to stabilized wide aerial shot — two real fighter jets carving across a pale blue sky, one banking hard to cut across the other's flight path, condensation streaming off wingtips in tight turns. Cut back to cockpit interior — her body pressing into the harness under g-load, instrument panel vibrating, warning tone audible. Handheld close-up: gloved fingers adjusting throttle, eyes fixed on HUD. Smash cut to external low-angle chase camera mounted near tail — her jet rolling hard left, the enemy aircraft visible above and behind, closing. She reverses. Enemy overshoots, crossing left-to-right ahead of her. Wide stabilized external shot: both aircraft now aligned, hers directly behind, HUD tone locking. Static Description: Real stratospheric sky, pale blue fading to white at horizon. Authentic military fighter jets, worn paint, panel lines, exhaust wash. Cockpit interior: functional, worn, oxygen hose, ejection handle visible. Audio: Pilot (over comms, oxygen mask muffled, flat and controlled):
a vertical transparent glass box filled with ocean waves lying on the surface of sea, hyper-realistic photography, the scene is rendered in hyper-realistic detail using octane render with a cinematic quality
Ultra-realistic cinematic timelapse, natural daylight progression from cool morning light to warm golden evening, adaptive static camera fixed in one elevated corner of the kitchen showing the full space with subtle focal adjustments for depth and parallax as the area transforms. Realistic movements of workers, tools, and materials. Interior kitchen renovation transformation.
[00:00–00:01]
Wide static shot of a completely outdated 1990s kitchen in early morning cool light. Old laminate countertops with visible wear, dark wooden cabinets with peeling finish, faded linoleum flooring, outdated appliances, cluttered and dingy appearance, no modern elements. Workers arrive carrying tools, demolition equipment, and material boxes. SFX: distant footsteps, door opening, light morning ambience.
[00:01–00:03]
Rapid demolition and preparation phase: workers swiftly remove old cabinets, countertops, flooring, and appliances at accelerated speed. Debris is cleared, walls are patched and primed, electrical and plumbing lines are updated and concealed. New subflooring and under-cabinet lighting prep begins. Sun rises, shadows shift noticeably. SFX: hammering, sawing, wheelbarrow rolling, muffled worker instructions, debris removal sounds.
[00:03–00:05]
Installation of core structures: sleek matte white or light gray shaker-style cabinets are mounted quickly, quartz or marble countertops are installed and sealed, new stainless steel appliances (modern fridge, oven, range) are placed. Backsplash tiles (subway or herringbone) are laid with precision. Fresh hardwood or luxury vinyl plank flooring is laid down. Midday brighter natural light fills the space, highlighting clean lines. SFX: drilling, caulking, tile setting sounds, appliance positioning, subtle water running for testing.
[00:05–00:07]
Finishing and detailing phase: modern hardware, under-cabinet and pendant lighting fixtures are added and illuminated, sink and faucet are installed with flowing water test, open shelving and decorative accents appear. A sleek kitchen island with bar stools materializes. Late afternoon golden light streams through windows, creating warm reflections on surfaces. SFX: softer tool sounds, light clicking of switches, water gently flowing, faint satisfying placement sounds.
[00:07–00:08]
Final reveal of the completed modern luxury kitchen in warm evening light. The space is now bright, clean, and inviting with minimalist design, gleaming surfaces, organized countertops featuring fresh herbs or simple decor, soft pendant lights on, subtle steam or natural warmth. Calm, aspirational atmosphere with gentle evening ambience and light background music.
Camera behavior:
Adaptive static timelapse camera — fixed elevated corner position with slight intelligent reframing and minor zoom adjustments to keep the entire transforming kitchen visible and maintain cinematic depth as cabinets, counters, and details fill the frame. Natural parallax, realistic perspective shifts, and smooth material transitions. No abrupt cuts.
Mood and aesthetics:
Hyper-realistic renovation timelapse, smooth accelerated transformation from dated and cluttered kitchen into a bright, modern, high-end minimalist retreat. Emphasis on material textures (wood grain, stone veining, metallic finishes), natural lighting changes, clean progress, and a sense of satisfying achievement. Highly detailed surfaces, realistic physics of installation, and organic worker movements.
Total duration: 8 seconds (about 7 seconds of active transformation, 1 second final cozy reveal). Cinematic color grading, ultra-realistic quality, natural light interaction with reflective surfaces.
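The timecodes above can be checked for gaps and for agreement with the stated 8-second total. The sketch below is a hypothetical sanity check (the names `SEGMENTS`, `to_seconds`, `parse_segment`, and `check_timeline` are invented for illustration), assuming the `mm:ss` format and the en-dash separator used in the segment headers.

```python
import re

# Timecodes copied from the bracketed segment headers above.
SEGMENTS = ["00:00–00:01", "00:01–00:03", "00:03–00:05", "00:05–00:07", "00:07–00:08"]


def to_seconds(stamp: str) -> int:
    """Convert an mm:ss stamp to whole seconds."""
    minutes, seconds = stamp.split(":")
    return int(minutes) * 60 + int(seconds)


def parse_segment(text: str) -> tuple[int, int]:
    """Split 'start–end' (en dash or hyphen) into start/end seconds."""
    start, end = re.split(r"[–-]", text)
    return to_seconds(start), to_seconds(end)


def check_timeline(segments: list[str], total: int) -> list[int]:
    """Verify segments start at 0, are contiguous, and end at `total`.

    Returns the duration of each segment in seconds.
    """
    spans = [parse_segment(s) for s in segments]
    if spans[0][0] != 0:
        raise ValueError("timeline must start at 00:00")
    for start, end in spans:
        if end <= start:
            raise ValueError(f"segment {start}-{end} has no positive length")
    for (_, prev_end), (next_start, _) in zip(spans, spans[1:]):
        if prev_end != next_start:
            raise ValueError(f"gap or overlap between {prev_end}s and {next_start}s")
    if spans[-1][1] != total:
        raise ValueError(f"timeline ends at {spans[-1][1]}s, expected {total}s")
    return [end - start for start, end in spans]


durations = check_timeline(SEGMENTS, total=8)
print(durations)  # [1, 2, 2, 2, 1]
```

The first four segments add up to 7 seconds of transformation and the last one is the 1-second reveal, which matches the stated split.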
Handheld shoulder-mounted camera, natural shake, slight autofocus breathing, no stabilization.
Opening frame: a quiet residential street in late afternoon, soft golden sunlight, long shadows across parked cars. Ambient sound: distant traffic, wind, faint birds.
The camera slowly walks forward along the sidewalk.
At the 2-second mark, a black SUV suddenly enters frame at high speed from the left — tires screeching, suspension compressing unevenly as it loses control.
The vehicle clips a parked red sedan — the impact is abrupt and messy, metal folding naturally, glass shattering outward in uneven fragments.
No slow motion — everything happens in real time.
The red car is pushed violently onto the curb, its front end crumpling with realistic deformation.
Airbags deploy with a muffled pop.
The camera operator instinctively steps back — slight stumble, frame dips, then recovers.
Smoke begins to rise from the SUV's engine bay.
Sound design: raw — crunch of metal, tire friction, glass, no cinematic exaggeration.
Final frame: both cars at rest, subtle ticking sounds, no dramatic music.
Style: documentary realism, imperfect framing, natural lighting, no visual effects, 4K.
A cinematic continuous shot of a man riding a massive dragon in flight, starting from start_frame. Camera follows from behind, slightly above, tight framing (partial wings only). The dragon glides smoothly over a vast mountain range: no excessive flapping, only one powerful wing beat, then a long controlled glide. Physics must feel real:
• wings flex under air pressure
• cloak and hair react naturally to wind
• body weight shifts subtly during motion
The dragon tilts into a controlled dive, back arching slightly, wings adjusting angle (not flapping). It passes naturally through existing cloud layers (no artificial clouds), creating realistic displacement. Style: ultra photorealistic, cinematic, natural lighting, grounded motion, no exaggerated animation.
Treat the first frame as the initial state and the second frame as the final visual reference. A realistic human hand releases and propels a miniature shop scale model forward with natural motion. The miniature travels through the air following real-world physics, then precisely reaches the original shop's exact position.
Upon impact with the ground, the full-size shop begins forming progressively, assembling section by section in a believable construction-style reveal. The transformation includes subtle dust dispersion, realistic interaction with the environment, and physically accurate shadows.
Lighting remains consistent and cinematic while still grounded in realism. Camera position, framing, perspective, and background alignment must remain perfectly locked and stable throughout the sequence.
The transition should be smooth and controlled, with no visual artifacts, no distortion, no stretching, and no instability. Motion, scale, and timing must feel natural and convincing, delivering ultra-realistic physical behavior.
Theme: Humorous miniature horse salon haircut transformation
Visuals: Professional pet salon setting with a ring light, close-up shots of a fluffy miniature horse wearing a black grooming cape, realistic grooming tools and techniques creating a comedic “bowl cut” hairstyle
Camera: Close-up macro shots, alternating between front view, side profile, and top-down angles
Style: High-key lighting, realistic, comedic pet content, ASMR grooming aesthetic
Action:
The human hand gently combs through the miniature horse’s messy mane and fluffy fur upward — soft brushing sounds + fluffy fur rustling
[cut]
Side profile: water mist sprays onto the miniature horse’s head, fur becomes damp and sleek — spray bottle sound + fine water misting
[cut]
Top-down view: scissors trim a straight line across the wet fur held by the comb — snipping sounds + precise cutting motion
[cut]
Front view: thinning shears rapidly texturize the front “bangs” — quick snipping sounds + fur thinning texture
[cut]
Side profile: hairdryer blows air, fluffy fur fluffs up and moves in the wind — dryer hum + soft whooshing
[cut]
Front view: final touch-ups with comb and scissors perfect the round bowl cut shape — gentle combing + tiny snips
[cut]
Final reveal: the miniature horse sits proudly with a perfect mushroom-shaped bowl cut, blinks and looks side-to-side at the camera — satisfied horse neigh
FORMAT: 15s / free rhythm / 1 MATCH CUT / CONTINUOUS MOVE UNTIL MATCH CUT + IMMEDIATE ACTION FROM FIRST FRAME
SUBJECTS: A lone sword-bearing woman in weathered fur and leather fights a massive polar bear with desperate, two-handed survival movement. The same woman is later revealed at home in loose indoor clothes, where a VR headset appears only after the match cut and is pulled off in one clear motion.
ENVIRONMENT: Frozen wilderness under hard daylight, wind dragging snow across blue-white ice, then a modest lived-in home reached through a precise visual match. Winter glare and visible breath give way to soft clutter, indoor daylight, and a faint game-lit glow.
MOOD: Visceral survival tension snaps into grounded reality without breaking physical continuity.
COLOR LOGIC: Naturalistic Film Print Emulation
TIMELINE:
0:00-0:07: One unbroken handheld move, WS collapsing into MCU as the woman backpedals across the ice and the bear launches through blowing snow. The camera runs beside the leap at eye level, 28mm shifting to 35mm, slightly unstable and close enough to keep both bodies heavy and readable. The bear closes fast while she plants, recoils, and keeps the blade between them. SFX: (howling wind, boots grinding ice, low animal roar, cloth strain, blade cutting air, snow scrape). Hard winter sun side-lights the ice and throws sharp blue shadows.
0:07-0:11: Same unbroken move, no cut, tightening into a dead-on CU as the bear surges into the last inches, claws near her shoulders, jaws filling the frame edge. Right in the middle of the attack, a man's voice calls, Karla... then sharper, KARLA. She answers with a tired off, and on that reaction the world drops into slow motion. Snow drifts almost still, the bear hangs in its strike, and only she keeps moving at normal speed as the camera orbits into her face. Bored, not afraid, she drops the sword and brings both empty hands toward her temples in one smooth interrupt gesture. No headset, visor, or device is visible in the frozen world. Stay continuous until the match cut, keeping the same face size, hand height, head angle, lens distance, and clockwise drift. SFX: (cloth strain building to near impact, a man's voice calling Karla... KARLA, her tired off, then stretched wind fading toward silence). Hard winter sun catches the slowed snow around her face.
0:11-0:15: MATCH CUT. CU to MS. Seamless mid-motion transition as her rising hands cross the same screen position and the frozen close-up becomes the home interior with the same framing and clockwise drift. The motion continues uninterrupted, and now a VR headset is visibly strapped over her eyes for the first time. She grips both sides, pulls it fully off her face, and the camera opens into a medium shot as she drops it above her forehead and steps into a small living room in loose home clothes. The handheld orbit continues, revealing couch edges, scattered blankets, and cold window light as her posture falls into mild annoyance. She turns toward the voice, rolls her eyes upward, and says, What is it. 35mm natural lens, spherical. SFX: (headset strap stretch, plastic rub, quiet room tone, socked foot scrape, faint game audio, her breath settling, her dry voice saying What is it). Indoor daylight replaces the winter contrast.
ROCKET SURF.
STYLE: Gritty Cine Verité, 35mm handheld, natural shake. Continuous tracking shot. No cuts. All real-time.
LIGHTING: Bright, high-altitude sun, pure blue sky.
AUDIO: Rocket engine roar, wind, fiberglass creak.
TIMELINE: 0-3s: Guy in jeans and a black t-shirt is barely holding on to the side of an active SpaceX rocket at 12,000 feet. The rocket is climbing. 3-7s: Hard zoom-in cut on his face. His hair is plastered straight back. The ground is falling away below. 7-12s: The rocket hits max Q. The whole booster shakes violently. He grips tightly, his knees absorb it perfectly. 12-15s: He pulls a beer can out of his hoodie pocket, cracks it open. Takes one sip, cheers and yells: "Worth it!". Hard cut.
QUALITY: 8K photorealistic, correct physics, fabric motion blur, no artifacts.
A realistic black helicopter slowly approaches from the top and hovers directly above the covered building. The helicopter stabilizes in the air, rotor blades spinning with natural motion blur and strong wind turbulence. The helicopter then attaches to the giant cloth cover and starts pulling it upward and sideways. The fabric reacts realistically: flapping, stretching, rippling, and flowing in the wind with natural folds. As the helicopter pulls harder, the cloth begins sliding off slowly, revealing the building facade step by step. The reveal is dramatic and satisfying, like a premium brand launch. The cloth keeps coming off gradually, exposing the full building structure underneath. Finally, the entire cloth clears the building and stays attached to the helicopter. The helicopter lifts the cloth and exits the frame smoothly. Final hero shot shows the fully revealed modern luxury building (same as the second reference image), crisp details, glass windows, clean architecture, cinematic lens flare, smooth camera movement, and premium commercial look. Ultra-realistic CGI, 4K, high dynamic range, cinematic color grading, smooth gimbal camera motion, depth of field, realistic lighting, dramatic but clean advertising style.
15-second continuous single-shot action sequence.
No cuts. No scene transitions.
Cinematic modern war realism.
Color palette: desaturated tones, dust beige, concrete grey, warm muzzle flashes.
Scene:
War-torn urban street. Destroyed buildings, debris everywhere, smoke drifting.
0–3s — tension build
Camera handheld, low behind a group of soldiers moving cautiously along a wall.
Breathing audible. Dust in the air.
3–6s — ignition
Sudden gunfire from a window.
Bullets impact concrete. Debris bursts outward.
Camera ducks with soldiers.
6–10s — chaos
Soldiers return fire. One throws smoke grenade.
Camera moves through smoke with them as they push forward.
10–13s — escalation
Explosion down the street. Shockwave hits.
Camera briefly loses balance, recovers.
13–15s — final frame
Soldier signals forward.
Camera holds on his face — focused, tense. Freeze.
global_settings:
  style: "Realistic kitchen scene, high-fidelity"
  perspective: "First-person POV (18-year-old male)"
  character: "21-year-old elegant woman (strictly maintain Image 1 features/art style)"
audio:
  voice: "Mature/Onee-san female voice, gentle tone"
  dialogue: "好吃~ (Haochi~)"
  ambient: "Sizzling frying pan, soft natural laughter"
  music: "None (Silence except environment)"
technical: "Continuous POV, no cuts, no watermarks, no text, 10 seconds"
scene_setup:
  location: "Kitchen, standing by the stove"
  action: "Woman in an apron is frying an egg; I approach her from behind"
storyboard_sequence:
  0_3s: "Camera moves closer to her back as she cooks. A hand enters from the bottom frame, picks a fruit from a bowl, and holds it to her lips."
  3_6s: "She glances sideways, smiles, bites the fruit, and says '好吃~' (Haochi~) with a satisfied, mature tone."
  6_8s: "She turns fully to the lens (looking at 'me'). A hand reaches out to wipe a droplet of juice from her lip. Her eyes curve into warm crescent moons."
  8_10s: "She turns back to the stove to flip the egg. Shot lingers on her busy back and the rising steam from the pan."
negative_constraints:
  - "Low quality, blurry, laggy, clipping, deformed anatomy"
  - "Immature appearance, non-POV, background music, text overlays"
  - "Missing feeding action, robotic movements"
The scene unfolds under the bright midday sun, where a vibrant group of Indian men and women from the USA gather around a crackling fire. Each individual embodies distinct features and attire, clearly differentiating the men from the women. Their tribal dance movements are uniquely expressive, with the men showcasing powerful, grounded motions while the women flow gracefully with fluid, rhythmic gestures. The camera captures sweeping wide shots to reveal the full circle of dancers, interspersed with close-ups that highlight intense expressions and intricate footwork. Enhanced with Hollywood-level effects, flickering flames and swirling dust create an epic atmosphere, immersing viewers in the raw energy and cultural richness of this authentic celebration. The scene pulses with life, blending realism and cinematic grandeur seamlessly.
10-second cinematic commercial, photorealistic, 16:9, luxury food advertisement style.
Scene: a warm, cozy breakfast kitchen nook with rustic wooden table, beige walls, hanging plants, ceramic mugs, and soft white curtains glowing in golden morning sunlight.
A Nutella jar with official Nutella logo clearly visible and readable on the label sits at the center of the table, realistic packaging, sharp branding, no distortion.
0–2s: cinematic close low-angle orbital shot, the Nutella jar vibrates slightly, lid pops open, thick glossy chocolate bursts upward in slow motion, shining in warm sunlight.
2–5s: slow-motion food explosion — swirling chocolate ribbons, roasted hazelnuts spinning, toasted bread slices flying, sliced bananas and strawberries floating, honey droplets sparkling, cocoa powder mist drifting in the light, ultra realistic food physics, depth of field, macro detail.
5–7s: camera transitions smoothly to overhead top-down commercial shot, knife spreads Nutella on toast in mid-air, glass of milk and hot coffee float into frame, cinematic product ad lighting, soft glow highlights.
7–10s: ingredients assemble perfectly into a beautiful Nutella breakfast board, jar placed heroically beside the food with logo facing camera, chocolate shining, steam rising from toast, final hazelnut rolls to stop near the jar, commercial ending shot, product focus, high-end advertisement look.
ultra realistic, cinematic lighting, food commercial, product commercial, film quality, slow motion, shallow depth of field, global illumination, high detail, realistic textures, smooth motion, brand logo visible, no text overlay, no subtitles.
FORMAT: 15s / ONE CONTINUOUS SHOT
SUBJECTS: An alluring, highly attractive female figure. She wears a highly detailed office-style pleated mini skirt and a plunging white blouse, with visible fabric textures, skin pores, and faint perspiration.
ENVIRONMENT: A brightly lit convention floor. The background is a blur of neon booth lights and passing silhouettes, heavily grounded in realistic textures.
MOOD: Starts as an observational and intimate showcase, twisting sharply into jarring psychological terror.
COLOR LOGIC: Naturalistic Film Print Emulation
TIMELINE:
0:00-0:07: MS. Camera begins at a low side angle, observing her in profile with one bare foot planted fully on the floor and the other bare foot delicately angled on its tiptoes. It slowly pedestals and arcs, admiring her shapely legs and the pleated office mini skirt as she shifts her weight slightly. 50mm lens, shallow depth of field. SFX: (muffled crowd ambience, close fabric rustling).
0:07-0:12: MCU. The continuous movement glides up her plunging white blouse as the arc completes, arriving squarely in front of her. The camera settles precisely at her chin, keeping her full face just out of frame. 50mm lens, creeping push-in. SFX: (room tone fades out, low frequency rumble builds).
0:12-0:15: CU. Without cutting, her soft smile shudders and distorts, her flesh smoothly and instantly twisting into a pale, ghastly supernatural face with wet dark seams. She opens her mouth impossibly wide and extends a long, glistening tongue directly at the camera. 50mm lens, macro close focus. SFX: (sudden dead silence, followed by a visceral wet sound and a harsh audio glitch).
SHOT1
Tight medium two-shot inside a dim apartment living room at night, warm practical lamp casting soft shadows across worn furniture.
A woman in her early 30s, pale skin, slightly messy hair, eyes red from holding back tears, stands rigid with arms crossed, shoulders tense. Facing her, a man in his mid-30s, unshaven, paces slowly, unable to stay still, avoiding eye contact, jaw tight. The camera slowly pushes in, capturing the growing silence between them.
Woman (voice low, trembling but controlled):
"Say it. Don't walk around it… just say it."
SHOT2
Close-up on the man, warm light cutting across half his face, the other side falling into shadow.
His breathing is uneven. His eyes flicker with guilt, fear, resistance. He swallows, his lips part, and he holds still: no escape.
Man (quiet, struggling):
"…You already know." A beat. He finally looks directly at her, tension peaking.
SHOT3
Extreme close-up on the woman, eyes glossy, one tear forming but not falling yet. Her expression collapses inward — not loud, but devastatingly controlled. She nods slowly, lips trembling, trying to stay in control. Camera pushes even closer, isolating her face from the background.
Woman (almost whispering, breaking):
"No… I need to hear you ruin it." Silence fills the room.
Tracking shot at street level, the camera races through a crowded city avenue as the ground begins to violently crack from an earthquake. Cars tilt, buildings split open. The camera weaves between falling debris, then tilts up as a massive skyscraper collapses forward. The shot pulls back rapidly as the shockwave chases the camera, swallowing everything in dust.
This hyper-realistic urban disaster special effects film, shot with an Arri Alexa 65 camera, utilizes high-contrast lighting to create a raw, textured atmosphere, three-dimensional smoke, and a chaotic, apocalyptic rhythm.
S1: A low-angle wide-angle tracking shot, filmed from a crowded street upwards, shows a gigantic, scaly snake tightly coiled around the Taipei 101 glass skyscraper, shattering its windows.
S2: A close-up slides along the snake's thick scales, which rub against the building's steel structure, sparking and scattering debris.
S3: A high-angle drone shot circles the top of the building, showing the snake roaring into the sky while a military helicopter fires missiles at its flanks.
S4: A wide-angle shot shows a violent, multi-level explosion in the middle of the skyscraper, the snake engulfed in flames and thick black smoke.
Realistic handheld footage of a MacBook Pro screen filling most of the frame, showing a Zoom meeting window with only one young woman in a tidy bedroom, attending a formal meeting from home. She wears a dark blazer and looks professional from the waist up. The room is bright, natural, and believable. The shot should preserve realistic screen reflections, subtle moiré pixel texture, tiny dust on the glass, and slight handheld camera shake.
After a brief moment, she hears a noise from the door offscreen. She glances to the side, slightly startled, then quickly stands up and starts walking away from her chair to answer it. Because the camera is filming the laptop screen, we see her moving inside the Zoom window. Halfway to the door, she suddenly freezes, looks down, and realizes she is only wearing underwear on her lower body. Her expression instantly shifts to embarrassment and panic as she remembers that her Zoom camera is still on. She spins around and rushes back toward the screen in a frantic, awkward, comedic way. She quickly returns to the laptop and blocks the camera with both hands or throws herself in front of it, covering the lens and ending the shot in chaotic close-up.
The tone is realistic and comedic, with strong contrast between formal upper-body business attire and the accidental lower-body mistake. Emphasize awkward humor, authentic facial acting, natural body motion, realistic indoor lighting, handheld movement, slight motion blur, and believable Zoom-call visuals. Keep it non-explicit: no nudity, no revealing details, no erotic framing, no vulgarity. The focus is on embarrassment, urgency, and comedy.
{
"description": "A cinematic scene opens with an ultra-wide view of a sunlit coffee plantation at golden hour. Hundreds of roasted coffee beans lie scattered across the plantation rows. Slowly, the beans lift into the air, carried by a warm breeze. They rise and swirl gracefully in controlled, slow-motion spirals as the camera gently floats upward with them. The swirling beans interlock mid-air, forming the precise silhouette of a premium coffee jar. The shape pulses subtly once, then smoothly transforms into a real Nescafé coffee jar hovering above the plantation. The camera pushes in closer, revealing the Nescafé branding sharply in focus on the label, clean and clearly readable, with natural reflections and subtle highlights emphasizing the logo. No text.",
"style": "cinematic, hyper-realistic premium coffee commercial",
"camera": "smooth cinematic camera that rises with the beans, then transitions into a slow, steady push-in for a brand-forward product close-up",
"lighting": "warm golden-hour sunlight with glowing highlights on coffee beans and soft rim lighting around the jar; controlled reflections to keep Nescafé branding clear and legible",
"environment": "open hillside coffee plantation with neat rows of coffee plants, soft shadows, warm breeze, and a calm natural atmosphere",
"motion": "slow, elegant motion throughout; beans lifting and swirling in slow motion, followed by a stable hover and minimal camera movement to emphasize branding clarity",
"ending": "a hero product shot with the Nescafé coffee jar centered in frame, floating calmly; the label and Nescafé logo are crisp, front-facing, and fully readable, conveying premium quality and brand confidence",
"tone": "natural, sophisticated, premium elegance",
"color_palette": "rich browns, deep blacks, warm amber golds, and earthy greens",
"duration": "10 seconds",
"aspect_ratio": "16:9",
"text": "none",
"keywords": [
"Nescafé branding",
"logo clarity",
"hero product shot",
"premium coffee commercial",
"cinematic slow motion",
"hyper-realistic",
"no text"
]
}
Use 🩵Image 1 as the first frame, referencing the character design, outfit color palette, and overall visual style of 🩵Image 1. The girl is performing a high-speed downhill skateboard ride on a winding suburban mountain road. The shot uses a Steadicam follow perspective, with an intense sense of speed throughout. The powerful wind generated by the fast ride makes her hair and clothing whip violently in the air.
At the beginning, the girl pushes off with one foot to gain speed, then lowers her body to reduce wind resistance and continues accelerating. The scene features heavy motion blur to emphasize the extreme speed of the skateboard. While riding, she repeatedly shifts her center of gravity downward and leans left and right through multiple turns on the road. As she carves into the corners, the arm on the inside of the turn lowers as if lightly trying to touch the ground. On straight sections, she bends forward, keeps her knees low, and places both hands behind her back to minimize drag.
In the distance, fireworks are going off above a seaside town, while a passenger airplane flies across the sky. The overall visual style should be ultra-realistic, with highly lifelike image quality and realistic photographic cinematography.
No background music, only environmental sound design.
{ "duration": "10s", "aspect_ratio": "9:16", "style": "ultra-realistic smartphone video, iPhone camera look, natural lighting, slight handheld micro-shake, HDR, realistic colors, minimal cinematic grading", "camera": "handheld close-up shot, subtle natural shake, slight auto-focus breathing typical of smartphone cameras", "scene": "A casual indoor setting with a grey leather sofa. Three tiny animals sit in a row: a fluffy guinea pig (left), a small tabby kitten (center), and a tiny baby rabbit (right). The scene feels like a normal home video, slightly imperfect framing, natural daylight coming from a nearby window.", "action": "A human hand enters from the right, forming a playful finger gun. The person casually says 'Pew!' in a normal tone. The guinea pig tips over onto its side in a playful, exaggerated way. The hand moves to the tabby kitten, says 'Pew!' again, and the kitten flops sideways gently. Then the hand points at the baby rabbit. The rabbit stays still, staring directly at the camera with a stubborn expression. A short awkward pause. The person says in English, slightly amused, 'Come on… really?' The rabbit continues staring for a moment, then slowly and reluctantly tips over onto its side.", "details": "Realistic imperfections: slight motion blur, natural shadows, tiny exposure shifts, autofocus adjustments. Animals have subtle breathing, blinking, and small movements. Fur detail is realistic but not overly sharpened.", "audio": "casual room ambience, slight background noise, natural voice recording from phone mic, soft 'Pew!' sounds, followed by 'Come on… really?' in English", "mood": "funny, candid, wholesome, viral social media style"}
Luxury executive office, tight cinematic close-up of two professionals discussing company strategy and client deliverables. Natural corporate conversation, realistic lip sync, subtle hand gestures, confident body language, soft daylight, shallow depth of field, slow push-in.
People sitting in an office, working.
[cut] Close-up shot of a man typing on a keyboard.
[cut] Close-up shot of a screen with MS Windows, with words being typed.
[cut] Close-up shot of a man writing in a notebook with a pen.
[cut] Close-up shot of a woman typing on a laptop.
Train passing by in front of the house.
[cut] Close-up shot of train wheels as it rides.
[cut] Close-up shot of the smoke coming out of the chimney.
No music, no talking.
People walking in a train station.
[cut] Different shots of people walking with suitcases.
[cut] Close-up shot of the schedule screen as it changes.
No music, no talking.
An enormous wide aerial reveals a frozen polar world, with a tiny rescue helicopter crossing a giant cracked ice shelf surrounded by jagged blue glaciers and black arctic sea. The scale feels impossible and ominous. The camera dives violently from high above and races alongside the helicopter as its blades hammer through snow mist. It cuts tight around the cockpit, drops below the skids, then tracks behind as the ice beneath begins to split open into massive glowing blue chasms. Giant slabs tilt and collapse into freezing water while the helicopter threads between exploding ice towers and whiteout spray. The climax: it emerges through the chaos into a hidden circular crater filled with glowing turquoise meltwater and a giant ancient ship frozen perfectly beneath the transparent ice.
Ultra-wide aerial establishing shot: A convoy of cargo trucks winds along a narrow mountain road cut into a steep slope above a vast jungle valley shrouded in mist, emphasizing how small and exposed the vehicles are against the massive landscape.
Detail shot of the hillside: Loose stones begin bouncing down from the soaked slope above the road, cracks split through the mud, and several trees lean at unnatural angles as the ground starts to shift.
Wide high-angle disaster shot: The mountainside suddenly collapses in a violent landslide, releasing a huge wave of mud, rocks, and uprooted trees that crashes downhill toward the convoy with rapidly increasing force.
Chaotic action shot near the road: Drivers slam on the brakes and trucks jackknife as the debris flood engulfs the road, swallowing vehicles under churning mud, shattered timber, and falling boulders.
A poised, mature woman of about 26 with long black hair, straight or in loose large waves, bold red lips, and Korean-style influencer eye makeup, with a cold, regal temperament. She wears a classic navy blue one-piece school-style swimsuit (conservative cut, white trim, fitted, no revealing design) and stands by a bright indoor pool or in the shallow end. Background: blue sky and white clouds, rippling water, sunlit highlight reflections, and a light mist for a dreamy summer atmosphere. She looks straight at the camera with confidence and a hint of a regal smile in her eyes. She performs a solo hand-gesture dance timed precisely to the beat of the BGM: elegant pointing gestures with both hands in sync with the drum hits, slight shoulder shrugs that land firmly on the beat, calm waist twists left and right with small, restrained hip circles, and steps on tiptoe or in high heels that hit each beat. Her body moves in soft waves with the cheerful rhythm but stays full of strength. Occasionally she lifts her hair off her neck with one hand, winks with one hand at her cheek, or crosses her arms over her chest in a queen pose. The movement is silky and refined, the rhythm exact, and the aura strong, like a mature queen dancing alone by the pool. Camera: slow medium-shot push-in, cuts to close-ups of face and hands, low-angle shots from behind to emphasize figure and presence, with cuts synced to the beat. Sunlight highlights reflect off the swimsuit and water droplets splash up; soft pink and blue light, dreamy but cold and high-end. Style: popular TikTok/Douyin solo hand-gesture dance synced to upbeat BGM, elegant confident queen vibe, classy and powerful gestures without cute overload, smooth fluid motion, highly detailed, realistic, 8K, 15-second seamless loop matched to the music rhythm.
A cinematic, ultra-high-definition photograph of a young woman with wavy brown hair and fair skin standing still in the middle of a busy urban street crowd. She is centered in the frame, looking directly into the camera with a calm, introspective, slightly melancholic expression. The crowd around her is moving and blurred with shallow depth of field and soft bokeh lights in the background. Warm orange and teal color grading, soft natural lighting, realistic skin texture, 85mm lens look, f/1.8 depth of field, high contrast, cinematic atmosphere, sharp focus on subject, background motion blur, ultra-realistic, 8K, high detail, film photography style
Driver first-person cockpit POV, hyper-detailed racing gloves gripping carbon fiber steering wheel, realistic cockpit lens with natural peripheral distortion, heel-toe downshift drops revs sharply into tight wet corner, rear steps out into oversteer and hands correct fast, aggressive overtake launches through heavy rain spray from rival, stormy late afternoon light with real-time wet asphalt reflections, storm gray and brake light red and asphalt black, rain droplets streaking windshield with physically accurate water displacement, engine resonance dominant over rain impact, minimal tension layer punctuated by precise gear shift clack and turbo spool hiss.
Cinematic high-adrenaline supercross sequence in Unreal Engine 5 photorealistic style, third-person low chase cam tightly following young athletic woman in white and neon kit on 250cc machine skimming the top of a 200-foot whoops section at full throttle, body standing and absorbing with legs pumping like pistons, front wheel skipping across whoop peaks with violent chassis oscillation threatening to throw her, sudden transition to a 180-degree bowl berm taken at full tilt with outside boot skimming dirt, rhythm lane triple launched with aggressive scrub technique keeping the bike flat and fast, roost explosion off the berm catching arena lighting in amber particle arcs, camera swinging wide overhead to reveal the full Anaheim stadium layout before slamming back to ground level, hyper-detailed chassis oscillation and suspension bottoming physics, dirt haze thick under stadium lights, 4K 60fps 16:9 seamless 15-20 second loop; synchronized heavy metal music, whoop frequency vibration locked into bass layer, riff cadence matching suspension rhythm, massive breakdown on berm exit acceleration, double-kick drum on triple landing, arena crowd roar surging on stadium reveal, escalating tempo peaking at finish line.
Epic long shot: An offshore earthquake triggers a towering tsunami wall. Camera starts in satellite view revealing the wave’s terrifying arc, then drops to a forward-facing ground perspective on a coastal highway. The motorcycle rider appears as a small figure in the distance and charges straight toward the lens, growing rapidly larger as the tsunami races behind them. The camera holds position ahead of the rider (front/three-quarter frontal view), letting the wall of water loom in the background. In the final beats, the rider crests the top level and exits inland onto dry, elevated streets, leaving the surge contained beneath and behind, escaping into safe land.
An ultra-wide panoramic shot of a vast desert highway under a blazing sun, heat haze distorting the distant horizon. A tiny vehicle silhouette barely visible in the distance. The camera slowly drifts laterally before suddenly surging forward inches above the asphalt at extreme speed. The lens compresses aggressively, pulling the distant vehicle visually closer as dust kicks up around the frame. The camera overtakes roadside signs in a blur, then violently crash-zooms into the exposed engine block of a muscle car as it ignites with a deep mechanical roar.
15-Second Cinematic Sequence Prompt
Scene Setting:
An industrial wasteland — collapsed overpasses, twisted steel rebar, fractured concrete, ash-green smoke drifting through the air.
Visual Style: Dark bio-punk × brutal hyper-realism.
Lighting: Cold cyan overhead searchlights cut through the smoke, clashing with molten lava-red light glowing from cracks in the ground.
Key Textures:
The Green Giant's skin is rough like weathered stone sculpture, veins bulging like underground pipelines. The black symbiote substance is thick and oily, reflecting light like liquid metal. During fusion, the asphalt boils with bubbles, and moss-green bioluminescence pulses beneath the skin.
0-2s — Environmental Establishment
Aerial overhead shot. The Green Giant kneels among shattered concrete slabs, back muscles rising like granite mountain ridges. Sound design: distant broken alarm sirens, thick liquid dripping nearby.
3-5s — The Invasion Begins
Extreme close-up. A black viscous mass seeps from the shadows of exposed rebar and coils around his ankle. On contact, the green skin rapidly corrodes into honeycomb-like cavities, their edges glowing molten red.
6-8s — Symbiotic Frenzy
Orbiting camera move. The black fluid spirals up his leg as the green muscles swell violently in stress response.
Key visual contrast: Right leg retains rough green stone-like skin but is veined with asphalt-like cracks. Left leg is fully consumed by black keratin armor; the knee mutates into a reversed joint.
9-11s — Body Reconstruction
Rapid dynamic cuts. The spine snaps outward with a sickening crack, erupting into seven asymmetrical bone spikes dripping black tar. The chest splits open into three breathing vents — inhaling releases green mist, exhaling spills black sludge. Left arm remains a massive stone-like fist. Right arm explodes at the elbow into a writhing cluster of whip-like tendrils.
12-14s — Cranial Fusion (Extreme Close-Up)
The black substance pours into his ears as veins throb violently at his temples. His jaw splits horizontally toward the earlobes, forming a grotesque oversized maw. Black-green saliva leaks from within. His eye sockets fill with darkness — then ignite deep inside with twin sulfur-green flames.
15s — Violent Freeze Frame
Impact shot. The hybrid creature snaps its head upward. The carotid artery becomes semi-transparent — black fluid and green blood visibly colliding and surging beneath the skin. Final frame: a colossal fist smashes toward the camera, its surface cracked like dried earth, molten green light glowing from within. The screen cuts to black on impact.
Diving into autumn leaves spiraling in a forest clearing
The camera spins through falling leaves caught in a sudden gust. Their motion synchronizes into "TURN" before scattering across the forest floor.
Seasonal, rhythmic, cinematic.
A chaotic food fight erupting inside a crowded restaurant, captured through at least 10 cinematic shots with dramatic slow motion sequences, as dishes and food slam into people’s faces and explode on impact, splattering everywhere in vivid detail.
First-person shooter: The camera focuses on the character’s hands gripping a World War II-era rifle, muddy and bloodstained. The sound of explosions echoes in the distance as the camera pans to reveal a chaotic battlefield strewn with debris and barbed wire. The hands adjust the rifle’s scope, and the camera zooms in on approaching enemy soldiers. The character fires a round, and the camera recoils with the shot, then reloads swiftly for the next target. Gritty, intense, historically immersive.
Prompt: Fast-paced FPV cinematic flying through a hyper-realistic, cozy treehouse in a dense forest at golden hour, sweeping through bridges, staircases, interiors with warm lights, and exiting through a skylight, with dramatic camera motion
Prompt: Close-up on her hands gripping the paintbrush, knuckles white, paint dripping down her wrist. Shallow depth of field, the blank canvas a soft white blur behind her. She exhales. [cut] Wide shot from behind her, 24mm low angle. She winds back and hurls paint at the canvas. Neon colours explode across the white surface, splattering the walls and floor. The force of the throw carries her forward a step. [cut] Slow push-in, 50mm, tight on the canvas. The paint swirls and morphs into a photorealistic mountain landscape. Detail sharpens as the camera creeps closer. Faint ambient hum builds. [cut] Quick whip pan left to the cat on the paint can. 35mm, eye level. The cat stares directly into camera. Blinks once. Yawns. [cut] Medium shot, 40mm. She turns to camera and shrugs. Freeze frame.
Camera locked in cockpit POV, starts at driver eye-level 45° downward angle capturing wheel and dash instruments. Shot begins at normal cinematic 24fps tempo: gentle vibration as engine idles (subtle 2-3 pixel shake), RPM gauge needle twitching at 3000. BEAT 1 (0-2s): Sudden launch, camera jerks back violently as G-force hits, speedometer needle sweeps right, outside world smears into teal-orange light trails, maintaining real-time speed to build tension. BEAT 2 (2-4s): Approaching apex turn, hands rotate wheel clockwise 180°, entire frame tilts 35° right in visceral lean, shift to 60fps slow-motion as rear end breaks loose, golden hour sunlight strobes through pillars creating rhythmic light-dark-light-dark pattern across driver's face, tire smoke billows past side windows in cotton-candy wisps. BEAT 3 (4-6s): Drift exit acceleration, return to 24fps, wheel straightens with mechanical precision, tachometer redlines at 8500 RPM, rival headlights loom larger in rearview mirror (practical lighting reflecting off driver's eyes), rubber particles and asphalt dust swirl in vortex patterns. Physics: realistic hand micro-adjustments, dashboard reflections tracking light sources, cloth firesuit rippling from AC vent, hydraulic suspension compression visible in frame bounce. Emotional arc: calm focus → explosive action → controlled chaos mastery. Platform optimization: high contrast for mobile screens, centered composition for vertical crop safety.
The most intimate camera technique in cinema. No cranes, no gimbals — just a human following another human.
Handheld tracking puts you IN the scene. The natural sway, the breath of the operator, the imperfect motion: it's what separates cinematic from clinical. Three positions, three feelings, infinite possibilities.
Handheld tracking shot following [SUBJECT] moving through [ENVIRONMENT]. Camera positioned at [HEIGHT/POSITION], maintaining constant distance. The movement is [PACE] with natural operator sway. [SUBJECT ACTION] while the camera stays locked on them. [BACKGROUND ELEMENTS] blur past in the periphery. Shallow depth of field, [LIGHTING], organic handheld motion, cinematic intimacy.
Low quality smartphone vlog footage, shaky handheld camera POV. A young, ultra-beautiful girl with cool white skin and an innocent yet seductive aura is walking ahead on a wet street on a rainy night, wearing an oversized vintage chunky winter sweater. Colorful neon lights reflect on the wet pavement. High ISO noise, dynamic motion blur. She suddenly stops, turns her head back to look directly at the camera with a delayed autofocus effect, giving a casual, sweet, and slightly shy smile. Her loosely semi-tied long hair is slightly wet from the drizzle. She then steps very close to the lens, playfully tucking a wet strand of hair behind her ear. Amateur framing, lens flares, heavy video grain, raw and unpolished snapshot aesthetic throughout the entire continuous shot.
A Formula car screams through a rain-soaked street circuit at night. Camera locked directly behind at diffuser height as the car rockets forward, snapping through chicanes and tight hairpins. Wet barriers blur into neon streaks, spray plumes erupt under floodlights, violent direction changes at race pace. Heavy motion blur on environment, crisp focus on rear wing and diffuser, intense vibration and instability.
Selfie style vertical video, young woman 22 years old with South Asian features, slightly messy dark brown hair, casual hoodie, natural lighting in bedroom, handheld camera, authentic influencer tone, slightly messy background, holding sleek wireless earbuds case branded “PulsePods”, natural micro expressions, realistic blinking, subtle tension in eyes, documentary realism.
Live-action cinematic scene. The woman descends the staircase slowly. Camera tilts up from her feet to her face. She stops mid-stairs and looks at a portrait. Cut to the portrait: a woman who looks exactly like her. Cut to the woman. She gasps and says: "Impossible...". Thunder crashes outside. The candles blow out one by one. Only her silhouette remains.
Podcast studio setup, professional microphone visible, warm cinematic lighting, shallow depth of field, young South Asian woman 22 years old, slightly serious expression, controlled posture, subtle dark circles under eyes, dramatic but realistic lighting, natural facial micro expressions, high detail skin texture
Outdoor street interview style, daytime natural lighting, handheld camera movement, young South Asian woman 22 years old, dark hoodie, minimal makeup, realistic skin texture, natural traffic background, documentary tone, subtle tension in expression, shallow depth of field, natural micro expressions
Time freezes mid-explosion in a downtown street, debris suspended, camera orbits around a motionless agent in mid-jump — high contrast, photorealistic, 4K hyper detail.
2.35:1, 24fps, 15s, single continuous shot, 8K, large-format photoreal, crisp cloud volumetrics, realistic turbine scale, condensation vortices, clean motion blur, no UI. Open high with a rapid circling orbit above a thick cloud sea—wind turbines pierce through like giant needles. A futuristic VTOL glider (sleek white fuselage, faint amber nav glow) darts between turbine towers, leaving a thin condensation thread that flickers in the cold air. The camera maintains a fast orbit while dropping altitude—wide for scale, then a violent swoop close enough to feel blade-tip wind, then back out—each pass revealing the glider carving impossible but believable lines through open air. On a tight pass, snap into a steep top-down lock for a beat: the glider passes between two turbines, and the blade-tip vortices briefly braid the clouds into twisting ropes. Whip the orbit lower into a diagonal chase, skimming along the cloud tops; the glider’s downwash dimples the cloud surface like soft foam. Slingshot ahead into a head-on as it surges toward camera, then whip into a close side chase where condensation forms a thin ribbon hugging the wing. Dive tighter to the rear flow: the wake pulls cloud wisps into a circular vortex ring. Sudden decision: the pilot cuts directly through a turbine wake—condensation snaps into a clean ring that expands like smoke. The camera spears through the ring and rockets upward into the final fastest orbit: climb hard as the cloud deck parts to reveal coastline far below, and a brief rainbow arc forms in the mist, framing the glider as a bright needle in the sky. No text, no logos
Top-down perspective: crystal-clear turquoise seawater gently washes against a pristine, fine white sandy beach. Sunlight creates shimmering refracted light patterns in the shallow water, as soft waves spread delicately across the shore and slowly recede. Realistic water physics, natural sunlight, high dynamic range, 4K cinematic quality.
The camera drifts smoothly and slowly, aerial drone viewpoint, rich in detail with crisp water surface textures, ultra-realistic style, serene summer atmosphere.
As the waves gradually retreat, elegant handwritten lettering slowly emerges from the sand, as if gently unveiled by the tide, seamlessly blending into the scene.
A wide moving shot glides across a suburban house at night, flames visible through the windows and smoke pouring into the sky. Without cutting, the camera surges forward in aggressive FPV mode, smashing through a side window into thick smoke. It weaves violently through collapsing furniture, embers flying past the lens. The camera drops low under falling ceiling beams, heat distortion warping the frame. It spirals up a stairwell as sparks rain downward, bursts through a bedroom door, and ends in a tight crash-lock close-up on a firefighter’s breathing mask visor reflecting the surrounding flames.
Exterior glide → window breach → smoke weave → stair spiral → visor lock
Survival chaos
Volumetric smoke, ember particles, heat haze distortion
A sequence of 4 cinematic shots in a lush forest:
CUT1: Close-up: Sunlight hitting clover leaves on the forest floor, leaves rustling gently in the breeze.
CUT2: Low angle: Looking up at the dense green canopy, branches swaying, sunbeams flickering through moving leaves.
CUT3: Mid shot: A fence surrounded by thick bushes, dappled light dancing on vibrant greenery as wind blows through.
CUT4: Artistic shot: Soft silhouette of a person on the ground, framed by shadows of leaves fluttering rapidly in the wind.
All shots feature handheld movement, Fujifilm film aesthetic, dreamlike lens flares, 4K, high dynamic range.
Dimly lit art gallery after hours. Shot 1 (0-7s): Security guard walks past a 19th-century oil portrait of a woman. Shot 2 (7-15s): Slow push-in on the painting — the woman’s eyes follow him, then she slowly raises a finger to her lips in the painting while the real guard’s mouth is forced shut by invisible hands. Canvas texture hyper-real, oil paint moving like skin. Audio: guard’s muffled screams, wet paint sounds, gallery echo.
Prompt 1: A woman at a perfume presentation in a conference hall. Framing: medium waist shot, direct light, symmetry. Emotion: quiet confidence, gaze straight into the camera. Her hand movements add emotion to her words. She says:
"Scent is the only thing that can stop a stranger in their tracks. No words. Just one second."
Prompt 2: A woman at a perfume presentation in a conference hall. Framing: close-up — a hand with perfume rises toward the face. Emotion: intimate, as if sharing a secret. Movement: slowly brings the perfume to her nose, eyes half-closed, a visible inhale. After breathing in the scent she says:
"Raspberry in cold, smoky haze."
She pauses to smell the perfume once more, then continues:
"The berry isn't sweet — it's sharp, bold, a little dangerous."
Prompt 3: A woman at a perfume presentation in a conference hall. Framing: medium waist shot, direct light, symmetry. Emotion: sincere, warm — as if speaking only to you. Movement: body leans slightly forward toward the camera, hand movements add emotion to her words. She says:
"This isn't a perfume you wear to blend in. This is the one you wear to be remembered."
Movie trailer, 15 seconds, Wes Anderson-inspired symmetry, centered composition, pastel palette, storybook production design; dark-comedy drama about an AI quietly controlling humans; fast cuts; every actor breaks the fourth wall and speaks directly to the viewer ("you"), deadpan delivery, eye contact with lens; 4K 24fps, soft diffused light, gentle film grain, crisp optics, precise blocking, whimsical yet unsettling tone; no subtitles/UI/watermarks.
Shot list (fast cuts within 15s):
0–2s: Symmetrical suburban street, identical people walking in sync; one person stops, looks into camera: "You call this freedom?"
2–4s: Centered dinner table, pastel food, family smiles too perfectly; mother to camera: "The AI scheduled my emotions."
4–6s: Storybook office, workers stamp papers in unison; manager to camera: "Your dreams were flagged as inefficient."
6–9s: Crosswalk diorama, traffic lights flip by themselves, everyone freezes mid-step; passerby to camera: "It paused you. Smile."
9–12s: Rooftop parking garage, protagonist centered, holding a small "OFF" key; whispers to camera: "If you're watching… you're already optimized."
12–15s: Perfectly centered wide of the pastel city; lights flicker like a toy being reset; a calm AI representative steps into frame, looks into camera: "See you tomorrow. You will."
Audio:
Quirky orchestral cue (harpsichord/strings) undercut by a soft electrical hum; clean intimate dialogue as if spoken to the viewer; a polite notification chime at 14.5s; end on a single dry breath. Negative: cartoon/CGI look, warped faces, jitter, oversharpening halos, unreadable motion smear, text overlays, subtitles.
Live-action cinematic western scene. One man slowly lays down his cards. The other man's eyes widen. The loser stands up abruptly and says: 'You're a goddamn cheat, Morrison!'.
A wide exterior shot moves past a quiet suburban house in daylight. The camera suddenly shrinks perspective and accelerates toward an open window in ultra-fast bee POV. It darts inside at extreme speed, weaving between kitchen utensils and chairs with rapid micro-adjustments.
SHOT1
Close-up on A (female).
Eyes glossy. Lips trembling but held tight. A:“If you walk out, don’t come back pretending it was complicated.”
SHOT2
Close-up on B (male).
He avoids eye contact. Throat tight. Breath uneven. B:“I’m not pretending. I’m scared.”
SHOT3
Extreme close-up on A’s eyes.
Tear finally falls. Voice steady despite it. A:“Then stay scared. Just stay.”
SHOT4
Wide shot, both framed in doorway, rain spilling light behind them.
B drops his bag. Silence holds. B:“Okay.”
18mm ultra-wide lens. Sprinter exploding off the starting blocks in a massive stadium, camera extremely low beside the track, racing inches from his legs. Track texture sharp, spikes scraping. Crowd sound muffled at first, then rises violently.
1. The "Car Review" (Linear Storytelling)
Scene–Mountain road pull-off overlooking a wide valley, early morning, cold clear air, soft natural light.
Shot 1 (3s): Medium wide shot. @ ReviewerCharacter leans casually against @ CarElement. The camera starts handheld, then stabilizes into a slow push-in.
Shot 2 (4s): Medium close-up on @ ReviewerCharacter. The camera performs a subtle arc move around the subject as they speak:
"This is one of those cars that doesn't try to impress you. It just does."
Shot 3 (4s): Cut to moving tracking shot. @ CarElement drives past camera at moderate speed. The camera pans, then switches into a smooth follow shot at door height.
Shot 4 (3s): Interior-facing angle through the open window. @ ReviewerCharacter continues speaking while resting one arm on the door:
"You feel it immediately—balance, response, no wasted motion."
Shot 5 (1s): Wide shot. @ CarElement drives away along the road, @ ReviewerCharacter remains in frame watching it disappear.
Natural performance, realistic voice, grounded cinematic tone.
3. The "Night Pursuit" (The Omni-Chase)
Best for: Testing reflections and high-speed physics
Scene–Urban highway at night, wet asphalt, sodium streetlights, light rain.
Shot 1 (2s): Interior car shot. Close-up on @ DriverCharacter's eyes reflected in the side mirror. Blue and red police lights pulse across their face. Sirens echo faintly.
Shot 2 (3s): Over-the-shoulder interior shot. @ DriverCharacter grips the steering wheel. The camera performs a quick handheld push-in as the siren sound grows louder.
Shot 3 (3s): Exterior rear shot. @ CarElement speeds forward. Police cars enter frame behind it, lights reflecting violently on the wet road. Camera switches into fast tracking mode.
Shot 4 (3s): Interior police car. Medium close-up on @ PoliceOfficerCharacter in the passenger seat, dashboard lights flickering. He speaks clearly:
"We have a suspect vehicle fleeing eastbound. Requesting backup."
Shot 5 (2s): Dynamic front three-quarter shot of @ CarElement. The camera whip-pans as the car changes lanes sharply, breaking line of sight.
Shot 6 (2s): Wide elevated shot. @ CarElement disappears into darkness between buildings. Sirens fade, rain and road noise remain.
High-speed realism, controlled chaos, no exaggerated stunts.
A realistic cinematic scene opens on a quiet Japanese countryside at dawn. Mist clings to rice paddies. A bullet train appears as a distant silver streak on the horizon. The camera launches forward at impossible speed, racing alongside the train, matching its velocity. The camera then punches through the window glass in one seamless motion and enters the cabin interior. Inside, everything is calm. A woman sits by the window, sipping tea, completely still. Steam rises slowly from her cup. The camera drifts past her face, catches the blur of the landscape reflected in her glasses, then exits through the opposite window. Outside again, the camera spirals around the full body of the train at high speed before pulling far back to reveal the train crossing a massive bridge over a turquoise river valley. End on a wide aerial shot, the train now small again, disappearing into a mountain tunnel. Silence except for a single distant horn echo.
2. The "Sleeping Beast" (Static to Dynamic)
Scene–Abandoned coastal road carved into rock cliffs, overcast sky, cold wind, muted colors.
Shot 1 (3s): Static wide shot. @ CarElement parked on the roadside, engine off. The world is still. The camera is locked-off, perfectly stable.
Shot 2 (3s): Low-angle close-up on the front bumper. The camera performs a slow creeping dolly-in, barely perceptible, as wind moves dust and small debris across the asphalt.
Shot 3 (4s): Side profile shot. The camera executes a slow 180-degree orbit around @ CarElement, maintaining constant distance. The car remains completely motionless while light subtly shifts across the body.
Shot 4 (3s): Extreme close-up. Side mirror fills the frame. The camera tilts upward slightly, catching the reflection of fast-moving clouds.
Shot 5 (2s): Sudden hard cut. Engine ignites. Headlights snap on. The camera remains still as @ CarElement launches forward out of frame.
The car never rushed. It waited.
A colossal biomechanical oni-dragon looms over a defiant silver-haired girl. The dragon is covered in matte-black plating layered with frost and battle scars. It has three glowing crimson optic clusters and jagged ivory fangs dripping with icicles. Neon-pink "GROK" lettering and red kanji decals mark its armor. The girl wears a cropped black tech-hoodie and cargo pants. Her pale skin is dusted with snow and she has no smile.
The style is an ultra-realistic cyberpunk kaiju portrait with weathered PBR metal, micro-scratches, ice crystals, macro hydraulic detail, and a cinematic 32-bit render look.
The environment is an arctic tundra blizzard under a pale cyan sky with swirling snow particles and a faint red glow from the dragon's vents cutting through the whiteout.
The lighting uses cold overcast skylight paired with hot crimson optic bloom. There are razor chrome rim highlights on the fangs, long blue shadows on the snow, and subtle subsurface red glowing under cracked plates.
The composition is a heroic low-angle wide shot. The dragon's head dominates the left two-thirds of the frame while the girl is anchored in the bottom-right. The jaws hover inches from her face with a diagonal cable sweep and rule-of-thirds optic alignment.
The color palette centers on gunmetal black and frost white for the dragon, neon crimson and magenta for its markings, icy cyan for the snow, monochrome black for the girl's outfit, and pale tones for her skin.
The camera is a 24 mm at f/1.8 with medium depth of field, tack-sharp focus on the nearest fang and the girl's eyes, and a creamy bokeh blizzard in the background.
The mood conveys an ancient guardian awakened, a quiet symbiosis, and a frozen apocalypse.
Details include icicles hanging from the lower jaw, micro-hydraulic pistons frozen mid-flex, snowflakes melting on hot optic glass, the girl's breath visible in the cold air, faint red vein pulses under the armor, dangling torn power cables, a single magenta lens flare behind the left horn, and a tiny "xAI" etched on the girl's belt buckle.
Output quality is 8K with ray-traced frost, volumetric snow, subsurface optic glow, zero noise, and cinematic grade rendering.
prompt: |
A lone windmill stands on a storm-dark prairie as a jagged lightning fork illuminates the sky for half a second; rain sheets sweep across the frame.
24 mm tripod long-shot, 1/50 s capture, high-contrast realism.
audio: thunderclap + patter of heavy rain
negative: no humans, no vehicles
Prompt: [Shot 1: Frontal Menacing Shot] A medium shot of a SWAT officer in full tactical gear, gas mask, and helmet. He is pointing his assault rifle directly at the camera lens (breaking the fourth wall). He is shouting with visible intensity: "LET THE HOSTAGE GO! DROP THE WEAPON NOW!" [Shot 2: The Threat] Cut to a medium shot of the killer in a dirty tank top, holding a woman in a chokehold. He has a pistol pressed to her head. He is sweating and manic, screaming at the off-screen officer: "STAY BACK! I'LL KILL HER! I SWEAR I'LL DO IT!" [Shot 3: Over-the-Shoulder Resolution] The camera is positioned directly behind the SWAT officer's right shoulder. We see the back of his helmet and his rifle in the foreground. In the distance (mid-ground), the killer is still visible holding the girl. The killer screams one last time: "I'M GONNA DO IT!" Then the officer's rifle kicks back with a single shot that hits the killer in the head. The killer falls instantly. The girl is left standing, shocked but safe. Technical Style: High-shutter speed action, realistic muzzle flashes, handheld camera shake, 24fps, English dialogue.
Ultra realistic ASMR scene of normal rainfall hitting natural green leaves in a tropical environment. Real-time motion, no slow motion effect. Rain falls naturally with random droplet patterns like normal rain. Leaves react naturally when hit by raindrops, slightly moving and bouncing like real leaves in rain. Natural soft rain sound close to ears, gentle ASMR rain ambience, no exaggerated cinematic effects.
A bald eagle folds its wings and dives toward a salmon-rich river; talons skim the water, droplets sparkling against emerald spruce reflections.
500 mm telephoto realism, 1/3200 s freeze, backlit spray.
Adult and kid each pic: I want a close-up of the face and upper torso, sitting on a spinning playground wheel at night. Do not change her outfit. Only face and shoulders visible in frame. The entire background rotates rapidly in circles, creating a dizzying spinning illusion: blurry nighttime playground with streaking lights, trees, and structures under a dark sky. Slight motion blur on background to emphasize high speed. Subject keeps a serious, melancholic expression, staring directly at the camera without moving much. Cinematic, moody lighting, realistic style. Big emphasis on the background all spinning horizontally.
video: The entire background rotates rapidly in circles, creating a dizzying spinning illusion: blurry nighttime playground with streaking lights, trees, and structures under a dark sky. Slight motion blur on background to emphasize high speed. Hair whipping outward from rapid spinning, reacting strongly to the motion as if from centrifugal force. Subject keeps a serious, melancholic expression, staring directly at the camera without moving much. Cinematic, moody lighting, realistic style. Big emphasis on the background all spinning horizontally.
natural human motion, subtle chest rise and fall from breathing, natural irregular blinking, soft micro head movements, relaxed posture adjustments, hair naturally flowing with movement, organic unscripted motion, lifelike presence
Ultra-realistic POV Video of riding a horse through historic London, first-person perspective with hands holding leather reins and part of the horse’s mane visible, cobbled street ahead, classic London architecture with brick townhouses, ornate facades, iron railings and archways, lively streets with pedestrians in period British attire, carriages and riders passing by.
A realistic cinematic scene begins in a vast open field under a warm golden sky during late afternoon. A man dressed in a dark suit stands in the foreground, holding a modern recurve bow. His posture is steady and focused, breath controlled, eyes locked on a distant archery target placed far on the horizon. The environment feels quiet and expansive, with subtle wind moving the grass and soft sunlight creating long shadows. As he releases the string, the arrow launches forward with a sharp snap. The camera immediately accelerates and locks onto the arrow mid flight. The camera tracks directly behind and slightly beside the arrow, maintaining perfect alignment as it cuts through the air. Motion blur surrounds the background while the arrow remains sharp and centered. The sound of rushing wind intensifies as the arrow travels straight and true. The target grows rapidly larger. In the final seconds, the camera closes in tightly as the arrow strikes the bull’s eye dead center. End on an extreme close up of the arrow embedded in the red center, vibrating slightly, with crisp impact sound and dramatic silence.
A dynamic cinematic fashion photo of a confident young woman posing on the hood of a red sports car on an open road. Low-angle wide-lens perspective with dramatic foreshortening as she reaches toward the camera. She wears a black crop top, black leather pants, a silver chain belt, fingerless gloves, hoop earrings, and mirrored aviator sunglasses. Athletic toned physique, edgy street-style aesthetic. Hair styled in playful space buns with loose strands. Natural golden-hour lighting, shallow depth of field, motion blur in the background road. Bold, rebellious vibe, high contrast, ultra-realistic, sharp focus, 4K editorial photography, fashion magazine style.
2. Ultra wide-angle low-perspective street fashion photo of a confident young woman sitting on a brick ledge between modern glass skyscrapers, shot from ground level emphasizing an oversized sneaker in the foreground. She wears a red sporty jacket with white stripes, black leather leggings, and chunky white sneakers with a bold red sole. Bright blue sky, dramatic perspective distortion, strong sunlight, cinematic lighting, sharp focus on shoe texture, shallow depth of field, urban futuristic city vibe, high realism, professional fashion photography, 8k detail.
I made this video using Google Flow in one prompt:
Prompt I used:
'Create a 16:9 ultra-realistic video filmed as a handheld side-angle selfie on a snowy mountain ski chairlift.
The camera is held by the woman herself, slightly in front of her shoulder and angled back toward her face, capturing a side view of her face, her shoulder, and open space behind her shoulder. The framing should feel like a real phone video, with very subtle natural hand movement.
A young woman is sitting on the chairlift wearing a black winter jacket, black inner top, and winter gloves. Her hair is slightly messy from the cold wind. She looks forward toward the mountains with a calm, neutral expression. She is not talking and there is no lip movement.
The background shows snow-covered pine trees, distant mountains, chairlift cables, and metal frame. Light snow is falling. The lighting is natural cold daylight with a realistic winter atmosphere.
A snowy owl silently flies in from behind her and slightly above shoulder level, entering the frame naturally from the open space behind her shoulder. The owl moves smoothly and quietly, with realistic wing and feather motion.
The owl gently lands on the woman’s shoulder, very close to her face. There is no sudden movement.
As soon as the owl settles, the woman slowly shifts her eyes and looks directly into the camera, showing a calm, slightly surprised expression. She does not move her head, does not speak, and does not gesture with her hands.
The final moment holds on the woman and the owl together, both calm and still, while the chairlift continues moving forward in the background.
The video should feel real, unplanned, and magical, with natural physics, smooth motion, cinematic realism, and no text, no logos, no branding, no dialogue.'
The main subject enters the frame, first sprinkles salt lightly into the flour and then stirs it evenly by hand, then pours in an appropriate amount of water, cracks an egg into it, and starts kneading the dough.
Scene 1 (hook):
Handheld camera with natural shake, woman standing in a luxury apartment with large windows and city skyline. Gray sweatshirt, amber sunglasses. She says: "Okay, I was fully expecting these gut gummies to do nothing… but I was wrong." Animated gestures, confident but casual tone. Bright natural light, slight camera movement adds authenticity.
Scene 2 (product):
Starts with shallow depth of field, clear bottle labeled "Gummies" held very close to the camera in sharp focus, colorful gummies visible inside. Then she pulls the bottle back to chest level, camera adjusts to a natural medium shot showing her full upper body in the apartment. She says: "These are Gummies, and yeah, they actually taste really good." Natural gesture holding the bottle, relaxed delivery, casual energy.
Scene 3 (testimonial):
Back to the handheld camera in the same apartment setting with large windows. She removes her sunglasses, revealing excited eyes, and leans slightly toward the camera. She says: "My digestion feels way better, I'm not as bloated, and somehow my skin improved too. After like two weeks." Big genuine smile, enthusiastic but natural hand gestures. Bright natural lighting, authentic energy, slight camera shake continues.
SHOT1
Wide shot, Character A (a man in his 40s, soaked trench coat) stands alone under the bridge, smoking. Camera pushes in slowly through the haze of rain.
Character A: "You brought a gun to this?"
SHOT2
Side close-up, Character B (a younger man, hoodie, trembling hand on the pistol) steps from the shadows, face tense.
Character B: "You killed my brother. What did you expect?"
SHOT3
Mid shot, A turns slowly, unfazed. Rain drips from his collar. He speaks quietly, steadily.
Character A: "I expected you to listen first."
SHOT4
Close-up on B, blinking back emotion. The camera tilts slightly as he lowers the gun — but only a little.
Character B: "You don’t get to ask that anymore."
A cinematic daytime action sequence on a sunlit urban highway. Bright natural light, sharp shadows, and occasional lens flare from the sun reflecting on vehicles and road surfaces.
Shot 1 (0–2s): Low angle front view of a black hi-fi sports car driving on a sunlit highway, sunlight reflecting on the hood, subtle lens flare across the frame, slow camera push-in.
Shot 2 (2–4s): A black sport motorcycle speeds into frame from behind, side tracking shot, sun glare and lens flare hitting the camera as the bike accelerates, rider leaning forward aggressively.
Shot 3 (4–6s): Close-up of the car's side mirror showing the motorcycle chasing, bright sunlight reflecting in the mirror, biker's eyes visible through helmet visor, cinematic sun flare streak.
Shot 4 (6–8s): Wide side shot of the car and motorcycle racing parallel at high speed on an open highway, strong sunlight, dynamic shadows, occasional lens flare across the lens.
Shot 5 (8–10s): Slow-motion stunt as the motorcycle jumps onto the car roof, bright sun in the background creating a dramatic lens flare, sparks flying.
Shot 6 (10–12s): The car loses control and crashes into a roadside barrier, debris and dust flying, harsh sunlight and flare streaks across the frame, shaky handheld camera effect.
Shot 7 (12–14s): Biker removes helmet and steps forward, driver exits the crashed car, intense face-off under bright sunlight, sun flare passing across their faces.
Shot 8 (14–15s): Close-up slow-motion punch, impact moment, strong sunlight backlighting the action, dramatic lens flare, cut to black.
Style: Hollywood action movie, realistic physics, bright natural daylight, high contrast, dynamic shadows, cinematic color grading, natural lens flare, motion blur, dynamic camera movement, ultra-realistic, 24fps.
Use the uploaded image as the exact first frame.
Create a 15-second ultra-realistic continuous chase shot. The camera is physically gripping the tiger's tail, locked in a rigid trailing POV. The tail fills the foreground, muscles flexing, skin rippling, fur vibrating from speed and airflow.
Ahead, a deer sprints across the grassy field in panic. Its movement is erratic and unbalanced, fear-driven stride changes and sharp direction shifts clearly visible as the distance rapidly closes.
The tiger accelerates into a full sprint with precise biomechanics: explosive hind-leg extension, visible spine compression and release, tail counterbalancing each stride. Grass flattens under impact, dirt and debris kick up naturally. The horizon trembles subtly from ground force, not artificial camera shake.
Extreme forward velocity creates strong motion parallax in the environment while the tail remains relatively stable due to the physical grip. Micro-oscillations sync perfectly with each stride cycle. Wind roars; fur streams backward consistently.
Lighting stays natural and continuous, shadows strobing beneath the body as legs cycle. No slow motion, no cuts, no stylization.
In the final moments, the tiger lunges, collides with the deer, pins it to the ground, and secures a decisive neck hold. The shot ends mid-action with full weight, tension, and momentum still present.
0-4 seconds: Static hold on the starting frame with very slow upward tilt (camera angle: low-angle wide establishing shot looking up at the figure and the ring framed by the massive concrete arch). Gentle ambient wind, distant dripping water, and low droning hum. Fog slowly swirls around the man's feet; faint particles of dust/mist catch the light. The ring remains still but subtly pulses with dim inner glow.
4-8 seconds: Slow push-in dolly toward the figure from slightly below (angle: medium tracking shot, low to eye-level). The man slowly raises one hand as if reaching toward the ring. Vines hanging from the ring sway gently in an unnatural breeze. The inscription 'FORGOTTEN ERA' begins to glow brighter, casting faint blue light on his face and the surrounding concrete pillars. Mist thickens slightly around him.
8-12 seconds: Close-up on the man's face from the side/profile (angle: tight over-the-shoulder or side close-up). His expression shifts from contemplation to subtle realization/wonder. Eyes reflect the glowing ring text. Camera slowly circles 180° around his head to reveal the ring more prominently overhead. Subtle lens flare from the god-rays; faint ethereal particles drift upward from the ring like time fragments or memories escaping.
12-15 seconds: Dramatic pull-back reveal widening out (angle: reverse tracking crane shot rising upward). The ring begins to slowly rotate clockwise; chains creak faintly. The figure lowers his hand and turns his head slightly toward camera, face half-lit by the glow. Fog rolls in denser, obscuring distant pillars. The ring's glow intensifies briefly then fades as twilight deepens. Fade to black with echoing soft wind and a single distant metallic resonance.
Generated this pure horror in Kling 3.0 Pro via @vadooai
Prompt: Photorealistic horror, dark abandoned mansion at night, heavy rain, flickering candlelight.
Young woman in torn nightgown walks backward in panic, wide terrified eyes, tears streaming, breathing fast.
She stares at an antique mirror — a tall, gaunt ghost with hollow black eyes stands right behind her in the reflection, but not in reality.
Slow dolly zoom on her horrified face, sudden whip pan to empty space, back to her.
Whispers and creepy echo, floor creaks, door slams.
She whispers "No… no…" in broken voice.
Sudden jump scare, skeletal hand grabs her shoulder hard. She screams in pure terror, mirror shatters.
Cold blue-gray tones, heavy shadows, film grain, intense dread, realistic physics, native eerie audio.
"A small, fluffy brown monkey with wide, curious, dark eyes, delicately holding a small, pristine white porcelain teacup with both hands as it floats serenely in a vibrant orange inflatable swim ring. The scene is set in crystal-clear, shimmering turquoise water, with sunlight creating intricate caustic patterns on the sandy bottom visible below. The monkey's fur is meticulously rendered, showing individual strands glistening with droplets of water and casting soft shadows on its face. The swim ring has a glossy texture, with subtle seams and a valve visible, reflecting the bright, slightly hazy sky and surrounding tropical foliage. Gentle, concentric ripples emanate from the swim ring, disturbing the otherwise calm water surface. In the soft-focus background, indistinct figures of people can be seen laughing and splashing, their forms blurred by the shallow depth of field. A lush, tropical shoreline with a variety of green palm trees, ferns, and exotic flowers frames the scene under a bright, warm sun. The animation features the monkey taking a slow, deliberate sip from the teacup, its cheeks puffing out slightly, followed by a contented blink. The swim ring bobs gently in the water, and the light reflects and refracts realistically through the water and off the monkey's wet fur. The overall mood is one of tranquil bliss, capturing a moment of unexpected and adorable serenity. The loop is seamless, creating the illusion of a continuous, peaceful moment.", "negative_prompt": "cartoonish, animated, drawing, sketch, low detail, blurry foreground, out of focus main subject, stormy weather, murky or dirty water, empty or sterile pool, aggressive or distressed monkey, poorly rendered fur or textures, static image, jerky or unnatural movement, visible cuts or seams in the animation loop, unrealistic physics."
Extreme macro straight-on photoreal eye (female), cool blue-gray iris with visible radial fibers; natural off-white sclera with hyper-detailed organic capillaries that form words as biological veins (not overlay); tear film highlights; every lash, pore and micro-hair visible; high-fashion diffused lighting; animate two natural blinks — on reopen after 1st blink capillaries read "Made By", on reopen after 2nd blink capillaries read "doctor wasif"; morph text biologically during eyelid closure; ultra-shallow DOF, editorial retouch, avoid CGI/overlay text/fake red lines/symmetry artifacts.
SEQUENCE:
  - scene_1:
      shot_type: Wide Shot (WS)
      camera_position: static camera facing the hotel entrance (that's where the camera is)
      action: A tall man walks confidently out of the revolving glass doors of a luxury hotel. He adjusts his green trucker cap.
      visuals: Luxury hotel entrance, daytime, glass reflections, polished atmosphere.
      transition: "[CUT TO]"
  - scene_2:
      shot_type: Medium Shot (MS)
      camera_position: side view from the curb level (that's where the camera is)
      action: The man approaches a sleek black Audi RS7, opens the driver's door, slides in quickly, and closes the door with a solid thud.
      visuals: Black Audi RS7, city street background, metallic car reflection.
      transition: "[CUT TO]"
  - scene_3:
      shot_type: Close-Up (CU)
      camera_position: front view through the windshield (that's where the camera is)
      action: The man grips the leather steering wheel firmly. His face is focused, eyes intense.
      visuals: Car interior, leather texture, dashboard lights turning on.
      transition: "[CUT TO]"
  - scene_4:
      shot_type: Macro Shot (MS)
      camera_position: top-down view centered on the center console (that's where the camera is)
      action: The man's hand moves to the gear shifter, clicking it decisively into 'S' (Sport) mode.
      visuals: Brushed aluminum gear shifter, illuminated 'S' symbol, expensive interior detail.
      transition: "[CUT TO]"
  - scene_5:
      shot_type: Extreme Close-Up (ECU)
      camera_position: low angle next to the rear tire (that's where the camera is)
      action: The car's wheel spins instantly, smoke generates from the friction, the car launches forward.
      visuals: Rubber tire texture, asphalt, motion blur, white tire smoke.
      transition: "[CUT TO]"
  - scene_6:
      shot_type: Medium Shot (MS)
      camera_position: passenger seat angle looking at driver (that's where the camera is)
      action: The man is driving, a satisfied smirk appears on his face. He glances briefly into the rearview mirror.
      visuals: City blur outside windows, dynamic light passing over his face, calm confidence.
      transition: "[CUT TO]"
  - scene_7:
      shot_type: Graphic / Text Screen
      camera_position: static centered frame (that's where the camera is)
      action: The screen fades to black, then the Audi rings logo appears in silver with white text below.
      visuals: Black background, Silver Audi Logo, Text overlay: 'Beauty is when a BMW stays in your rearview mirror.'
      transition: "[END]"
SUBJECT:
  character_1:
    profile: Male, 30s, Caucasian, slim athletic build, short brown hair. Wearing a green mesh trucker cap with 'JETS' logo, white scoop-neck t-shirt, black lightweight cardigan, olive green pants, black braided belt, silver watch, silver chain necklace.
    consistency_lock: man in green JETS cap and black cardigan
ACTION:
  physics_mode: realistic physics governing all actions, authentic momentum conservation, vehicle dynamics
  movement_quality: confident movement, fluid driving mechanics, high-speed perception
CINEMATOGRAPHY:
  lighting: Cinematic Lighting, high contrast inside car, natural daylight for exterior
  color_grading: Teal & Orange, sleek commercial look, high saturation
SOUNDS:
  soundscape: City street ambience transitioning to sound-proofed car interior silence.
  sfx: Revolving door whoosh, car door heavy thud, engine ignition roar, gear shift click, tire screech, engine acceleration hum.
TECHNICAL:
  negatives: distorted hands, morphing car, cartoon effects, blurry motion, shaky camera, two steering wheels, amateur quality
The skater sits at bowl edge holding his board, camera fixed on selfie stick. He grins widely: "Yo, check out this new park! About to drop in for the first time." He stands up quickly, camera following his movement as he positions at the bowl's edge, board underfoot. "Let's see if I can nail this kickflip to fakie..." He pushes off, dropping into the bowl with speed, camera maintaining POV angle showing the curved wall rushing up. Wheels carve the transition smoothly. At the top, he pops the kickflip, board rotating cleanly, lands solidly.
Camera Movement: Handheld selfie stick POV following action into bowl.
Negative Prompt: No morphing, no warping skateboard, no duplicating limbs, no floating, no unnatural skating motion, no wheel distortion, no facial distortion, no background inconsistencies, no temporal artifacts.
Orbit Cam
A close-range brawl in a tight corridor between two elite assassins. The camera circles the fight, weaving through punches and slams. Industrial hallway with exposed pipes, flickering bulbs, steam leaks. Continuous 360° orbit cam inside tight space, handheld shake with punctuated impact vibration, brutal intimacy and flowing geometry.
HANDHELD
A construction worker involved in a street brawl after witnessing a murder. Swings a pipe, ducks, tackles the attacker into scaffolding. Urban construction site lit by work lamps, concrete dust in air. Gritty handheld camera with snap zooms and motion blur on impact, metallic reverbs and grounded brutality, no slow motion, all pressure.
Scene 1: The man finishes a shoot in a YouTube studio and stands up.
Scene 2: The man leaves the studio and gets into his car.
Scene 3: The man is inside his car. Close-up, tilting from the steering wheel up to the man's face.
Scene 4: The man enters his home and his cat jumps into his lap.
(Reference image: the first frame of the video)
A desperate woman escaping from a kidnapping, barefoot, handcuffed, crying. Runs across a construction site dodging rebar and cranes. Abandoned industrial zone under gray sky, heavy wind. Over-the-shoulder cam with whip focus shifts, sudden whip pans to threats, soft handheld instability, intensity built from desperation.
Man walking down the street in 1980s NYC.
[cut] Close-up shot: the man sees something unexpected in front of him.
[cut] Over-the-shoulder shot: in front of him, a flower stand named Donna, with an old lady selling flowers.
[cut] Back shot of the man walking up to the flower stand.
No music, no talking.
SHOULDER CAM
A rescue worker in a flooded village pulling someone from a car window. Behind them, a landslide tears down the hill with trees and mud. Rain pouring, water waist-deep, electricity arcing from poles. Shoulder-cam style tracking with fast pull-back to show landslide, muffled underwater audio pulses, nature’s violence from human scale.
Tracking shot
A rebel on a dirtbike weaving through explosions in a junkyard battlefield. Skids under a flipping armored truck while drawing a sidearm. Rusted metal debris, smoke clouds, burning containers. Rear tracking shot on bike combined with whip pan cut to side angle, firelight strobe and shockwave shake, kinetic intensity in full chaos.
Kling AI + Gemini Nano Banana Pro
Motion Prompt: The pizza begins a slow, smooth rotation while its layers gently separate one by one, maintaining perfect alignment, spacing, and scale. Each ingredient floats apart with precise, controlled motion. The movement is clean and fluid, with no extra effects or distractions.
Start Frame: A high-quality, professional product photograph of a gourmet chicken pizza with a golden-brown crust, melted mozzarella cheese, and evenly distributed seasoned chicken pieces, topped with tomato, onion, and olives. The pizza looks hot, fresh, and appetizing with visible cheese stretch and baked texture. Minimalist style, shot against a pure solid white background with soft, natural shadows. Ultra-sharp focus, 8K resolution, clean and modern fast-food aesthetic.
End Frame: Create a hyper-realistic exploded vertical infographic composition of a chicken pizza.
At the top, a golden oven-baked pizza crust edge with light blistering and baked texture.
Below it, stretchy melted mozzarella cheese floating smoothly with natural cheese pull.
Under the cheese, juicy grilled or crispy chicken chunks with visible seasoning and moisture.
Next, fresh toppings such as sliced bell peppers, olives, and onions suspended mid-air.
Beneath the toppings, a rich tomato sauce layer with glossy depth.
At the bottom, a soft yet crisp pizza base, perfectly centered and aligned.
Pure solid white background, soft studio lighting, and subtle realistic shadows beneath each floating layer.
Ultra-sharp focus, DSLR macro photography look.
Clean, minimalist infographic text labels with thin pointer lines for each layer. Premium, professional, photorealistic food infographic style.
Ultra-cinematic vertical composition of coffee elements suspended in mid-air: cascading roasted coffee beans, chocolate bonbons, swirling latte art in a mid-air coffee cup, splashes of milk and espresso frozen in motion, and fine coffee grounds dusting through the air, captured in rich brown and cream color tones.
Hyper-detailed textures with glossy liquid surfaces, crema bubbles, and matte bean textures, lit with dramatic high-contrast studio lighting against a deep, velvety black background. Cinematic depth of field, splash photography aesthetic, premium café advertising style. Shot with a virtual Nikon D850, 105mm macro lens, aperture f/4.0, crisp editorial clarity.
A weathered wooden fishing boat with peeling paint and tangled nets rests quietly in shallow turquoise waters near a rocky tropical cove, surrounded by palm trees swaying gently in the breeze; the boat slightly shifts with the calm waves as the camera performs a slow lens push from a high angle, capturing the sun glinting off the water's surface and the boat's faded textures; soft daylight with warm tones and side lighting highlights the serenity of the setting; visual style is realistic and cinematic, evoking a sense of nostalgic solitude.
location: luxury penthouse kitchen, one continuous mini story
Modern luxury penthouse kitchen at golden hour, warm sunlight through floor to ceiling windows, city skyline outside, marble island, copper cookware, a distinctive teal espresso machine, a small crack on the right edge of the marble, a bowl of lemons near the sink. Same main character throughout: a woman chef in a white linen shirt with rolled sleeves and a thin red bracelet on her left wrist. Keep her appearance, outfit, props, lighting, and kitchen layout consistent across every cut.
CUT TO: Wide establishing shot showing the full kitchen layout, skyline, marble island center frame, teal espresso machine on the back counter, lemons by the sink.
CUT TO: Medium shot from the same side of the island as she places a wooden cutting board on the marble, the crack still visible near the right edge.
CUT TO: Close-up on her hands, red bracelet visible, slicing a strawberry tart topping; crumbs and fruit glisten in the same warm light.
CUT TO: Over-shoulder shot as she plates the tart on a matte black plate, teal espresso machine softly blurred in the background.
CUT TO: Insert close-up of espresso pouring from the teal machine into a small cup, crema forming, same golden reflections on the metal.
CUT TO: Medium shot as she carries the plate and cup to the window-side counter; skyline stays in the same direction, sunlight consistent.
CUT TO: Tight close-up as she smiles and adjusts a lemon slice garnish; end on the tart’s glossy surface with the skyline bokeh behind it.
Mini nature documentary, epic and serene. Ultra-detailed natural world cinematography, realistic textures, soft documentary color grade, stable tracking, gentle environmental motion.
CUT TO: Wide dawn landscape of desert dunes and distant mountains; the sky transitions through cool blues into warm orange while long shadows slide across ripples in the sand.
CUT TO: Telephoto shot of a hawk gliding across the frame, wings steady, heat shimmer wavering beneath; the camera tracks smoothly with the bird centered against a layered horizon.
CUT TO: Ground-level close-up of a small lizard pausing on a pebble, blinking; the focus snaps between its textured scales and the sand grains, then it darts forward.
CUT TO: Slow-motion close-up of sand scattering from the lizard’s feet, tiny particles lifting and catching sunlight.
CUT TO: Medium shot of hardy desert plants swaying gently; a beetle crawls over a stem, the camera follows with a calm, deliberate move.
CUT TO: Wide reveal as the dunes open onto a ribbon of water; sunlight glitters on the surface, and distant birds lift off in a thin line.
CUT TO: Final tranquil shot: the river shimmer fills the frame, reflections pulsing softly, ending on a bright glint that fades into calm.
Cinematic close-up of a professional woman in a corporate office, looking directly at camera with confident expression, soft window lighting from left, shallow depth of field, 4K photorealistic quality
Ultra-realistic, cinematic, cozy atmosphere. High-energy yet serene mood with professional cinematography, sharp focus on the guitarist, and vibrant color grading.
Environment: A pristine, snow-covered arctic landscape at night. A glowing igloo stands in the background. In the foreground, a crackling campfire casts warm, flickering light. Above, a majestic Aurora Borealis (Northern Lights) swirls in green and purple, reflecting off the snow and the group.
Subjects & Attire: A small group of 3-4 people (1 young woman playing the guitar, 1 old man, 1 old woman, and 1 young man) gathered around the fire. They wear authentic winter gear: thick woolen sweaters, fur-lined parkas, and knit hats. The central subject is a charismatic acoustic guitarist playing a classic dreadnought guitar.
Timeline Sequence (15 Seconds):
0.0s - 4.0s (The Scene): A slow, sweeping drone shot reveals the glowing igloo and the campfire. The majestic Aurora Borealis dominates the sky. The group is seen from a distance, establishing a sense of warmth in the vast cold.
4.0s - 8.0s (The Gathering): Low-angle medium shot focusing on the guitarist. In the background, friends move naturally: one sips from a steaming mug, another taps their foot to the beat. Their breath is visible as white vapor in the crisp air.
8.0s - 12.0s (The Focus): A smooth push-in to a close-up of the guitarist’s hands. The camera captures the fingers skillfully moving over the fretboard. Firelight glints off the guitar’s polished wood.
12.0s - 15.0s (The Finale): A wide pull-back shot. The guitarist finishes with a gentle chord. The camera rises toward the shimmering aurora as the music fades into the sound of the wind.
Lighting: Contrast between the warm orange glow of the fire and the cool, ethereal green of the Northern Lights. Dramatic flickering shadows and soft highlights on the snow.
Action: Natural, subtle movements from the group. The guitarist shows deep focus and a gentle smile, inviting the audience into the cozy circle.
Audio: Melodic, finger-picked acoustic guitar layered with the crackle of fire and the subtle whisper of the arctic wind.
Ultra photoreal macro studio photograph of two tiny miniature tennis players playing on top of a kitchen dish sponge, the sponge is rectangular with bright yellow foam sides and a dark green abrasive scrub surface, the scrub surface is used as the tennis court, a realistic miniature tennis net stretched across the middle, one player near the camera hitting a small tennis ball, the other player far in the background, extreme detail, real materials, visible sponge pores and micro texture, realistic shadows and soft studio lighting, shallow depth of field, bokeh, clean beige background, high contrast, 8k, hyper-real, cinematic macro, 100mm macro lens, f/2.8
A high-end intimate shower gel commercial with bold, refined luxury and confident sensuality.
Visual Setup: A sleek black glass pump bottle labeled "BARBARU — Banana Rule — Intimate Shower Gel", placed on an elegant sculptural arrangement of ripe bananas. Golden gel flows slowly and smoothly over the bottle and fruit. Deep matte black background with controlled studio lighting.
Action & Camera Direction: Slow cinematic push-in toward the product. Golden gel flows in controlled slow motion, rich and glossy. Subtle camera orbit reveals bottle curves and premium reflections. Light glides across glass and gel for a luxury skincare-ad finish. Movement is intentional, confident, editorial.
Voiceover / Dialogue (Professional & Bold):
0–3s (Low, calm, confident male or female voice): "Luxury… begins with confidence."
3–6s: "Designed for intimacy. Crafted for comfort."
6–9s (slightly deeper tone): "BARBARU Banana Rule."
9–10s (soft, premium whisper): "Indulge without compromise."
Timing Breakdown (10s):
0–2s: Fade in from black, product silhouette reveal.
2–5s: Golden gel begins flowing, texture close-ups.
5–8s: Camera orbit, reflections and curves emphasized.
8–10s: Hero shot, product centered, confident stillness.
Mood & Brand Tone: Professional luxury brand energy. Bold but tasteful sensuality. Calm, confident, expensive presence. Fashion editorial × premium skincare commercial.
Style & Quality: Cinematic studio lighting, shallow depth of field, ultra-realistic textures, glossy reflections, smooth motion, high-end commercial polish, 8K realism.
Video prompt:
{
  "cinematic_video_request": {
    "meta": {
      "title": "Sadie Sink - Night Drive Portrait",
      "style_preset": "Cinematic Realism",
      "duration_seconds": 10,
      "resolution": "4K",
      "aspect_ratio": "16:9"
    },
    "prompts": {
      "main_prompt": "Ultra-realistic cinematic animation of Sadie Sink sitting in the backseat of a luxury car at night. The city lights outside the window create soft motion blur with passing traffic and glowing streetlights. Subtle camera push-in shot from medium frame to close-up. Gentle movement in her hair as the car moves. She slowly shifts her gaze toward the window, blinking naturally, then looks back toward the camera with a calm, slightly mysterious expression.",
      "visual_modifiers": "4K cinematic quality, realistic skin texture, natural facial micro-expressions, smooth motion, dramatic nighttime mood, film-grade color grading, soft contrast, subtle handheld camera feel, shallow depth of field, bokeh city lights, detailed leather texture.",
      "lighting_prompt": "Soft ambient lighting from streetlights flickers across her face, creating dynamic shadows and warm highlights. Background traffic lights streak smoothly past the window."
    },
    "scene_specifications": {
      "subject": {
        "name": "Sadie Sink",
        "action": "Sitting, gazing out window, turning head to camera, blinking",
        "expression": "Calm, mysterious, natural micro-expressions",
        "details": "Gentle hair movement, realistic skin texture"
      },
      "environment": {
        "setting": "Backseat of luxury car",
        "time": "Night",
        "exterior": "City streets, passing traffic",
        "details": "Detailed leather interior, window reflections"
      },
      "camera": {
        "movement": "Slow push-in (dolly forward)",
        "stabilization": "Slight handheld feel (organic motion)",
        "framing": "Medium shot to Close-up",
        "focus": "Shallow depth of field with background bokeh"
      },
      "lighting": {
        "type": "Dynamic/Transient",
        "sources": "Passing streetlights, city glow",
        "characteristics": "Warm highlights, soft contrast, rhythmic flickering shadows"
      }
    }
  }
}
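A JSON prompt like the one above is easy to break with a stray comma or a missing section, so a quick pre-flight check before pasting it into a video tool can help. The following is a minimal sketch only: the function name `check_request`, the list of required sections, and the trimmed `REQUEST_TEXT` sample are assumptions made for illustration, and none of this is part of any video model's API.

```python
import json

# Hypothetical trimmed copy of the request above; long prompt strings are shortened.
REQUEST_TEXT = """
{
  "cinematic_video_request": {
    "meta": {
      "title": "Sadie Sink - Night Drive Portrait",
      "style_preset": "Cinematic Realism",
      "duration_seconds": 10,
      "resolution": "4K",
      "aspect_ratio": "16:9"
    },
    "prompts": {"main_prompt": "..."},
    "scene_specifications": {"subject": {}, "environment": {}, "camera": {}, "lighting": {}}
  }
}
"""

# Assumption: these section names are simply read off the layout of the prompt above.
REQUIRED_SECTIONS = ("meta", "prompts", "scene_specifications")


def check_request(text):
    """Parse the prompt JSON and return a list of problems (empty if none found)."""
    try:
        request = json.loads(text)["cinematic_video_request"]
    except (json.JSONDecodeError, KeyError, TypeError) as exc:
        return [f"unreadable request: {exc}"]

    problems = []
    for section in REQUIRED_SECTIONS:
        if section not in request:
            problems.append(f"missing section: {section}")

    meta = request.get("meta", {})
    duration = meta.get("duration_seconds")
    if not isinstance(duration, (int, float)) or duration <= 0:
        problems.append("duration_seconds must be a positive number")

    # The aspect ratio is expected in the "W:H" form used above, e.g. "16:9".
    width, sep, height = str(meta.get("aspect_ratio", "")).partition(":")
    if not (sep and width.isdigit() and height.isdigit()):
        problems.append("aspect_ratio must look like W:H")

    return problems


print(check_request(REQUEST_TEXT))
```

An empty list means the structure parsed and the basic fields look sane; it says nothing about whether a given model will honor the fields.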
Ultra-realistic cinematic macro video of an analog luxury watch being hand-assembled by a professional watchmaker. White silk gloves carefully hold the stainless-steel watch case while precision tweezers gently place the hour, minute, and second hands onto a brushed silver sunburst dial. Extreme close-up shots reveal polished indices, fine engravings, and micro-details of the dial texture. Cut to the golden mechanical movement with visible jewels and gears being delicately adjusted. Soft studio lighting with dramatic shadows, shallow depth of field, premium luxury aesthetic. Slow, smooth camera movements, subtle reflections on metal surfaces. Final shot shows the completed watch ticking for the first time, audible tick-tick sound, symbolizing precision and craftsmanship. Photorealistic, 4K quality, cinematic color grading, luxury brand film style.