Style: High-Fantasy, Grimy, "Jackie Chan" Medieval Combat.[CINEMATIC SETUP] Film Style: Gritty High-Fantasy, 35mm film, torch-lit atmosphere (oranges/shadows). Lens: 14mm Ultra-wide for distorted, close-up action. Camera Behavior: "Slam-zooms" on impacts, low-angle tracking through legs, and dynamic whip-pans between combatants. Audio Style: Heavy wood thuds, ale splashing, metallic sword clangs, and raucous tavern laughter.[TIMELINE SECOND BY SECOND]0-3s: [The Bar-Slide] A rogue slides down the length of a wooden bar, kicking mugs of ale into the faces of two orcs. [Physics] Ale foam sprays realistically.3-7s: [The Chair Acrobatics] The rogue gets cornered; he flips a heavy wooden chair, stands on it as it's falling, and uses the momentum to leap to a chandelier.7-11s: [The Chandelier Swing] Swinging on the chandelier, he uses a large ham leg to club a guard, then throws the ham to a hungry dog who trips the next attacker.11-15s: [The Climax] The chandelier rope snaps. The rogue falls, landing perfectly in a giant barrel of flour. [Physics] A massive white cloud erupts. The camera zooms into the flour-white face as he sneezes a "puff" of flour.[STYLE & QUALITY BOOSTERS] Wood grain textures, realistic liquid and powder physics, stable character facial expressions, 8K.[NEGATIVES] CGI look, clean clothes, deformed limbs, blurry action.
Style: High-Fantasy, Grimy, “Jackie Chan” Medieval Combat.[CINEMATIC SETUP] Film Style: Gritty High-Fantasy, 35mm film, torch-lit atmosphere (oranges/shadows). Lens: 14mm Ultra-wide for distorted, close-up action. Camera Behavior: “Slam-zooms” on impacts, low-angle tracking through legs, and dynamic whip-pans between combatants. Audio Style: Heavy wood thuds, ale splashing, metallic sword clangs, and raucous tavern laughter.[TIMELINE SECOND BY SECOND]0-3s: [The Bar-Slide] A rogue slides down the length of a wooden bar, kicking mugs of ale into the faces of two orcs. [Physics] Ale foam sprays realistically.3-7s: [The Chair Acrobatics] The rogue gets cornered; he flips a heavy wooden chair, stands on it as it’s falling, and uses the momentum to leap to a chandelier.7-11s: [The Chandelier Swing] Swinging on the chandelier, he uses a large ham leg to club a guard, then throws the ham to a hungry dog who trips the next attacker.11-15s: [The Climax] The chandelier rope snaps. The rogue falls, landing perfectly in a giant barrel of flour. [Physics] A massive white cloud erupts. The camera zooms into the flour-white face as he sneezes a “puff” of flour.[STYLE & QUALITY BOOSTERS] Wood grain textures, realistic liquid and powder physics, stable character facial expressions, 8K.[NEGATIVES] CGI look, clean clothes, deformed limbs, blurry action.
**Create a seamless, cinematic transition between Image 1 and Image 2.**
Start fully inside Image 1, then guide the camera into a natural motion path that logically leads toward the transition point. Maintain consistent lighting, depth, and perspective so the viewer feels physically carried from one scene to the next.
Use **smooth blending of colors, shadows, and textures**, and ensure that surfaces and objects morph gradually and believably. The transition should feel like a continuous camera move, not an effect.
Keep the aesthetic **cinematic, realistic, and polished**, with:
- physically accurate camera movement,
- stable motion and depth,
- natural environmental lighting,
- high-resolution textures.
Your goal is to **make the cut invisible**, as if the camera is traveling through one continuous world that naturally evolves into Image 2.
Create a 15-second cinematic kung fu performance video.
Use @[image1] as the fixed character sheet reference. The character must strictly match the character sheet.
Use @[image2] as the storyboard reference.
Follow the storyboard shot by shot as the main source for action order, camera rhythm, body movement, framing, movement direction, camera angles and visual progression. Treat each storyboard panel as a sequential keyframe. Preserve the shot order and make the video feel like the storyboard has been translated into continuous live-action motion. The sequence must end on a frozen final frame while the performer is still airborne.
Do not add text, captions, storyboard labels, arrows, UI, logos or watermarks. Do not treat the storyboard as a single image. Do not redesign the character, change the costume or alter the face. Do not begin with a calm stance, preparation pose or slow introduction. Do not make the elemental effects look like superhero powers or excessive fantasy glow.
Visual style:
stylized cinematic realism, high-end 3D painterly animation quality, dynamic cloth simulation, expressive silhouette design, rich cinematic lighting, controlled color palette, natural motion blur, dramatic scale, beautiful but aggressive physicality, premium feature-animation aesthetic.
Environment:
vast ancient temple, towering stone columns, worn temple floor, drifting incense smoke, hanging fabric, harsh light shafts, faint dust in the air, subtle wet floor reflections, high contrast shadows.
The performance is a solitary female kung fu routine inside a vast ancient temple. The routine starts immediately in action, with no calm stance, no preparation pose and no slow introduction. The movement should feel aggressive, ritualistic, disciplined, physically extreme and spiritually charged.
This is not a fight against an enemy. It is a solo performance of force, control, exhaustion, fury and release.
Follow story board for choreography direction.
Element progression:
early sequence: subtle wind, dust and pressure lines responding to movement.
middle sequence: stronger air shockwaves, stone fragments, floor cracks and water-like ripples across the temple floor.
late sequence: controlled fire trails, heat distortion and energy spirals around explosive strikes and kicks.
climax: wind, dust, stone, water ripple and fire accents combine into a stronger elemental vortex.
final beat: the performer is airborne above the temple floor in a powerful kung fu strike, body twisted mid-air, hair and fabric flaring outward, with all elements converging around her before impact.
Elemental VFX must feel spiritual, ritualistic and cinematic. The effects should be integrated with the choreography and motivated by physical movement. Keep the energy raw, elemental, atmospheric and grounded in the temple environment.
Use Laban movement logic throughout:
weight: strong, heavy, grounded during impacts, with brief lightness during jumps and aerial twists
time: quick during strikes, kicks, drops and turns, sustained during suspended holds and recovery transitions
space: direct during attacks, blocks and lunges, indirect during spinning turns and elemental vortex moments
flow: bound during rooted stances and precise strikes, free during aerial motion, spinning fabric movement and elemental release
Create a 15-second cinematic action video.
REFERENCE USAGE:
Image 1 = strict reference for the female protagonist. Use her face, blue eyes, long straight black hair, pale skin, slim body type, and overall look. She must remain a realistic live-action woman.
Image 2 = strict reference for the male antagonist. Use his exact anime identity: messy black hair, sharp blue eyes, youthful face, slim athletic build, dark navy T-shirt, black shorts, sneakers, necklace, and handgun.
IMPORTANT STYLE RULE:
The male character from Image 2 must stay fully anime during the entire video.
He must NOT become realistic, photoreal, live-action, or humanized.
Keep him clearly anime in every shot: anime face, anime eyes, anime hair, anime proportions, cel-shaded / anime rendering.
The female character must remain realistic live-action.
Keep the contrast intentional: realistic woman vs anime man.
Do not copy the reference layouts, panels, borders, labels, or text. Use them only for identity, outfit, and weapon consistency.
SCENE:
The chase happens on Istanbul streets at night. It must clearly feel like Istanbul, not a generic city. Use narrow sloped streets, older apartment facades, Turkish storefronts, wet asphalt, parked cars, yellow taxis, street lamps, road reflections, and dense Beyoğlu / Karaköy style atmosphere. Cinematic night lighting, handheld energy, shallow depth of field, realistic motion blur.
0:00–0:03
Wide establishing shot of an Istanbul street at night. The realistic woman runs desperately through the street, weaving between parked cars and moving traffic. Behind her, the anime man chases her at high speed while firing a handgun. Muzzle flashes light the wet street. Bullets streak past her. Dynamic shaky tracking shot.
0:03–0:04
She gets hit in the shoulder, stumbles forward, and clutches the wound while trying to keep running. The male antagonist keeps advancing. He remains fully anime.
0:04–0:07
She crashes onto the wet asphalt and rolls. Shocked and exhausted, she looks up as the anime man approaches and tries to fire again, but the gun is empty. Only clicking sounds. He throws the handgun aside. She uses the chance to push herself back up.
0:07–0:12
Close-quarters combat begins. Fast punches, dodges, blocks, elbows, and kicks. Include a low-angle near-miss spinning kick, a close-up blocked strike, and a quick counterattack from the woman. The fight is aggressive and cinematic. The woman stays realistic; the man stays anime.
0:12–0:13
The anime man lands a powerful spinning back kick to her chest. Show the impact in slow motion. Water droplets and dust spray from the wet street.
0:13–0:15
She is thrown backward into a parked car, smashing the windshield. Glass scatters. She collapses against the vehicle, defeated. The anime man stands victorious in the middle of the Istanbul street in a strong combat stance as the camera slowly pushes in. Final shot: he is still fully anime.
STYLE:
Cinematic action, Istanbul street atmosphere, wet reflective asphalt, high-contrast night lighting, realistic impact physics, handheld camera movement, no gore.
NEGATIVE:
Do not turn Image 2 into live-action or photoreal. Do not use a generic city. No text, subtitles, logos, watermark, gore, dismemberment, extra fighters, duplicated characters, distorted anatomy, or extra weapons after the gun is thrown away.
A cinematic martial arts duel set in a traditional ancient Chinese courtyard paved with large weathered stone tiles, framed symmetrically by old wooden temple buildings, carved balconies, hanging red vertical banners with black Chinese calligraphy, and soft greenery along the sides. Two highly detailed martial artists face each other in the center in classic kung fu combat stances, captured mid-confrontation with intense focus and restrained tension.
On the left stands an older rugged martial artist wearing layered brown leather and fabric warrior robes with worn textures, stitched seams, dark belts, armored wrist guards, rugged boots, and flowing lower garments. His stance is low and grounded with one fist extended forward and the other hand pulled back defensively. His facial expression is sharp and concentrated, with realistic skin texture, subtle wrinkles, and cinematic lighting shaping the face.
On the right stands a disciplined martial arts master wearing an elegant dark navy-blue traditional Chinese robe with subtle embroidered patterns, red trim accents, long flowing fabric, and clean structured tailoring. His stance is balanced and defensive, one hand open in a Wing Chun-style guard while the other hand forms a fist. Calm but intense facial expression, highly detailed fabric folds, realistic posture, and natural movement in the robe.
Between them in the background is a wooden ceremonial table draped with vivid red cloth, positioned centrally in front of a misty temple entrance that creates strong depth and symmetry. Soft atmospheric haze fills the center background. Natural daylight with cinematic contrast, soft shadows, muted earthy tones, realistic stone textures, subtle depth of field, ultra-detailed realism, authentic kung fu movie aesthetic, balanced composition, dramatic tension, photorealistic cinematic still frame, high-end film production quality, 4K, ultra-sharp details, aspect ratio 16:9.
Style: Regency Period Drama, Elegant but Chaotic.[CINEMATIC SETUP]
Film Style: Period Drama (Bridgerton style), warm golden hour lighting, soft focus, high-speed cinematography. Lens: 85mm Portrait lens for shallow depth of field. Camera Behavior: Elegant sweeping crane shots, "snorrricam" (attached to subject) for the chaos, and smooth 120fps slow-motion. Audio Style: Vivaldi-style string quartet, the "tink" of silver spoons, gasps of horror, and fabric tearing.[TIMELINE SECOND BY SECOND] 0-3s: [The Slip] A duchess slips on a dropped cucumber sandwich. The camera follows her in slow-motion as her silk dress billows upward like a parachute.
3-6s: [The Chain Reaction] She grabs a tablecloth to steady herself, pulling a three-tier tea service toward her. Scones fly through the air like cannonballs.
6-10s: [The Butler’s Save] A butler performs a sliding knee-drop to catch a falling teapot. A corgi runs across his back, using him as a bridge to reach a flying tart.
10-13s: [The Fan Fight] Two ladies-in-waiting use their folding fans to bat away flying sugar cubes as if playing badminton. [Physics] Sugar cubes shatter on impact.
13-15s: [The Climax] A giant bowl of strawberry punch tips over. The red liquid douses a snobbish Duke perfectly. The camera zooms in on his monocle popping out into the punch.
[STYLE & QUALITY BOOSTERS] Intricate lace textures, liquid physics (tea/punch), realistic fabric movement, 8K, stable character features.[NEGATIVES] Modern objects, deformed hands, inconsistent dress physics, blurry faces.
[0:00–0:02] THE CROWD REACTION SHOT: Visual: Medium-close live broadcast shot, 9:16 vertical ratio, cold arena floodlights reflecting off rink glass. Subject: 24yo Indonesian-Korean woman, long straight black hair, porcelain skin, hourglass silhouette. Outfit: White fitted crop top, cropped metallic silver bomber jacket, black leather mini skirt, sheer white mesh sleeve on left arm, knee-high white boots. Action: Laughing and clapping naturally while watching the game, briefly waving toward the ice. Camera: Static broadcast camera from rink-side seating angle. Audio: Arena crowd roar, skate sounds, commentator saying, "What an incredible atmosphere tonight." Environment: "Arif N" physically engraved into the metallic armrest of the rink-side seat. The text must exist physically in the environment, not as a digital overlay.
[0:02–0:05] THE RINK ACCESS BREACH: Visual: Dynamic rink-side tracking shot. Subject: Same woman now wearing a heavy white faux-fur jacket over her outfit. Action: Walks down the rink-side aisle, removes the faux-fur jacket and tosses it over the rink boards, then quickly climbs over the barrier and steps onto the ice surface. Camera: Smooth cinematic tracking movement with realistic live-broadcast framing. Audio: Jacket impact sound, shocked crowd reaction, excited commentary.
[0:05–0:09] THE HIGH-SPEED ICE SPRINT: Visual: Wide low-angle lateral tracking shot across the rink. Action: Subject sprints rapidly over the ice in white figure-style heeled boots, hair flowing dramatically, mesh sleeve whipping in motion. Focused intense expression. Ice spray kicks behind every stride. Camera: Steadicam movement with subtle realistic broadcast shake and micro-vibrations. Audio: Loud skate scraping, swelling crowd roar, commentator shouting, "She's on the ice!"
[0:09–0:13] THE PRECISION SLAPSHOT: Visual: Mid-wide cinematic shot facing the hockey goal. Action: Subject plants left foot firmly on the ice, draws hockey stick back aggressively, and delivers an explosive slapshot. The puck rockets into the top corner of the net past the goalie. Environment: "Arif N" physically engraved onto the hockey stick shaft and embossed onto the puck surface. The text must appear naturally integrated into real-world objects, not as digital graphics. Camera: Fast whip-pan following puck momentum into the net. Audio: Sharp crack of stick impact, goal horn blasting, commentators screaming "SCORES!", massive crowd eruption.
[0:13–0:15] THE VICTORY CELEBRATION: Visual: Hard cut to medium close-up with dramatic bokeh arena background. Action: Subject spins toward camera with a joyful victorious smile, raises hockey stick briefly, then pushes open palm toward the lens in a playful blocking motion. Camera: Static close-up. Autofocus struggles against the approaching hand, creating heavy foreground blur and authentic camera breathing. Audio: Deafening arena roar, goal horn echo, commentators yelling excitedly.
TECHNICAL SPECS: 9:16 vertical format, ultra photorealistic detail, authentic NHL broadcast realism, cinematic sports lighting, subtle film grain, realistic motion blur, shallow depth of field, natural arena reflections, dynamic crowd atmosphere, ESPN-style hockey broadcast presentation, incredibly detailed textures, realistic ice reflections and skate marks. The text "Arif N" must always appear as a physical engraved, stitched, embossed, or branded part of real-world objects and environments — never as a digital overlay.