A hand slowly enters the frame naturally and gently taps the smartwatch screen once. The display softly illuminates with a subtle ripple-style activation animation spreading across the screen surface. Warm sunlight shifts slightly through the curtains, creating delicate moving shadows across the wooden nightstand. Camera begins as a static overhead composition, then slowly pushes into a smooth 2x zoom toward the watch face after the tap interaction. Preserve original composition, watch design, lighting, wood textures, colors, reflections, and Scandinavian aesthetic. Smooth natural motion, calm premium lifestyle realism, soft cinematic atmosphere, aspect ratio 16:9.
Use Image1 as the only identity reference. Strictly preserve his real face shape, facial features, skin tone, hairstyle, and body proportions. Use Image2 as the only armor design reference. Strictly preserve the dark volcanic stone-textured mecha armor with glowing molten orange lava crack energy lines, hexagonal arc reactor chest core emitting orange-white light, battle-worn charcoal black segmented plates with angular geometric cuts, illuminated eye visor with orange-red glow, and massive bulked shoulder pauldrons with spiked protrusions.
Cinematic VFX sci-fi transformation shot on Arri Alexa Mini LF with a low-angle steady-cam orbit. The scene features a rugged soldier Image1 standing in a dusty desert hangar; as he activates a wrist-mounted neural trigger, heavy volcanic dark-stone armor plates cracked with glowing molten lava veins and hydraulic pistons surge upward from his boots, encasing his body in a lava-core mecha frame. The camera circles closely as orange-lit copper wiring and obsidian-dark steel framework interlock with violent precision, capturinghis grimace of physical strain. The process accelerates with superheated steam and ember sparks venting from shoulder joints as a massive angular chest plate with a glowing hexagonal arc core slams shut. The transformation concludes with a deep volcanic resonance as a reinforced tactical helmet with burning orange eye slits locks into place. The atmosphere is gritty and visceral, driven by a rhythmic crescendo of clanking obsidian metal, crackling energy, and pressurized molten air, ending in a powerful, grounded stance with lava crack lines pulsing across the full armor silhouette
Convert each image in the frame into video and connect all clips to make an emotional vlog-style video. Natural hand shake. A young woman. Emotional BGM.
Generate a 15s cinematic sequence 🎬🔥
This is where it comes to life.
Here’s the exact prompt I used 👇
An elegant performer inspired by a legendary moonwalk dancer, dressed in a sleek black outfit with hat, transforming into a diamond trophy through motion
His moonwalk generates energy trails that evolve into abstract light and crystallize into a diamond sculpture
Luxury cinematic: slow dolly, orbit shots, macro details, seamless transitions, final push-in
Scene
dark premium studio, black reflective floor, golden spotlight, floating particles
Ultra high-end commercial, diamond reflections, golden light particles, smooth VFX transitions
- Movement phase wide shot, smooth moonwalk, subtle light trails appear
- Energy phase, camera closer, golden trails intensify, lingering in space
body begins dissolving into light
- Abstraction phase, camera orbit, body becomes swirling luminous ribbons, abstract silhouette forms
- Form phase, energy condenses into faceted diamond structure
limbs sharpen into reflective geometry
- Design phase, full diamond figure locked in iconic pose
light glints across surfaces
Final shot, trophy on pedestal, slow push-in, luxury finish
CUT
Unsteady handheld phone footage shot from [SHOOTER VANTAGE / SHOOTING POSITION] at [TIME OF DAY], [LENS/PHONE PHYSICAL POSITIONING DETAIL], producing [PRIMARY OPTICAL ARTIFACT, e.g. a faint smear of condensation fog, a smudge of fingerprint oil, a streak of rain] across the [FRAME LOCATION, e.g. lower quarter, right edge, top third] of the frame and intermittent [LIGHT FLARE TYPE, e.g. glare blooms, lens halation, prismatic streaks] from the [DOMINANT LIGHT SOURCE] reflecting back against the shooter's phone camera, the image is flat, auto-white-balance toggling between [COLOR CAST 1, e.g. a cool blue cast] and [COLOR CAST 2, e.g. an orange-amber push] as [LIGHT MIXING SCENARIO, e.g. ambient street light mixes with interior glow], color entirely ungraded and slightly washed out.
At 0s the camera swings erratically [DIRECTION, e.g. left-to-right, low-to-high, side-to-side] hunting for the subject, frame wildly off-center, catching [BLURRED ENVIRONMENTAL DETAILS visible during the search] before the autofocus locks momentarily on [SUBJECT 1: physical build, age range, hair, distinguishing features] [SUBJECT 1 POSITION/LOCATION IN SCENE], [SUBJECT 1 WARDROBE: detailed top-to-bottom clothing description with fabric, color, fit, accessories, and posture/demeanor]. [OPTIONAL SUBJECT 2: positioning relative to subject 1, full physical description, wardrobe, body language toward subject 1, exuding [RELATIONSHIP/INTERACTION DYNAMIC]].
At 2s the autofocus drifts off [the pair / the subject] and [BACKGROUND ELEMENT] sharpens while [the subject(s)] go soft and blurry, the camera operator [SHOOTER REACTION, e.g. whispering urgently off-mic, cursing under breath, holding their breath] before it snaps back to focus at 4s with a visible hunting pulse.
[INTERRUPTION 1, e.g. a pedestrian, a server, a passing vehicle] [crosses / passes / blocks] the frame at 5s, briefly obscuring [the subject(s)] behind [OBSCURING ELEMENT], the shooter dipping and angling to reacquire them through [SHOOTING MEDIUM, e.g. the glass, the foliage, between parked cars].
At 6s [INTERRUPTION 2, e.g. a camera flash from another phone further down the sidewalk, a passing headlight wash, a neon sign flicker] [REFLECTION BEHAVIOR] creating a hot chromatic flare across the [FRAME EDGE], cyan and red fringing visible on the high-contrast [HIGH-CONTRAST EDGE DETAIL].
[SUBJECT NAME/IDENTIFIER] [REACTION VERB, e.g. glances, flinches, turns] briefly toward [SHOOTING MEDIUM] at 8s, [REACTION DETAIL: expression shift, body movement, instinctive shielding gesture] before [returning to conversation / stiffening / leaning back into shadow]; the shooter holds completely still for two seconds, barely breathing.
At 10s the phone [CAMERA SHIFT, e.g. dips, tilts, drifts] as the operator shifts weight, the frame [FRAMING ERROR, e.g. cutting off both subjects at the shoulders, clipping a head, losing them entirely] momentarily, [OPTICAL ARTIFACT EVOLUTION, e.g. the window fog smearing the bottom edge thicker], before rising again to reframe them, composition still imperfect, slightly tilted, [SUBJECT BODY PART] clipped by the frame edge.
[AMBIENT AUDIO BED, e.g. distant traffic noise, low restaurant chatter, lobby murmur] bleeds through the audio throughout, [MICROPHONE INTERFERENCE, e.g. wind buffeting the microphone in a low rhythmic thump, fabric rustle as the operator shifts], faint [BACKGROUND VOICES OR SOUNDS] from [SOURCE, e.g. other pedestrians on the sidewalk, nearby tables], and at 12s [HALF-AUDIBLE EVENT, e.g. an excited whisper from someone just off camera says something unintelligible], followed by [SECONDARY AUDIO EVENT, e.g. the rapid-fire click-burst of a DSLR shutter from nearby, a car door slamming, a phone notification chime].
At 14s [ENVIRONMENTAL DETAIL, e.g. the interior light flickers slightly, possibly a server passing] and the autofocus hunts one final time before the clip ends at 15s with the frame still imperfectly held on [the subject(s)] through the [SHOOTING MEDIUM CONDITION, e.g. foggy reflection-streaked glass, leaf-broken sightline], [SUBJECT QUALITY 1] and [SUBJECT QUALITY 2] both visible but never cleanly captured.
Use the provided character sheet @[image1] as reference.
Create a cinematic character introduction video.
Open with the character looking into camera and speaking naturally, introducing herself in her own words.
Do not treat the sheet as a single image. Use its elements as separate shots.
Structure:
detail → identity → presence → full reveal
Make the character active:
she moves, reacts, interacts with her environment and prop while talking
short, natural gestures, small shifts, purposeful motion
Show acting range:
subtle emotional shifts while speaking (confidence, hesitation, curiosity, intensity)
express through micro-expressions, eyes, tone, and body language
Include:
face close-ups, outfit/material details, prop usage, expressive performance moments
Keep everything grounded and realistic.
Camera:
controlled, minimal movement (soft push-ins, light tracking, subtle handheld)
Lighting:
cinematic and consistent
End on a confident mid or full shot, character fully established.
character
ip-design
animation
portrait
image-to-video
1️⃣GPT Image 2.0で12フレームの画像生成
プロンプト:
A 12-frame collage of candid, emotional snapshots of a young Japanese woman traveling alone in Hawaii, casually captured on a smartphone.
Each frame feels like a fleeting personal memory — imperfect, sun-drenched, intimate, and unposed.
The woman has a naturally curvy figure with a soft, feminine silhouette, subtly emphasizing her bust without exaggeration. Her presence feels real and unstyled, like a private photo album.
Scenes include: walking barefoot on the beach, ძლიერი日差しの中での海辺、palm trees swaying, overexposed ocean reflections, small local cafés, a modest motel room, sunset الساحل, night markets, views from inside a moving car.
Shot with a smartphone aesthetic: slight motion blur, soft focus, blown-out highlights from tropical sunlight, lens flare, sun glare, high ISO noise at night, uneven framing, accidental cropping.
Composition feels random and spontaneous — subject sometimes off-center, partially cut off, mid-motion, or obscured by light leaks.
Lighting varies: harsh midday sun, warm golden hour glow, deep sunset tones, humid night street lighting.
Color grading: faded cinematic tones, slightly desaturated with warm highlights, nostalgic film-like look, subtle grain, lifted blacks.
Emotion: solitude, fleeting youth, bittersweet nostalgia, quiet introspection, like memories from a trip taken alone.
Layout: 12 images arranged in a loose, imperfect collage grid, slightly tilted and misaligned like a scrapbook.
No text, no watermark.
2️⃣Seedance 2.0で動画生成(i2v)
12フレームが収まった画像一枚を参照して動画化する。
プロンプト:
<<<image_1>>> の各フレームを動画化して繋げる。エモーショナルなvlog風動画。自然な手ブレ。若いzypisyの女性。エモーショナルなBGM。
The pizza slice starts to slowly spin while the ingredients separate gently and precisely, maintaining alignment and scale. The motion is smooth, rich, and controlled with no extra effects.
Image 1 Prompt: A high-end food product photograph of a loaded pizza slice centered against a warm rustic wooden-toned background. The slice features a thick golden crust with a crispy edge, stretchy melted mozzarella cheese with visible pull strands, rich tomato sauce underneath, and toppings including pepperoni slices, black olives, capsicum, and herbs. The textures are highly detailed with glossy melted cheese, slightly oily toppings, and realistic baked crust texture. Soft cinematic lighting creates depth and subtle shadows. Ultra-sharp focus, premium food advertisement style, hyper realistic, 8K.
Continue seamlessly from @ Video1
extending from the last frame. the subject
@ Image1 meets his worried family who was missing and they are happy to see him. they gently scold @ Image1
for disappearing last night and finally he meets his girlfriend and they share a hug and ends with a happy note. multi shot cinematic experience
animation
fantasy
adventure
cinematic
image-to-video
A wide shot of a busy gym floor where everyone is doing a different exercise. In the foreground, someone struggles with a heavy bench press, while in the background, a person runs on a treadmill, another does yoga on a mat, and a fourth person drinks from a water bottle while checking their phone.
Maintain the 5-panel split-screen layout, the red background, and all text overlays exactly as they are. In all 5 panels simultaneously, the man slowly and smoothly turns his head from looking to the right to facing directly forward at the camera. The movement must be perfectly synchronized across all panels. The camera framing in each panel remains completely static. Subtle light reflections shift on the red-tinted glasses as his head turns.
[00:00 – 00:03 | OVER-THE-SHOULDER MEDIUM SHOT - SLOW PUSH IN]
Camera sits just behind and above the red-haired girl's left shoulder, her wild crimson waves and the neck of her acoustic guitar filling the foreground in soft bokeh. Across from her, Ruby, brown curls, dark tee, quilt-covered bed at her back, comes into sharp focus. The warm amber glow of the bedside lamp halos the room. The red-haired girl's strumming hand begins to move. She opens her mouth and sings softly: "Ruby… oh Ruby…" The camera breathes forward almost imperceptibly, closing the emotional distance.
[00:03 – 00:06 | EXTREME CLOSE-UP - GUITAR HAND, LOW ANGLE]
Cut to a ground-level macro shot angled upward at the strumming hand against the guitar body. Fingers move with loose, unhurried grace across the strings. The warm lamplight catches the shimmer of each string vibration. The faint resonance of the acoustic body fills the frame. Her voice drifts over: "…do you know how much you mean to me?"
[00:06 – 00:10 | TIGHT CLOSE-UP - RUBY'S FACE, STATIC]
Hard cut to Ruby's face centered, lit entirely by the golden lamp behind her. Her lips are parted just slightly, cheeks flushed, eyes glistening with the quiet weight of being truly seen. A slow, disbelieving smile creeps in at the corner of her mouth. She doesn't speak. She doesn't need to. The song does it all: "Ruby… my Ruby…"
[00:10 – 00:13 | TWO-SHOT MEDIUM - WIDE, HANDHELD DRIFT]
Camera floats back on a subtle handheld drift, revealing both girls in the full frame for the first time the cluttered bookshelves, the faded quilt, the soft chaos of a lived-in room. The red-haired girl leans slightly forward as she reaches the final line, voice barely above a whisper: "…I'll love you till the very end." The last chord rings out and hangs in the air.
[00:13 – 00:15 | EXTREME CLOSE-UP - RUBY'S EYES, SLOW RACK FOCUS]
The final cut lands on Ruby's eyes alone. Focus pulls soft… then razor sharp. A single tear catches the lamplight on her lower lash line, not yet fallen. She exhales.