The full prompt
a single raindrop slides slowly down a foggy window at night, camera holds completely static, neon city lights blurred in the background, slow motion 120fps look, single vertical motion only, shallow depth of field, cool blue and amber color grading, no camera shake, 5-second cinematic clip
Structure breakdown
The reason this prompt works is the combination: static camera + single slow micro motion. AI video models hit 70%+ success here. Add multi-axis camera motion or multiple actions and the success rate often drops below 20%.
3 rewrites
a young woman in a black trench coat walks slowly forward through a rain-soaked Tokyo alley at night, camera tracks horizontally to the right at the same walking pace, neon reflections on wet ground, shallow depth of field, steady gimbal feel, 6-second cinematic shot, no jitter
Tracking shot: subject walks + camera follows. "At the same walking pace" stops the camera from overtaking the subject.
misty mountain valley at sunrise, slow steady drone dolly forward over the treetops, sunlight breaking through low clouds, very gentle pacing over 8 seconds, cinematic wide shot, smooth motion, no jitter
Drone push: "slow steady drone dolly forward" + "over 8 seconds" controls the push speed.
extreme close-up of melted chocolate slowly dripping onto a glossy croissant, camera holds completely static, single dripping motion only, warm side light from the right, shallow depth of field, slow motion 120fps look, 3-second clip, no shake
Food macros work as static + single slow action. "Single dripping motion only" blocks stirs and other parasitic motions.
Common pitfalls
"Moves", "interacts" are too soft. Use "slides slowly", "pours", "turns to the left".
"Walks in, sits down, picks up" in 5 seconds will fail. One core action per shot.
Without "3-second / 5-second clip", the model invents a default pacing that is usually too fast.
Without "no camera shake / steady", most models add jitter that breaks the still-life mood.
Heavy subject motion + heavy camera motion collapses the shot.
"Masterpiece, 8k, best quality" do almost nothing for video. "Cinematic, shallow depth of field, color grading" is enough.
Model comparison
| Model | Typical duration | Strength | Rating on this shot |
|---|---|---|---|
| Runway Gen-3 | 5–10 s | Cinematic, tracking | ★★★★★ static-shot king |
| Pika 2.x | 3–5 s | Short atmospheric | ★★★★ great in 3–5 s |
| Kling 2.x | 5–10 s | People, ads | ★★★★ excellent non-English support |
| Seedance 2.0 | 5–8 s | Widescreen cinematic | ★★★★★ best for 16:9 |
| Hailuo / MiniMax | 5–6 s | People, landscapes | ★★★ long descriptive prompts |
Where this skeleton fits
Use this skeleton for: opening atmosphere shots, mid-piece product ad b-roll, social-media story headers, trailer transitions. The general rule: static camera + single slow action is the highest-success AI video shape. For multi-shot stories, edit several stills together rather than asking one prompt to deliver a complex sequence.