Why product shots are harder than portraits
Three reasons. First, material credibility: glass needs refraction, metal needs a believable reflection, fabric needs drape, food needs sheen. One material off and the whole shot looks fake. Second, layout discipline: e-commerce hero shots need clean copy space, brand posters need a headline slot, luxury ads need negative breathing room. Third, lighting restraint: everyday photography rewards more light, but commercial product photography uses very few, very precise sources.
All three come from real-studio logic. If you only write "a perfume bottle", the model gives you a stock image. To get a commercial product shot, every studio convention — light position, background, negative space, lens — has to be made explicit.
Three product-shot skeletons
E-commerce hero shot (white / soft background)
[product name + material + color] + on a [background] + [light setup] + [composition with negative space] + ultra sharp edges, color accurate, no text
The hero shot puts the product on 60–70% of the canvas with clean negative space and a label-free finish. Specify background color (cream / white / soft gray), light position (top-left soft box / overhead diffused), negative-space direction (bottom third / right side), and finish with "no text" to suppress fake labels.
Brand poster (atmospheric / contextual)
[product] in [scene with mood props] + [stylized lighting] + [bold composition] + [palette] + commercial poster aesthetic + space for headline at [position]
Restraint matters. One coffee cup and an old book reads as a brand poster; an entire cafe interior reads as stock. Keep the palette directional (warm earth tones / cool minimalist palette) and always state the headline slot ("space for headline at top").
Luxury close-up (minimal / material detail)
extreme close-up of [product material detail] + [single hard light source] + [dark or monochrome background] + sub-millimeter texture + cinematic color grading + minimal composition
Luxury equals less. A single hard light from above-left, a deep navy or matte black background, sub-millimeter texture and 90% negative space. Strip any of these and the image loses its premium read.
Structure diagram for a product shot
Wrong vs. right examples
✗ Wrong
luxury perfume bottle, beautiful, premium, expensive, high-end, glossy, shiny, gorgeous, masterpiece
Pure adjectives. No material, no light, no background, no negative space. "Luxury" is the hardest abstract word for any model to render and usually returns the most generic look.
✓ Right
transparent perfume bottle with amber liquid and brushed gold cap, on a cream linen surface, soft top-left light, centered composition with bottom third negative space, glass refraction and metal grain, ultra sharp edges, no text --ar 4:5
Bottle material, liquid color, cap finish, background, light position, composition, negative space, sharpness and parameters all present. To switch to a night-tone version, swap cream for charcoal — nothing else changes.
5 real samples
matte green tea can with minimal Japanese typography, frozen condensation droplets, single ice cube melting beside it, white seamless background, overhead diffused light, centered composition with bottom third negative space, sharp edges, color accurate, no extra text --ar 1:1
Condensation droplets and a melting ice cube tell the brain "cold" without spelling it out. No extra text blocks the model from inventing fake labels.
blush pink lipstick standing on a rose marble slab, soft morning window light from the right, scattered dried petals around the base, warm muted palette, bold centered composition, space for vertical headline on the left third, commercial poster aesthetic --ar 3:4
Two scene props only (marble + petals). The explicit "space for vertical headline on the left third" reserves real copy area.
extreme close-up of a brushed titanium smartwatch case, single hard light from above-left, matte black background with faint vertical gradient, micro reflection on the bezel, sub-millimeter brushed grain visible, cinematic dark color grading, 90% negative space --ar 16:9
Titanium + brushed grain + micro reflection is the material trifecta. 90% negative space tells the model to leave everything else alone.
flatlay of a beige linen shirt neatly folded, on a warm sand-colored fabric backdrop, soft overhead diffused light, top-down angle, centered with two leather buttons and a small price tag as styling props, muted earth palette, fashion catalog style --ar 4:5
Flatlays need an explicit angle (top-down) and a style anchor (fashion catalog). Keep styling props to two or fewer.
close-up of a glossy chocolate croissant on rustic ceramic plate, golden flaky layers with visible butter sheen, single warm side light from the right, soft shadow on the left, shallow depth of field, espresso cup blurred in background, food magazine cover style --ar 4:5
Food magazine style = directional warm light + shallow DoF + plump specular highlights. Replace "delicious" with concrete cues like "butter sheen" and "flaky layers".
5 common pitfalls
Without an explicit background, models default to leaves, water, neon — anything that kills the commercial read.
Most models will scribble fake English on the bottle. Add "no extra text on package" on SDXL/Flux, or --no text on Midjourney.
"Soft light + warm light + golden hour + rim light + hair light" produces a lighting traffic jam. Use one key + at most one fill.
"With negative space" is too loose. Say "bottom third negative space" or "right third for headline".
Luxury splits into two schools — minimal (white/gray/clean) and material-heavy (dark/single light/close-up). Pick one; both beat "luxury".
Aspect ratios and platform parameters
| Use | Aspect | Midjourney | SDXL size |
|---|---|---|---|
| E-commerce square | 1:1 | --ar 1:1 --v 6 --s 200 | 1024×1024 |
| E-commerce portrait | 4:5 | --ar 4:5 --v 6 --s 200 | 1024×1280 |
| Brand poster | 3:4 | --ar 3:4 --v 6 --s 300 | 1024×1360 |
| Horizontal banner | 16:9 | --ar 16:9 --v 6 --s 250 | 1820×1024 |
| Luxury close-up | 16:9 / 3:2 | --ar 16:9 --v 6 --s 400 | 1820×1024 |