← Back to Journal·// AI & IMAGERY·8 MIN READ·MAY 02, 2026

Why generic AI photography ruins your product (and how compositing fixes it).

Pure-generation AI tools regenerate your bottle, your label, and your logo into nonsense. We dig into why compositing is the only viable approach for e-commerce.

Niko Bergsson
AI ENGINEER · SHELFGEN
Why generic AI photography ruins your product (and how compositing fixes it).
FIG. 01 — Generative AI reinterpreting a product label as illegible text.

Pure text-to-image models are extraordinary at generating plausible scenes. They are extraordinary in the wrong way for e-commerce. A buyer who clicks a listing expects the photograph to match the physical object they will receive — down to the typography on the label and the curve of the cap.

// SECTION 01

What generation gets wrong

Run a state-of-the-art image model on the prompt “a 30ml amber serum bottle with the label 'Foundry Botanical Oil'.” You will get an amber bottle. The label will say something close to “Foundry Botinacal” or “Foundary Bot Oil.” The cap will be a vague approximation. Reverse-image search will not find your real product anywhere in the result.

This is not a quality issue that scales away with bigger models. Generative models are trained to produce plausible objects, not exact objects. Your product is not a category; it is a specific SKU with a specific identity. The two goals are in direct tension.

// SECTION 02

Why compositing works

Compositing inverts the problem. The product itself is treated as fixed input — pixels that pass through the pipeline untouched. What gets generated is the scene around the product: the marble counter, the kitchen window, the lifestyle context. The product never gets reinterpreted.

0
Label re-generations across 18k Shelfgen outputs
97%
Listings accepted on first Amazon submission
Faster than re-shooting per SKU
Generation makes everything plausible. Compositing makes everything verifiable. E-commerce needs verifiable.
// SECTION 03

The hand-off

The cleanest pipeline is a hybrid: the seller uploads a single clean source photo (phone-quality is fine), an automated cutout isolates the product, and the model only renders the surrounding scene. The product pixels are stitched back into the generated scene with edge-aware blending. Labels stay readable. Logos stay sharp. Marketplace compliance stays intact.

Composited generation, never re-rendered

Shelfgen composites — your product pixels are never reinterpreted. 5 free AI credits, no card.

Start free
AICompositingProduct photographyGenerative imaging