[HTML payload içeriği buraya]
31.2 C
Jakarta
Sunday, May 17, 2026

Bringing 3D shoppable merchandise on-line with generative AI


Third technology: Generalizing with Veo

Our newest breakthrough builds on Veo, Google’s state-of-the-art video technology. A key power of Veo is its skill to generate movies that seize complicated interactions between gentle, materials, texture, and geometry. Its highly effective diffusion-based structure and its skill to be finetuned on quite a lot of multi-modal duties allow it to excel at novel view synthesis.

To finetune Veo to remodel product photos right into a constant 360° video, we first curated a dataset of thousands and thousands of top of the range, 3D artificial belongings. We then rendered the 3D belongings from numerous digicam angles and lighting situations. Lastly, we created a dataset of paired photos and movies and supervised Veo to generate 360° spins conditioned on a number of photos.

We found that this method generalized successfully throughout a various set of product classes, together with furnishings, attire, electronics and extra. Veo was not solely in a position to generate novel views that adhered to the out there product photos, nevertheless it was additionally in a position to seize complicated lighting and materials interactions (i.e., shiny surfaces), one thing which was difficult for the first- and second-generation approaches.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Latest Articles