Machine learning algorithms were already able to label objects in images, and now they were learning to convert those labels into natural language descriptions.
DIFFUSION MODELS are generative models that learn to create images by gradually noising and denoising their training samples.
Diffusion models are like autoencoders, which transform input data into an embedding representation and then reproduce the original data from the embedding information.
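To make the noising and denoising idea concrete, here is a minimal sketch of the forward (noising) step in plain Python. It assumes the closed-form formulation commonly used for denoising diffusion models, with a linear noise schedule; the function names, the schedule values, and the tiny four-pixel "image" are illustrative assumptions, not details from these slides. In a real model, a neural network is trained to predict the added noise. Here we simply reuse the known noise to show that removing it recovers the original sample.

```python
import math
import random

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Assumed linear schedule: how much noise is added at each step."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]

def signal_fraction(betas, t):
    """Cumulative product of (1 - beta) up to and including step t."""
    alpha_bar = 1.0
    for beta in betas[: t + 1]:
        alpha_bar *= 1.0 - beta
    return alpha_bar

def add_noise(pixels, t, betas, rng):
    """Jump straight to noise level t: scaled-down signal plus Gaussian noise."""
    alpha_bar = signal_fraction(betas, t)
    noise = [rng.gauss(0.0, 1.0) for _ in pixels]
    noisy = [
        math.sqrt(alpha_bar) * p + math.sqrt(1.0 - alpha_bar) * n
        for p, n in zip(pixels, noise)
    ]
    return noisy, noise

def remove_known_noise(noisy, noise, t, betas):
    """Invert add_noise when the noise is known (a trained model would predict it)."""
    alpha_bar = signal_fraction(betas, t)
    return [
        (y - math.sqrt(1.0 - alpha_bar) * n) / math.sqrt(alpha_bar)
        for y, n in zip(noisy, noise)
    ]

rng = random.Random(0)
betas = linear_beta_schedule(1000)
pixels = [0.9, -0.5, 0.1, 0.3]  # stand-in for a flattened image scaled to [-1, 1]
noisy, noise = add_noise(pixels, 500, betas, rng)
recovered = remove_known_noise(noisy, noise, 500, betas)
```

At early steps almost all of the original signal survives; by the last step the sample is close to pure noise, which is why generation can start from random noise and denoise step by step.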
ORIGINAL GIOTTO
FAKE GIOTTO
A new AI system:
- Converts text prompts into images.
- Maintains semantic consistency in the images it creates.
- Can generate multiple variations of the same image.
- Can edit an existing image.
WHAT IS DALL·E 2?
DALL·E 2 is a new AI system that can create realistic images and art from a description in natural language.
Can we make our prompt even better by combining image with text generation?
Yes we can!
Here are a few experiments that we’re running to improve the quality of our prompts.
We evaluate the quality of each completion (the AI-generated product description) by generating an image from it with DALL·E 2. Then we compare the visual similarity between the generated image and the real photo of that product to see which completion works best.
ORIGINAL IMAGE
Synthetic Images (generated using DALL·E 2)
We compare synthetic images with the real product image to evaluate the best prompt.
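The comparison step can be sketched as a simple ranking. The slides do not say which similarity measure is used, so this example assumes cosine similarity over flattened feature vectors; the function names, the completion labels, and the four-number "images" are hypothetical placeholders for real image features.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (assumed metric)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as having no similarity
    return dot / (norm_a * norm_b)

def rank_completions(real_image, generated_images):
    """Score each completion by how close its generated image is to the real photo.

    generated_images maps completion text to the feature vector of the image
    generated from that completion. Returns (text, score) pairs, best first.
    """
    scores = {
        text: cosine_similarity(real_image, image)
        for text, image in generated_images.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)

# Hypothetical feature vectors standing in for the real photo and two synthetic images.
real_photo = [0.9, 0.1, 0.8, 0.2]
candidates = {
    "completion A": [0.8, 0.2, 0.7, 0.3],
    "completion B": [0.1, 0.9, 0.2, 0.8],
}
ranking = rank_completions(real_photo, candidates)
best_text, best_score = ranking[0]
```

In practice the vectors would come from an image embedding model rather than raw numbers, but the ranking logic stays the same: the completion whose image scores highest against the real photo is treated as the best prompt.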