Have you seen Google’s latest text-to-image generator, Imagen?
It’s an AI system that creates photorealistic images from input text.
As the paper states, ‘Imagen builds on the power of large transformer language models in understanding text and hinges on the strength of diffusion models in high-fidelity image generation’
‘Human raters strongly prefer Imagen over other methods, in both image-text alignment and image fidelity’
Most importantly, I love how they acknowledge and call out limitations and impact
‘The potential risks of misuse raise concerns regarding responsible open-sourcing of code and demos. At this time we have decided not to release code or a public demo’
Additional reading:
Paper: https://imagen.research.google/paper.pdf
Results: https://imagen.research.google/
Diffusion models:
Diffusion Models Beat GANs on Image Synthesis – https://arxiv.org/abs/2105.05233
Diffusion Models made easy – https://towardsdatascience.com/diffusion-models-made-easy-8414298ce4da
***********************************************************
#reviewswithranjani #AI #Imagen
#Technology | #Books | #BeingBetter