About 7 results
Open links in new tab
  1. Imagen: Text-to-Image Diffusion Models

    Imagen is an AI system that creates photorealistic images from input text Visualization of Imagen. Imagen uses a large frozen T5-XXL encoder to encode the input text into embeddings. A …

  2. Imagen Editor & EditBench

    A key challenge is to generate edits that are faithful to input text prompts, while consistent with input images. We present Imagen Editor, a cascaded diffusion model built by fine-tuning …

  3. Imagen Video

    Generative modeling has made tremendous progress, especially in recent text-to-image models. Imagen Video is another step forward in generative modelling capabilities, advancing text-to …

  4. To probe image quality, the rater is asked to select between the model generation and reference image using the question: “Which image is more photorealistic (looks more real)?”.

  5. cascade of video diffusion models. By extending the text-to-image diffusion models of Imagen (Saharia et al., 2022b) to the time domain, and training jointly on video and images, we …