• @[email protected]
      link
      fedilink
      English
      -2
      edit-2
      4 months ago

      Mhm, I’m sure that’s why you couldn’t find a single sentence about compositing images

      DALL·E 2 uses a diffusion model conditioned on CLIP image embeddings, which, during inference, are generated from CLIP text embeddings by a prior model.

      You’re either projecting or being dishonest