• A_Very_Big_Fan@lemmy.world
    8 months ago

    Mhm, I’m sure that’s why you couldn’t find a single sentence about compositing images.

    DALL·E 2 uses a diffusion model conditioned on CLIP image embeddings, which, during inference, are generated from CLIP text embeddings by a prior model.
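
To make the data flow in that sentence concrete, here is a rough toy sketch of the three stages it names: text embedding, prior, then a decoder conditioned on the resulting image embedding. Everything in it is a stand-in I made up for illustration (`clip_text_embed`, `prior`, `diffusion_decoder`, `generate`, the embedding size, the update rule); none of it is OpenAI's code or a real CLIP API, and the "denoising" loop is only a placeholder for a learned model. The only thing it is meant to show is that the decoder's inputs are noise and an embedding, not a set of source images.

```python
import hashlib
import random

EMBED_DIM = 8  # toy size, chosen arbitrarily for this sketch


def clip_text_embed(caption: str) -> list[float]:
    """Hypothetical stand-in for a CLIP text encoder.

    Produces a deterministic pseudo-embedding from the caption text.
    """
    seed = int.from_bytes(hashlib.sha256(caption.encode()).digest()[:8], "big")
    rng = random.Random(seed)
    return [rng.uniform(-1.0, 1.0) for _ in range(EMBED_DIM)]


def prior(text_embedding: list[float]) -> list[float]:
    """Hypothetical stand-in for the prior: text embedding -> image embedding.

    The scaling here is an arbitrary placeholder for a learned mapping.
    """
    return [0.5 * x for x in text_embedding]


def diffusion_decoder(image_embedding: list[float], steps: int = 10,
                      size: int = 4, seed: int = 0) -> list[float]:
    """Hypothetical stand-in for the diffusion decoder.

    Starts from random noise and refines it step by step, conditioned
    only on the image embedding. No existing images are read or merged.
    """
    rng = random.Random(seed)
    pixels = [rng.gauss(0.0, 1.0) for _ in range(size * size)]
    for _ in range(steps):
        # Placeholder update: pull each noisy value toward the conditioning.
        pixels = [0.5 * p + 0.5 * image_embedding[i % EMBED_DIM]
                  for i, p in enumerate(pixels)]
    return pixels


def generate(caption: str) -> list[float]:
    """Caption -> text embedding -> image embedding -> decoded 'image'."""
    return diffusion_decoder(prior(clip_text_embed(caption)))
```

Calling `generate("a corgi playing a trumpet")` returns a flat list of 16 toy "pixel" values derived from noise plus the embedding.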

    You’re either projecting or being dishonest.