“Permission is hereby granted” comes from Suno, an AI engine that creates new songs on demand.

  • BrikoX@lemmy.zipOPM
    8 months ago

    The Bark model uses ChatGPT as a base. So it is an LLM; it’s just tailored for text-to-audio.

    • Lmaydev@programming.dev
      8 months ago

      Interesting.

      Suno describes the service as a collaboration between OpenAI’s ChatGPT (for lyric writing) and Suno’s music generation model.

      That would seem to suggest it’s two models and the audio is generated by the second.

      Edit: from GitHub

      Bark is a fully generative text-to-audio model developed for research and demo purposes. It follows a GPT-style architecture similar to AudioLM and Vall-E and uses a quantized audio representation from EnCodec.
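
To make "quantized audio representation" a bit more concrete, here is a rough toy sketch of residual vector quantization, the general idea EnCodec-style codecs build on: each audio frame becomes a short list of integer tokens, and a GPT-style model can then predict those tokens the way a text model predicts words. This is not Bark's or EnCodec's actual code. The function names (`nearest`, `rvq_encode`, `rvq_decode`) and the tiny hand-written codebooks are made up for illustration; real codebooks are learned and far larger.

```python
# Toy residual vector quantization (RVQ). Illustration only:
# NOT Bark's or EnCodec's real implementation, and the codebooks are invented.

def nearest(codebook, vec):
    """Return the index of the codebook entry closest to vec (squared distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )

def rvq_encode(frame, codebooks):
    """Turn one continuous frame into one integer token per codebook."""
    residual = list(frame)
    tokens = []
    for cb in codebooks:
        idx = nearest(cb, residual)
        tokens.append(idx)
        # The next codebook only has to describe what this one missed.
        residual = [r - c for r, c in zip(residual, cb[idx])]
    return tokens

def rvq_decode(tokens, codebooks):
    """Sum the selected codebook entries to approximate the original frame."""
    out = [0.0] * len(codebooks[0][0])
    for idx, cb in zip(tokens, codebooks):
        out = [o + c for o, c in zip(out, cb[idx])]
    return out

# Hypothetical 2-D codebooks: a coarse one, then a finer one for the residual.
CODEBOOKS = [
    [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]],
    [[0.0, 0.0], [0.25, 0.0], [0.0, -0.25]],
]

frame = [1.2, 0.9]                       # stand-in for one frame of audio features
tokens = rvq_encode(frame, CODEBOOKS)    # discrete tokens a GPT-style model could predict
approx = rvq_decode(tokens, CODEBOOKS)   # lossy reconstruction of the frame
print(tokens, approx)
```

The point of the sketch is only that the audio side is handled as sequences of discrete tokens, which is why a GPT-style architecture fits without the model being ChatGPT itself.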