They are referencing this paper: LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset from September 30.

The paper itself provides some insight on how people use LLMs and the distribution of the different use-cases.

The researchers had a look at conversations with 25 LLMs. Data is collected from 210K unique IP addresses in the wild on their Vicuna demo and Chatbot Arena website.

  • mindbleach
    link
    fedilink
    English
    arrow-up
    5
    ·
    1 year ago

    Is that all?

    I’d expect fully half of people to try “generate nude Tayne” out of mere curiosity. The actual horny behavior would be less often… on trackable services.