Elon has responded to the criticism and is increasing the limits to a whopping:

Verified accounts: 8000 posts/day
Unverified accounts: 800 posts/day
New unverified accounts: 400 posts/day
  • rlspam
    link
    fedilink
    English
    arrow-up
    2
    ·
    2 years ago

    How true is the LLM data scraping threat?

    • Skaryon@lemmy.world
      link
      fedilink
      English
      arrow-up
      13
      ·
      2 years ago

      Who knows. One thing for sure is that this is just one more thing he pulled out of his ass without any backing

    • nottheengineer@feddit.de
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      1
      ·
      2 years ago

      Meta has shown that getting huge amounts of training data can lead to great results with a model that’s much simpler than what openAI uses and it looks like they are taking a more open approach to LLMs because of that. Twitter has shitloads of possible training data, but it’s Twitter so that data isn’t great.

      Elon is known to be afraid of AGIs becoming hostile, so that explains the decision.

      I don’t think it’ll slow down AI development too much. There are new Llama-based models coming out every month that are better than the previous ones.

      Reddit is a much better source of data and if they don’t want to lose SEO, their data can still be gathered by scraping even after the API changes take effect.