You probably need some technical restrictions as well, but from the legal perspective: is there a license that is like Creative Commons EXCEPT for use cases like use the content for training an LLM by OpenAI or google?
You probably need some technical restrictions as well, but from the legal perspective: is there a license that is like Creative Commons EXCEPT for use cases like use the content for training an LLM by OpenAI or google?
@[email protected] sure that is what I have but I’d like to believe there are standard licenses that could/should be automatically enforced by a crawler. A custom license won’t work I think.
Regardless probably the nets policy is to update robots.txt and block ips