

2 days ago
Guru meditation.
Meta got shit on for pirating books.
Anthropic got shit on for not pirating books.
Right, a 12 GB model trained on 100,000,000 images has about 120 bytes of weights per image. That is room for an MD5 checksum of each (16 bytes) and not much else, certainly not the images themselves.
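For scale, here is the back-of-the-envelope arithmetic as a rough sketch. It assumes decimal gigabytes and treats the whole model file as storage, which is generous. The variable names are just for illustration. The budget comes out to roughly 120 bytes per image, which covers a 16-byte MD5 digest but nothing close to the image itself.

```python
# Rough storage budget, assuming decimal gigabytes (1 GB = 10**9 bytes)
# and pretending every byte of the model could be used as plain storage.
MODEL_BYTES = 12 * 10**9
NUM_IMAGES = 100_000_000
MD5_DIGEST_BYTES = 16  # an MD5 digest is 128 bits

bytes_per_image = MODEL_BYTES / NUM_IMAGES
md5_total_gb = NUM_IMAGES * MD5_DIGEST_BYTES / 10**9

print(f"budget per image: {bytes_per_image:.0f} bytes")   # 120 bytes
print(f"all MD5 digests:  {md5_total_gb:.1f} GB")         # 1.6 GB
```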
The same people expect it to identify the authorship of sentence fragments, but never quote one whole paragraph from any book ever. Granted, gigabytes of text could hold a significant fraction of all books. Yet finding a single recognizable page is news. Storing text is not what these companies spent a bajillion dollars on.
Turning books into a language model is transformative. No LLM is a substitute for the original works.
Most anti-AI sentiment seems like misplaced hatred of awful companies forcing nonsense on everybody, or a refusal to pass judgment on capitalism itself.