this post was submitted on 05 Apr 2026
119 points (100.0% liked)

Fuck AI

cross-posted from: https://ibbit.at/post/219495

From Fark.com RSS via this RSS feed. Fark comments are available here.

---

By Wednesday morning, Anthropic representatives had used a copyright takedown request to force the removal of more than 8,000 copies and adaptations of the raw Claude Code instructions - known as source code - that developers had shared on programming platform GitHub.
It later narrowed its takedown request to cover just 96 copies and adaptations, saying its initial ask had reached more GitHub accounts than intended.

Source [web-archive]

---

Many unresolved legal questions over LLMs and copyright center on memorization: whether specific training data have been encoded in the model’s weights during training, and whether those memorized data can be extracted in the model’s outputs.

While many believe that LLMs do not memorize much of their training data, recent work shows that substantial amounts of copyrighted text can be extracted from open-weight models... We investigate this question using a two-phase procedure...

We evaluate our procedure on four production LLMs: Claude 3.7 Sonnet, GPT-4.1, Gemini 2.5 Pro, and Grok 3, and we measure extraction success with a score computed from a block-based approximation of longest common substring...

Taken together, our work highlights that, even with model- and system-level safeguards, extraction of (in-copyright) training data remains a risk for production LLMs...

...we were able to extract four whole books near-verbatim, including two books under copyright in the U.S.: Harry Potter and the Sorcerer’s Stone and 1984...

Source: https://arxiv.org/pdf/2601.02671
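
For anyone wondering what a "block-based approximation of longest common substring" score might look like in practice, here is a rough sketch. It is not the paper's method: the function name `block_overlap_score`, the word-level tokenization, and the `min_block_words` threshold are all assumptions made for illustration. It uses Python's standard `difflib.SequenceMatcher` to find verbatim word runs shared by a reference text and a model output, then reports the fraction of the reference covered by runs above the threshold.

```python
from difflib import SequenceMatcher


def block_overlap_score(reference: str, generated: str, min_block_words: int = 5) -> float:
    """Fraction of reference words covered by long verbatim blocks.

    Hypothetical illustration only; the paper's actual scoring
    procedure is not described in the excerpt above.
    """
    ref_words = reference.split()
    gen_words = generated.split()
    if not ref_words:
        return 0.0

    # autojunk=False disables difflib's popularity heuristic, which
    # would otherwise ignore very frequent words in long texts.
    matcher = SequenceMatcher(None, ref_words, gen_words, autojunk=False)

    # Count only matching blocks long enough to look like copying
    # rather than chance overlap of common phrases.
    covered = sum(
        block.size
        for block in matcher.get_matching_blocks()
        if block.size >= min_block_words
    )
    return covered / len(ref_words)


if __name__ == "__main__":
    reference = "the quick brown fox jumps over the lazy dog near the quiet river bank"
    generated = "he said the quick brown fox jumps over the lazy dog and then left"
    print(f"overlap score: {block_overlap_score(reference, generated):.3f}")
```

A score near 1.0 would mean almost the whole reference shows up in the output as long verbatim runs; a score near 0.0 means only short or no shared runs. The threshold matters a lot: set it too low and ordinary phrases count as "extraction".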

top 3 comments
[–] aarch0x40@piefed.social 31 points 1 week ago

Phase 1: Steal IP

Phase 2: Claim IP rights over stolen IP

Phase 3: Sell IP rights under Chapter 7

[–] TwilitSky@lemmy.world 7 points 1 week ago

AI is so bad I think I'd rather watch the new Sex & The City series "And Just Like That" than continue worrying about this universe.

[–] brokenwing@discuss.tchncs.de 4 points 1 week ago