There are so many reasons not to include any AI generated code.
hperrin
Unless the code the AI generated is a copy of copyrighted code, of course. Then it would be copyright infringement.
I can cause the AI to spit out code that I own the copyright to, because it was trained on my code too. If someone used that code without including attribution to me (the requirement of the license I release my code under), that would be copyright infringement. Do you understand what I mean?
The copyright office said material generated by AI is not copyrighted, even if that material is subsequently revised by the AI through additional prompts. That includes code. The GPL can only be used on copyrighted code. It is a copyleft license because it uses copyright law as a mechanism to enforce its terms. If you believe you can enforce a license on public domain material, that’s simply a gross misunderstanding of copyright law.
Yes, it will hopefully be a very small part of the kernel, but what happens thirty years from now if the kernel is all AI generated code? It may be a slippery slope, but it’s a valid slippery slope. The more the kernel is AI generated, the less of it the license can cover.
Sure, you can license them, but that license is unenforceable, because you don’t own the copyrights, so you can’t sue them for copyright infringement. And you’d have to be a fool to agree to a license for public domain material. You can do whatever you want with it, no license necessary.
If the author is an LLM, then the author is not a human.
I think you’re misunderstanding what I’m saying. Any portions of the kernel that are public domain can be used by anyone for any purpose without following the terms of the GPL. AI generated code is public domain. To make sure all parts of the kernel are protected by the GPL, public domain code should not be accepted unless absolutely necessary.
Ok, well here are quotes from the US Copyright Office that establish that what I said is true:
https://sciactive.com/human-contribution-policy/#More-Information
Distributing under the GPL is a software license agreement which is absolutely a contract:
A software license agreement is a legal contract that grants you permission to use software without transferring ownership. The software creator retains intellectual property rights while giving you specific usage rights under defined terms and conditions.
- https://ironcladapp.com/journal/contracts/software-license-agreement
Sure, you can license it whatever you want, but I can too, because it’s public domain. And neither of us can enforce those license terms on the other, because again, it’s public domain.
The copyright office has made it explicitly clear that those tools do not interfere with the traditional elements of authorship, and that the use of LLMs does. So, if you don’t want to take my word for it, take the US Copyright Office’s word for it.
Copyleft doesn’t mean it’s not copyrighted. Copyleft is not a legal term. “Copyleft” licenses are enforced through copyright ownership.
Did you read the quotes from the copyright office I linked to? I am going to go ahead and trust the copyright office over you on issues of copyrightability.
Yes, that makes sense. People have always been able to intentionally commit copyright infringement. However, it has historically been fairly difficult to unintentionally commit copyright infringement. That’s no longer the case. AI makes it very easy to unintentionally commit copyright infringement. That’s a good reason to ban it outright.