this post was submitted on 04 Jun 2026
217 points (96.2% liked)

World News

56413 readers
3033 users here now

A community for discussing events around the World

Rules:

Similarly, if you see posts along these lines, do not engage. Report them, block them, and live a happier life than they do. We see too many slapfights that boil down to "Mom! He's bugging me!" and "I'm not touching you!" Going forward, slapfights will result in removed comments and temp bans to cool off.

We ask that the users report any comment or post that violate the rules, to use critical thinking when reading, posting or commenting. Users that post off-topic spam, advocate violence, have multiple comments or posts removed, weaponize reports or violate the code of conduct will be banned.

All posts and comments will be reviewed on a case-by-case basis. This means that some content that violates the rules may be allowed, while other content that does not violate the rules may be removed. The moderators retain the right to remove any content and ban users.


Lemmy World Partners

News !news@lemmy.world

Politics !politics@lemmy.world

World Politics !globalpolitics@lemmy.world


Recommendations

For Firefox users, there is media bias / propaganda / fact check plugin.

https://addons.mozilla.org/en-US/firefox/addon/media-bias-fact-check/

founded 3 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] boonhet@sopuli.xyz 6 points 2 days ago (2 children)

The training data is as important as the source code here to replicate the end result. The weights are more like a binary distribution. You can run the model and you can technically edit it just like you can technically edit a binary file.

They also only release some libraries and tools for running the model if you have a set of weights (which they do graciously provide), but they do NOT release the source code for their training pipeline itself. That's up to you to reverse engineer from the whitepapers. Right now even if you had the exact training data and the compute available, you could not train your own Deepseek V3.2, let alone V4.

[–] xep@discuss.online 5 points 2 days ago

If people on Lemmy can't understand this I have no hope for the average person.

[–] humanspiral@lemmy.ca 0 points 2 days ago

training data is as important as the source code here to replicate the end result

this is the nature of this flame war. Perfect replication of the end result, which is extremely opaque in how it works, is not nearly as important as the weights, that you can post train for any domain specific/general improvement with any other dataset. Which is how the authors would improve/change the weights further as well.