this post was submitted on 03 Jun 2026
955 points (99.7% liked)

People Twitter

10067 readers
1193 users here now

People tweeting stuff. We allow tweets from anyone.

RULES:

  1. Mark NSFW content.
  2. No doxxing people.
  3. Must be a pic of the tweet or similar. No direct links to the tweet.
  4. No bullying or international politcs
  5. Be excellent to each other.
  6. Provide an archived link to the tweet (or similar) being shown if it's a major figure or a politician. Archive.is the best way.

founded 3 years ago
MODERATORS
955
Managers (media.piefed.zip)
submitted 1 week ago* (last edited 1 week ago) by inari@piefed.zip to c/whitepeopletwitter@sh.itjust.works
 
you are viewing a single comment's thread
view the rest of the comments
[–] Evotech@lemmy.world 1 points 1 week ago (2 children)

And then add 200k context on top

And then add hundred of users needing to do things in paralell

[–] boonhet@sopuli.xyz 1 points 1 week ago (1 children)

If it's a large enough company to have hundreds of users, it can afford several beefy machines tbh

[–] Evotech@lemmy.world 1 points 1 week ago (1 children)

It's a capex and that type of hardware needs to be replaced every 3 years minimum and you need people to set it up and maintain a cluster. And it's not straight forward.

You are never going to get that approved without a serious business case.

Claude on the other end is a opex and much easier to just try out and then build a solution on it

Not saying it doesn't happen but it's not as easy as people make it sound like

[–] boonhet@sopuli.xyz 1 points 1 week ago (1 children)

It's 3 years if you're trying to be competitive on frontier models and generally capex is preferred to opex because opex never ends

I don't think anyone's building a cluster for their business right now, but one single rack after Claude gets rid of their subscription options? Might be a good deal.

[–] Evotech@lemmy.world 1 points 1 week ago (1 children)

Capex never ends either if it's hardware. Also you need opex to run it

[–] boonhet@sopuli.xyz 1 points 1 week ago (1 children)

400k on a DGX node starts seeming like a great deal when your employees each start using a few hundred dollars worth of Claude tokens every month. That one node can handle a lot of users depending on the model used.

It's an expense once every maybe 5 or 6 years in reality and you don't need to hire new people, you just give your existing sysadmins some extra work. They'll complain, but they'll still do it.

Of course the sensible alternative is to use a decent model off openrouter for peanuts but then you're sending all your sensitive business secrets to China which is even worse than sharing them with a US AI company. And people WILL be sharing secrets lol

[–] Evotech@lemmy.world 1 points 1 week ago (1 children)
[–] boonhet@sopuli.xyz 1 points 1 week ago (1 children)

You don't have to run Claude Opus for it to be useful lol

[–] Evotech@lemmy.world 1 points 1 week ago

It's always going to be second rate. And you'll have to defend that

[–] lime@feddit.nu 1 points 1 week ago

nobody said anything about it being a large company :P

anyway, seems the framework is hampered by a slow gpu so the memory issues are apparently moot.