this post was submitted on 28 Nov 2025
338 points (98.6% liked)

Selfhosted


[–] tal@lemmy.today 3 points 7 months ago* (last edited 7 months ago) (1 children)

You can have applications where wall-clock time is not all that critical but a large model size is valuable, or where a model is very sparse and so does little computation relative to its size. But for the major applications, like today's generative AI chatbots, I think that's correct.

[–] NotMyOldRedditName@lemmy.world 3 points 7 months ago* (last edited 7 months ago)

Ya, that's fair. If I was doing something where I didn't care about the time, it did work. And we weren't talking hours, though it could be many minutes.