SuspiciousCarrot78

joined 1 month ago
[–] SuspiciousCarrot78@aussie.zone 0 points 1 hour ago* (last edited 1 hour ago)

I have a strong feeling that lite is Qwen 3.5 27B and Max is GLM 5.2, based on the AAII scores on protons announcement post. 27B thinking is closest to 34 AAII and only GLM hits 51. So as fingerprints go....

But the fact that Lumo won't reveal specifics / inhibits that at system prompt level (despite broad family id being cooked into weights) sort of belies the transparency angle for me. What else is being obfuscated? I can't even uncover what fair use is - how many messages per hour etc

Logical inference then - opacity is driven by profit margin preservation and competitive insulation. If you can't calculate the mark up by query (vs OR) then you might be paying 5-20x mark up.

Sits funny amidst all the privacy and transparency cosplay.

At least they're not building autonomous weapons but treating open weight models as commercial secrets is sorta off putting.

 

FYI Lumo 2.0 just dropped and it seems to be comprised of open weight models (GLM 5.2 as the big model, Qwen 3.6 35B as the small, few others to boot)

https://proton.me/support/lumo-privacy

Having played around with it a bit...I dunno. It seems pretty locked down / security theatre heavy / refusals on benign things.

I can't run GLM locally (who can lol) but all things given I dunno how good value for money it is vs other options.

(Security and privacy promises notwithstanding that is - which perhaps, are the whole point with Lumo).

Has anyone used Lumo for any period of time? What quants do they use?

Nice LLM you have there. Be a real shame if one that costs 1/10 the price (and another that can run on a consumer rig self hosted) could outperform it.

[–] SuspiciousCarrot78@aussie.zone 21 points 8 hours ago* (last edited 8 hours ago)

Lol

#JustMicrosoftThings

Hey assholes? Eat a fat bag of shit. You're all surplus to requirement.

https://www.luanti.org/en/

[–] SuspiciousCarrot78@aussie.zone 1 points 19 hours ago* (last edited 18 hours ago)

Ive seen LTO for sale in person at the $300 USD mark...but I admit that's a rare occurrence / fire sale. So that "as little as" is probably not fair. Sorry and retracted.

EBay shows some in the $1200-1500 USD range (and maybe closer to $3-4K brand new).

That now makes me feel stupid for walking past one at $300 ("tape drive? Who the fuck needs that ancient shit") but I'm willing to bet that wasn't a lto-8, in hindsight.

Even my LTO-3 claim is not as remembered; I can find a lto-3 for $60 USD here locally (not the $5 I jokingly claimed), with cartridges in the $15 USD range.

https://ebay.io/m/1UST3Y

https://ebay.io/m/Gcf77T

Not bad, but not 12TB per cartridge.

All of this to say; the DVD shuffler + pi intermediary (+ NVIDIA shield if needed) is probably the genuinely better version of this. Bizarrely.

[–] SuspiciousCarrot78@aussie.zone 3 points 1 day ago* (last edited 1 day ago) (1 children)

Huh. Thought it was stock standard AOSP - perhaps the Aussie version is different? There are a few rebranded versions of the same hardware; you might be able to find something non proprietary. I think the underlying model is UNIWA if you want to go spelunk direct listings

https://opelmobile.com.au/wp-content/uploads/2023/08/OM-TouchFlip-A4.pdf

Where did you find the info that the TTFone uses a fork - XDA? Possibly it can take CFW?

In any case, if it's no go, it's no go.

PS: The other target might be a the Cat 22 flip but that thing has a face only a mother could love. I have seen clean CFW of the Duoquin models too - multiple threads on XDA - but that's candybar not flip

PPS: let me spelunk the 8020 for a minute. I have to imagine it's an off the shelf re-badge. EDIT: Hmm...looks like it's bespoke enough to NOT be a simple Shenzhen rebadge job.

[–] SuspiciousCarrot78@aussie.zone 2 points 1 day ago* (last edited 1 day ago) (3 children)

Yes - I have the rebranded Aussie version

https://www.officeworks.com.au/shop/officeworks/p/opel-mobile-touchflip-4g-flip-phone-optouchfp

It runs Android 8.1 and had no issue running Molly the last time I tried it. I really like that little nugget - let me know if you want a list of apps or launchers for it. Note: you'll have to use Fdroid, Droidify, Aurora store or direct apks, as it doesn't meet play store compliance.

If the TT990 is android 14, then it should work even better.

I can also confirm that my Duoqin F21 runs either just fine, but that's cheating

https://qinphone.com/products/qin-f21-pro-smart-keypad-phone-compact-2-8-inch-touchscreen-android-11-4g-lte-single-camera-google-play-support-ideal-backup-work-phone-porcelain-white-iron-grey

Back to the TT970; the keyboard is fantastic (download the true TT9 app) and it even runs futo voice (albeit a touch slowly). If you get one, try to get the 1750mah battery - it helps.

Standby is just bang on 2 days for me.

What's an ad? 🤔

Some people just want to watch the world burn

[–] SuspiciousCarrot78@aussie.zone 3 points 1 day ago* (last edited 1 day ago)

Good luck with that. I still have functional PCs that are 20 yrs old and perfectly fine as Jellyfin servers.

...because Kaios is essentially dead?

 

Do you host your own ML / AI / LLM? What do you use, and what do you use it for?

 

Based on recent comments this feels like a discussion we should have. So..topic, basically.

I'm not looking to be chief noisemaker on this, but I stand by what I wrote in !privacy and what's in my post history.

https://lemmy.ml/post/48724623/26190950

Let's have at; do we want a [AI] and [NOT AI] tag. Why or why not?

20
submitted 1 week ago* (last edited 1 week ago) by SuspiciousCarrot78@aussie.zone to c/selfhosted@lemmy.world
 

Re: the recent meta discussion and ongoing chats about self hosting, open vs closed source, AI etc, I wanted to share some food for thought.

I'll explain why this is related to self hosting at the bottom (section bolded): our lovely new mods can call it. I hope it inspires some out loud thinking.

Disclosure: I am not the content creator nor am I paid by them to signal boost. I just like their stuff and think this is an important topic, from multiple angles.


"The future of AI depends on the moral compass of five people."

I've been watching "AI in context" for a few weeks (they make long form biopic content on current state of AI - really good stuff).

This dropped today; it's about the wheeling and dealing behind closed doors at OpenAI re: Sam Altman's firing. It's a lot more watchable than that sounds :)

https://www.youtube.com/watch?v=_eYTkvZqbnQ

The line that brought me to a stand still was "the future of AI depends on the moral compass of like 5 people".

I think folks here (and Lemmy generally) are more savvy about AI then the gen pop (though Lemmy is famously FuckAI)....but even if you're training a nanoGPT model from scratch on hardware you own ...you're still beholden to outside forces.

Eg: the people's champion - Qwen - seems to have split or gone closed weights for 3.7. That's not a good sign.


Reason for post

In the recent [Meta] chat, I noted an undercurrent of "no man is an island" - that is, yes, you might host your own X, but you're still dependant on external Y (eg: SearXNG).

Self hosting / FOSS / forking mitigates some of the "we changed the terms of service after the sale" enshittification we see occurring in related spaces (eg: right to repair). But there's only so much leverage you can enact before it becomes pyrrhic.

I would like to believe "fuck you, I won't do what you told me" is our bulwark against market forces.

At the same time, it's sad to see so many "Don't be Evil" mission statements not survive contact with reality (watch the vid: OAi was founded on the ideal of "don't let AGI kill us")

I think what happens upstream has effects down stream too (see prior X vs Y examples)...

Not sure where this leaves us. It's a weird time to be alive.

Enjoy the video (and their others - Ai2027 is eye opening). Look forward to any productive chat this post might inspire.

 

Ive been watching "AI in context" for a few weeks (they make long form biopic content on current state of AI - really good stuff).

This dropped today; it's about the wheeling and dealing behind closed doors at OpenAI re: Sam Altman''s firing. It's a lot more watchable than that sounds :)

https://www.youtube.com/watch?v=_eYTkvZqbnQ

The line that brought me to a stand still was "the future of AI depends on the moral compass of like 5 people".

I know we're all about local LLMs here...but it's sad to see yet another "Don't Be Evil" mission statement get swallowed up market forces. OpenAI was meant to bring balance to the Force, not leave it in ruins, and yet....

I think folks here (and Lemmy generally) are more savvy about AI then the gen pop....but even if you're training a nanoGPT model from scratch on hardware you own ...you're still beholden to outside forces. Eg: the people's champion - Qwen - seems to have split or gone closed weights for 3.7.

Things that make you go hmm.

Anyway...just signal boosting a cool video.

 

I'm starting to develop arthritis in my fingers, which makes typing an interesting challenge.

I'm wondering, what is the best self-hosted solution for speech-to-text generally? Dragon dictate use to be the thing, but is there anything open source, self-hostable that's superseded it?

I would love to be able to have something that I can speak into that can interact with pretty much any app, be that notepad++, or my web browser when I'm entering stuff or even when I'm creating this Lemmy post (which I actually made using futo voice on my phone).

Windows and/or Linux ideally.

Any leads? Getting old sucks.

 

Recent post re: AI as utility

https://www.tomsguide.com/ai/people-will-buy-intelligence-from-us-on-a-meter-chatgpts-ceo-sam-altman-has-critics-worried-with-his-ai-vision

Myself, I'm a fan of local LLM / self hosted ML.... but if you ever needed a clarion call that a hard pivot is coming (soon) for online/ cloud based AI...Altman et al are making some concerning mouth noises (to say nothing of broader concerns with OAI, Anthropic etc).

Right now, I'm sketching out a plan where my Raspberry Pi (always on, 2-3w) uses a magic packet to wake up my modest AI server (Lenovo P330 with Tesla P4) if/when needed (Qwen 3.6-35B-A3B); no point in chugging down 80-100w, 24/7 for no good reason.

If the trend continues the direction it appears to be (increasing costs, environmental impacts etc) then I'd feel a lot better hosting my own as port of first call and replacing simpler tasks with more traditional programs. YMMV.

1
submitted 1 month ago* (last edited 1 month ago) by SuspiciousCarrot78@aussie.zone to c/localllama@sh.itjust.works
 

More often than not, AI and LLM gets conflated in the public consciousness...and then gets mixed with "Agentic", "SaaS" and other well...slop. So, here is a farmer in Japan, using a raspberry pi, to sort cucumbers.

https://www.newsweek.com/artificial-intelligence-cucumber-farm-raspberry-pi-495289

PS: 2016 article. I expect by now the tractor is self driving and named Betty.

If you have any other "dude does cool AI shit with a box of scraps in a cave", I'm all EARS.md

 

I was browsing Reddit (yetch) while waiting for some stuff to finish when I came across this post

https://old.reddit.com/r/LocalLLM/comments/1tek00h/why_is_llm_is_so_expensive/

The author make a (very) interesting claim: if table stakes are $6K (they're not...but go with it for now), then most folks are cooked from the get go.

Personally, I have been figuring out how to get more from less. For example, people have found ways to run Qwen3.6 35B on a 6GB VRAM GTX 1060 at ~20tok/s (--ctx 64K IIRC, but go check the vids yourself)

https://youtu.be/8F_5pdcD3HY

I think there's a lot of juice to squeeze by turning LLMs from "all seeing sages" into basically mouth pieces for shit that actually runs fast on regular silicon - but that's just me and my crazy brain. YMMV.

1
Token Speed visualiser (mikeveerman.github.io)
submitted 1 month ago* (last edited 1 month ago) by SuspiciousCarrot78@aussie.zone to c/localllama@sh.itjust.works
 

https://mikeveerman.github.io/tokenspeed/?rate=20&mode=agent&think=15

Exactly what it says on the tin :)

Pretty good simulator this. May it cause you to reconsider your expensive GPU upgrade :)

view more: next ›