Actually Useful AI


What is the go-to booru image tagging model?

submitted 7 months ago by j4k3@lemmy.world to c/auai@programming.dev


I'm looking for the least biased tagging model for a large training set of water parks and water slides, where there is a lot of content that models struggle to deal with.

No, not in some creepy way. I actually want to eventually try training a LoRA on O'Neill cylinders, but I don't have the experience and have a limited dataset. Water slides pose a similar set of issues for a model's internal thinking and comprehension. Presently, all images of water slides start off as stairs; models do not understand gravity as a propulsion method or water as a lubricant on a slide. The falling water is treated literally as a waterfall, and falling, slipping, and sliding get associated with each other.

I've already made several attempts at training LoRAs with around 200 images, with limited success, but there are too many mixed concepts and difficulties.

Now I'm trying to make a much larger training set. I have a well-tuned Qwen 2.5 VL 7B doing captions, but I want to compare the results with tag-based training in various models. I need something that is not going to freak out because a normally clothed, not-at-all-sexualized kid is in some images, and especially something that won't ignore someone in mid-air on a launch slide or making a thrill-ride facial expression.

My dataset is a bunch of YouTube-type videos, and I am just using FFmpeg to take periodic screenshots. The videos are not at all NSFW and most are years old, so they pass moderation. I focused only on getting as many images as possible that include sections of people sliding with the camera on their face. My goal is to capture the true cultural diversity and unique cultural norms of a water park, similar to how models know that beaches have different cultural norms in their internal thinking and alignment.
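
For anyone wanting to reproduce the periodic screenshot step, here is a rough Python sketch of how it can be done. It is not the exact command used for this dataset: the function name, the example paths, the 5-second interval, and the output naming pattern are all assumptions for illustration. The only FFmpeg-specific parts are the `-i` input flag and the `fps` video filter, where `fps=1/N` keeps one frame per N seconds.

```python
import shlex
from pathlib import Path


def build_frame_grab_command(video: Path, out_dir: Path, every_seconds: int = 5) -> list[str]:
    """Build an ffmpeg command that saves one frame every `every_seconds` seconds."""
    if every_seconds <= 0:
        raise ValueError("every_seconds must be a positive integer")
    # Hypothetical naming scheme: <video stem>_000001.png, <video stem>_000002.png, ...
    pattern = out_dir / f"{video.stem}_%06d.png"
    return [
        "ffmpeg",
        "-hide_banner",
        "-i", str(video),
        # fps=1/N keeps one frame for every N seconds of video
        "-vf", f"fps=1/{every_seconds}",
        str(pattern),
    ]


if __name__ == "__main__":
    # Example paths are placeholders, not real files from the dataset.
    cmd = build_frame_grab_command(Path("videos/slide_pov.mp4"), Path("frames"), every_seconds=5)
    print(shlex.join(cmd))
    # To actually run it (needs ffmpeg on PATH and an existing output directory):
    # import subprocess; subprocess.run(cmd, check=True)
```

Building the command as a list rather than a shell string avoids quoting problems with video filenames that contain spaces.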

So, do you all know the most recent and reliable tag-generating model where alignment does not get in the way?
