this post was submitted on 26 May 2026
36 points (86.0% liked)


Hi!

While I really enjoy seeing so many people being accommodating to people with disabilities, I find manually transcribing every image I post very tiring.

I thought that I could at least use some sort of AI to help with image transcripts, though such a tool could probably be put to better use by the person with the disability themselves.

So that's the question: should I skip transcribing an image, or let an AI do it?

[–] x74sys@programming.dev 14 points 3 weeks ago* (last edited 3 weeks ago) (1 children)

In my opinion, no. It has to be heavily curated. You’re not saving yourself a lot of work if you have to read it word by word (and probably correct stuff) anyway.

I think just one very short sentence describing what’s on there (it doesn’t have to be detailed) is a lot better than whatever an LLM will give you.

[–] basxto@discuss.tchncs.de 1 points 2 weeks ago (2 children)

It depends a lot on the image. Multi-panel comics have pretty long alt texts, and AI can make it faster to reproduce the text in the image.

[–] x74sys@programming.dev 3 points 2 weeks ago

But then you’re primarily extracting text, which you don’t need LLMs for. OCR tools will do the job much more cheaply and effectively.

[–] lambisio@feddit.cl 3 points 2 weeks ago (1 children)

and AI can make it faster to reproduce the text in the image

That was solved decades ago without AI. It's called OCR.

[–] rain_worl@lemmy.world -1 points 1 day ago

ackshyually ai is a broad term that applied to ocr when it was first made, but nobody uses it that way anymore, they refer to machine learning. but wait! machine learning is used for ocr!!