1137

In a similar vein, alt-text your image posts here! (sopuli.xyz)

submitted 1 year ago by crmsnbleyd@sopuli.xyz to c/microblogmemes@lemmy.world

66 comments fedilink hide all child comments

crossposted from https://wandering.shop/@meredithw/113983437672884349

you are viewing a single comment's thread
view the rest of the comments

[-] rumschlumpel@feddit.org 160 points 1 year ago

I found that even when you can see the image, alt-text often helps significantly with understanding it. e.g. by calling a character or place by name or saying what kind of action is being done.

[+] AcesFullOfKings@feddit.uk 49 points 1 year ago* (last edited 1 year ago)

[deleted]

[-] ilinamorato@lemmy.world 31 points 1 year ago

Honestly I think that sort of training is largely already over. The datasets already exist (have for over a decade now), and are largely self-training at this point. Any training on new images is going to be done by looking at captions under news images, or through crawling videos with voiceovers. I don't think this is a going concern anymore.

And, incidentally, that kind of dataset just isn't very valuable to AI companies. Most of the use they're going to get is in being able to create accessible image descriptions for visually-disabled people anyway; they don't really have a lot more value for generative diffusion models beyond the image itself, since the aforementioned image description models are so good.

In short, I really strongly believe that this isn't a reason to not alt-text your images.

[-] Venator@lemmy.nz 3 points 1 year ago

Maybe the AI can alt text it for us.

[-] ilinamorato@lemmy.world 5 points 1 year ago* (last edited 1 year ago)

It sort of can. Firefox is using a small language model to do just that, in one of the more useful accessibility implementations of machine learning. But it's never going to be capable of the context that human alt text, from the uploader, can give.

[-] Venator@lemmy.nz 2 points 1 year ago

True, but I was thinking maybe something in the crate post flow(maybe running client side so as not to overload the lemmy servers 😅) that generates a description that the uploader can edit before(and after) they post it, that way it's more effort for the poster to not add it than to add it, and if it's incorrect people will usually post comments to correct it. Maybe also adding a note at the end that its ai generated unless the user edits it.

But that's probably way too complicated for all the different lemmy clients to be feasible to implement tbh.

[-] ilinamorato@lemmy.world 2 points 1 year ago

I think that would make a great browser extension. I'm not in a position to make it right now, but wow, that could potentially be really useful.

[-] flamingos@feddit.uk 7 points 1 year ago* (last edited 1 year ago)

AI training data mostly comes from giving exploited Kenyans PTSD, alt-text becoming a common thing on social media came quite a bit after these AI models got their start.

[-] art@lemmy.world 4 points 1 year ago

I would gladly train a million AI robots just to make the web a lot cooler for those visual impairment.

[-] Mouselemming@sh.itjust.works 2 points 1 year ago

Just be sure not to specify how many fingers, or thumbs, or toes, or that the two shown are opposites L/R. Nor anything about how clown faces are designed.

[-] LePoisson@lemmy.world 1 points 1 year ago

What do you think is creating all those descriptions?

[-] otter@lemmy.ca 7 points 1 year ago

It's been great on pixelfed, I appreciate the people that put some time into it

this post was submitted on 27 Feb 2025

1137 points (100.0% liked)

Microblog Memes

11879 readers

784 users here now

A place to share screenshots of Microblog posts, whether from Mastodon, tumblr, ~~Twitter~~ X, KBin, Threads or elsewhere.

Created as an evolution of White People Twitter and other tweet-capture subreddits.

RULES:

Your post must be a screen capture of a microblog-type post that includes the UI of the site it came from, preferably also including the avatar and username of the original poster. Including relevant comments made to the original post is encouraged.
Your post, included comments, or your title/comment should include some kind of commentary or remark on the subject of the screen capture. Your title must include at least one word relevant to your post.
You are encouraged to provide a link back to the source of your screen capture in the body of your post.
Current politics and news are allowed, but discouraged. There MUST be some kind of human commentary/reaction included (either by the original poster or you). Just news articles or headlines will be deleted.
Doctored posts/images and AI are allowed, but discouraged. You MUST indicate this in your post (even if you didn't originally know). If an image is found to be fabricated or edited in any way and it is not properly labeled, it will be deleted.
Absolutely no NSFL content.
Be nice. Don't take anything personally. Take political debates to the appropriate communities. Take personal disagreements & arguments to private messages.
No advertising, brand promotion, or guerrilla marketing.

RELATED COMMUNITIES:

founded 3 years ago

MODERATORS

ReadyUser31@lemmy.world

aeronmelon@lemmy.world

needanke@feddit.org