42
submitted 1 year ago* (last edited 1 year ago) by mookulator@mander.xyz to c/fediverse@lemmy.ml

I have a fair amount of experience with data visualization, analysis etc and thought it would be a fun project to try to visualize the Lemmy network, specifically which instances have strong links to one another via subscriptions from users in one to communities in the other.

How/where can I get that data?

EDIT: It sounds like many people would find this a violation of their data privacy and I simply shouldn’t do it. I had thought this kind of data was intended to be entirely accessible by design, but I learned something new!

you are viewing a single comment's thread
view the rest of the comments
[-] mookulator@mander.xyz 1 points 1 year ago

Thanks for the support! I thought it would be a useful tool too. People could use it to find instances they didn’t know about, but which are popular in their network, or to find under appreciated instances to build bridges.

Anyway, sounds like database access is a must, and I’m not trying to take on a massive data wrangling exercise. I had it in my mind that the data were just sitting their for download, which was wishful thinking!

[-] Danterious@lemmy.world 2 points 1 year ago

If you are interested there is a professor named Damon Centola that might be interested in dedicating time to gathering that data and might help with the visualization as well. If you are in the sociologist space or just interested you could reach out to him.

[-] mookulator@mander.xyz 1 points 1 year ago
[-] Danterious@lemmy.world 1 points 1 year ago

No problem just notify me if you ever decide to follow through.

[-] Danterious@lemmy.world 1 points 1 year ago

Sorry for commenting again but I think there is a way for you to do this in a completely open, easy, and privacy-preserving way. You don't need to access their database.

  1. Get a list of instances that you want to look at the subscription patterns for. (All the instances here + Lemmy.world)

  2. Go to that instance's website, click the "Communities" tab at the top, and then click "All" It shows how many users from that instance are subscribed to that community (both communities from that instance and outside of that instance)

If you find a way to automatically (or manually) scrape this data from all of those websites you can create the visualization that you were talking about.

So you were right, the data is open source it is just specific to each website.

this post was submitted on 15 Aug 2023
42 points (100.0% liked)

Fediverse

17677 readers
23 users here now

A community dedicated to fediverse news and discussion.

Fediverse is a portmanteau of "federation" and "universe".

Getting started on Fediverse;

founded 4 years ago
MODERATORS