165
submitted 2 years ago* (last edited 2 years ago) by Buttons@programming.dev to c/programming@programming.dev

My first experience with Lemmy was thinking that the UI was beautiful, and lemmy.ml (the first instance I looked at) was asking people not to join because they already had 1500 users and were struggling to scale.

1500 users just doesn't seem like much, it seems like the type of load you could handle with a Raspberry Pi in a dusty corner.

Are the Lemmy servers struggling to scale because of the federation process / protocols?

Maybe I underestimate how much compute goes into hosting user generated content? Users generate very little text, but uploading pictures takes more space. Users are generating millions of bytes of content and it's overloading computers that can handle billions of bytes with ease, what happened? Am I missing something here?

Or maybe the code is just inefficient?

Which brings me to the title's question: Does Lemmy benefit from using Rust? None of the problems I can imagine are related to code execution speed.

If the federation process and protocols are inefficient, then everything is being built on sand. Popular protocols are hard to change. How often does the HTTP protocol change? Never. The language used for the code doesn't matter in this case.

If the code is just inefficient, well, inefficient Rust is probably slower than efficient Python or JavaScript. Could the complexity of Rust have pushed the devs towards a simpler but less efficient solution that ends up being slower than garbage collected languages? I'm sure this has happened before, but I don't know anything about the Lemmy code.

Or, again, maybe I'm just underestimating the amount of compute required to support 1500 users sharing a little bit of text and a few images?

top 50 comments
sorted by: hot top controversial new old
[-] dudeami0@lemmy.dudeami.win 80 points 2 years ago

The numbers are a little higher than you mention (currently ~3.2k active users). The server isn't very powerful either, it's now running on a dedicated server with 6 cores/12 threads and 32 gb ram. Other public instances are using larger servers, such as lemmy.world running on a AMD EPYC 7502P 32 Cores “Rome” CPU and 128GB RAM or sh.itjust.works running on 24 cores and 64GB of RAM. Without running one of these larger instances, I cannot tell what the bottleneck is.

The issues I've heard with federation are currently how ActivityPub is implemented, and possibly the fact all upvotes are federated individually. This means every upvote causes a federation queue to be built, and with a ton of users this would pile up fast. Multiply this by all the instances an instance is connected to and you have an exponential increase in requests. ActivityPub is the same protocol used by other federated servers, including Mastodon which had growing pains but appears to be running large instances smoothly now.

Other than that, websockets seem to be a big issue, but is being resolved in 0.18. It also appears every connected user gets all the information being federated, which is the cause for the spam of posts being prepended to the top of the feed. I wouldn't be surprised if people are already botting content scrapers/posters as well, which might cause a flood of new content which has to get federated which causes queues to back up; this is mostly speculation though.

As it goes with development, generally you focus on feature sets first. Optimization comes once you reach a point a code-freeze makes sense, then you can work on speeding things up without new features breaking stuff. This might be an issue for new users temporarily, but this project wasn't expecting a sudden increase in demand. This is a great way to show where inefficiencies are and improve performance is though. I have no doubt these will be resolved in a timely manner.

My personal node seems to use minimal resources, not having even registered compared to my other services. Looking at the process manager the postgres/lemmy backend/frontend use ~250MB of RAM.

For now, staying off lemmy.ml and moving communities to other instances is probably best. The use case of large instances anywhere near the scale of reddit wasn't the goal of the project until reddit users sought alternatives. We can't expect to show up here and demand it work how we want without a little patience and contributing.

[-] glorbo@lemmy.one 49 points 2 years ago

Yup was just typing a comment to basically this effect. Federation adds a ton of overhead -- you can still do things fairly efficiently, but every interaction having to fan out to (and fan in from!) many servers instead of like a single RDBMS is gonna cost you.

In all likelihood the code is not as efficient as it could be, but usually you get time to work those out gradually. A giant influx of users quickly turns "TODO: fix in the next six months" into "Oh god the servers are melting fuck fuck."

That said, assuming the devs can get over this hump, I suspect using a compiled language will pay off long-term. Sure things will still be primarily IO-bound, but making things less CPU-bound is usually a good thing.

For some illustrative examples: Mastodon is in Ruby and hits dumb scaling limitations far more often than other fedi microblogs. Pleroma/Akkoma are Elixir (and BEAM is super well optimized for fast message passing/scaling/IO), Calckey (primarily Typescript) is moving some code to Rust, GoToSocial (Golang) is able to run in a fraction of the resources of Mastodon. The admins of one of the bigger tech instances recently announced they're basically giving up on administrating Mastodon and are instead going to write a new server from scratch in a compiled language because it's easier for them than scaling a Rails monolith.

TL;DR everything is IO-bound til it's not.

[-] agressivelyPassive@feddit.de 17 points 2 years ago

I'm pretty sure the fediverse needs a new kind of node at some point. If we assume, that almost every larger instance is connected to almost every other larger instance directly, then there's a ton of duplicated and very small messages.

There needs to be some kind of hub in-between to aggregate and route this avalanche. Especially if, like you wrote, every upvote is a message, the overhead (I/O, unmarshalling, etc) is huge.

[-] chris@l.roofo.cc 5 points 2 years ago

You mean like centralizing the fediverse? Who hosts the hub? Who maintains it? In which country?

load more comments (6 replies)
load more comments (1 replies)
[-] bouncing@partizle.com 13 points 2 years ago* (last edited 2 years ago)

I can’t imagine what possessed them to use websockets other than “gee whiz websockets.”

Probably the same appeal as rust though: gee whiz.

load more comments (2 replies)
[-] Buttons@programming.dev 11 points 2 years ago

Well said. Thanks for sharing your experience and those insightful links.

[-] codepy@livy.one 5 points 2 years ago

What specs are you running your instance on?

[-] dudeami0@lemmy.dudeami.win 13 points 2 years ago

An old Chromebox G1 (i7-4600U and 8GB of ram) with a 128GB internal NGFF and external 1TB NVMe. It's by no means powerful or expensive hardware. I'm also only receiving federated posts for my subscribed communities, and sending out these comments I write so it's a lot lower workload than the larger instances.

load more comments (2 replies)
[-] KindaABigDyl@programming.dev 53 points 2 years ago

It could be the devs just like programming in Rust. It's a nice language lol

load more comments (1 replies)
[-] Espi@kbin.social 43 points 2 years ago

I would say that it's extremely unlikely.

Websites in general are never limited by raw code execution, they are mostly limited by IO. Be that disk IO as files are read and written, database IO as you need to execute complex queries to gather all the data to build the user timeline, and network IO to transfer data to and from the user. For decentralized social media like Kbin or Lemmy its even more IO limited as each instance needs to go back and forth to other instances to keep up-to-date data.

Websites usually benefit much more from caching and in-memory databases to keep frequently used data in fast storage.

This is why simple, high level, object oriented, garbage collected languages have become so common. All the CPU performance penalties they incur don't actually affect the website performance.

[-] tortoise@tortoisewrath.com 15 points 2 years ago* (last edited 2 years ago)

Not relevant to lemmy (yet), but this does break down a bit at very large scales. (Source: am infra eng at YouTube.)

System architecture (particularly storage) is certainly by far the largest contributor to web performance, but the language of choice and its execution environment can matter. It's not so important when it's the difference between using 51% and 50% of some server's CPU or serving requests in 101 vs 100 ms, but when it's the difference between running 5100 and 5000 servers or blocking threads for 101 vs 100 CPU-hours per second, you'll feel it.

Languages also build up cultures and ecosystems surrounding them that can lend themselves to certain architectural decisions that might not be beneficial. I think one of the major reasons they migrated the YouTube backend to C++ isn't really anything to do with the core languages themselves, but the fact that existing C++ libraries tend to be way more optimized than their Python equivalents, so we wouldn't have to invest as much in developing more efficient libraries.

load more comments (1 replies)
[-] th3raid0r@tucson.social 8 points 2 years ago* (last edited 2 years ago)

In lemmy's case, my perusal of the DB didn't really suggest that the queries would be that complex and I suspect that moving it to a higher performance NoSQL DB might be possible, but I'd have to take a look at a few more queries to be sure.

I wonder if this could be made to work with Aerospike Community Edition...

Obviously it could be more effort than it's worth though.

[-] terebat@programming.dev 19 points 2 years ago

The issues I've seen more are around images. Hosting the images on an object store (cloudflare r2, s3) and linking there would reduce a lot of federation bandwidth, as that's probably cause higher ram/swap usage too.

pict-rs supports storing in object stores, but when getting/serving images, it still serves through the instance as the bottleneck IIRC. That would do quite a bit to free up some resources and lower overall IO needed by the server.

[-] hungrybread@lemmygrad.ml 8 points 2 years ago

There's no need to migrate the database, that shouldn't be an issue at this size. Caching should be implemented as another comment suggested.

load more comments (5 replies)
load more comments (1 replies)
load more comments (3 replies)
[-] 24Vindustrialdildo@sh.itjust.works 40 points 2 years ago

I think the devs openly stated they aren't backend bods and asked for help optimising the database as a priority. There's a bit of work going on on github to sort that out I think. Anyone reading this who can optimise postgresql or contribute to a database agnostic retool should probably speak to the devs as I imagine you'd be welcome.

I wish I could help so much but I doubt they're going to retool into .net haha.

[-] Buttons@programming.dev 15 points 2 years ago* (last edited 2 years ago)

Which is fine. If they wanted to learn Rust and wrote inefficient code, good for them. I appreciate their efforts. Rust can certainly be beaten into shape and perform well enough in the end.

[-] 21trillionsats@infosec.pub 8 points 2 years ago

Rust itself or the way the Rust logic is implemented is not the bottleneck. Like most decent web applications the bottleneck is the database and how the decentralized protocols themselves are reconciled there.

Scaling massive amounts of records like Lemmy has been forced to is almost always IO bound at the database level even when a web service is centralized; this is much more difficult in federated architectures. This is why “NoSQL” databases have increased in popularity, but they are also not a magic bullet as there are major ACID trade offs one needs to consider.

[-] jeltz@programming.dev 7 points 2 years ago

NoSQL databases are no silver bullet and the costs of ACID are usually exaggerated (plus most NoSQL databases actually implement ACID anyway). NoSQL databases and SQL databases often have similar performance characteristics since most of the technology is typically the same under the hood.

Plus from my experience as a database consultant: databases are rarely IO bound, NoSQL or SQL unless you have a strange workload. Most time for query execution is usually spent waiting on loads or executing CPU instructions, not waiting on disk IO.

One benefit of using rust for webservers in general is that it's possible to have a consistently lower latency compared to GCd langues: discord mentions this in their article about migrating to rust from go: https://discord.com/blog/why-discord-is-switching-from-go-to-rust

Another difference between rust and e.g. python is that rust expects you to invest more time to get code that's runnable in the first place, but likely more well optimized and correct.

In my experience from writing rust, the language pushes you to write more efficient code compared to python because it makes things like copying visible and also because it's easier to reason about memory usage compared to garbage collection which means that you're more likely to have a useful model of the performance cost of various things while you're programming.

It's possible that a hypothetical lemmy written in python would have allowed the devs to do some big picture optimizations that they haven't had the time to do yet in the rust version, so for the time being it might be slower than a python alternative.

Rust is likely to catch up though: eventually the rust version will probably also have this optimization while the python version has to resort to make smaller optimizations that the rust version already had in the first version of the code or that you get for free from the language.

[-] balder1991@lemmy.world 6 points 2 years ago* (last edited 2 years ago)

I mean, comparing it to Python is kinda unfair as Rust is closer to C and Python is one of the slowest languages out there. There is a whole spectrum of languages between Rust and Python. The final reason was probably that the devs were comfortable enough with Rust.

load more comments (2 replies)
[-] icesentry@lemmyrs.org 17 points 2 years ago

What makes you think that inefficient rust is slower than efficient Python? I mean, sure it could be possible if you are actively trying to make rust slow, but rust is multiple order of magnitude faster than python. If rust was to blame here then I don't think any language could be fast enough.

[-] Buttons@programming.dev 11 points 2 years ago* (last edited 2 years ago)

Rust is not to blame, but that code that has been written in Rust might be to blame.

The algorithms used have more effect than the language used, and Rust might make using certain algorithms more painful and thus steer programmers towards less efficient algorithms. Using clone is often an example of this, it's a little easier and gets around some borrow checker difficulties. (This is true in general, but I don't know if this is what has happened with Lemmy.)

Look at salvo [diesel] coming it at #200+ on this benchmark1, lots of programming languages have at least one framework that is faster on the microbenchmark. This isn't especially meaningful, but it does show that, let's say, a feature rich framework in Rust might end up being slower than a Python framework that's laser focused on the specific use case.

load more comments (2 replies)
load more comments (1 replies)
[-] sznio@lemmy.world 16 points 2 years ago

Pulling this out of my ass, but I think the problem might be in Lemmy using websockets.

I feel like supporting 1500 simultaneous users making a request every 10-20 seconds is easier than keeping 1500 websockets alive.

Irregardless, Lemmy does feel very snappy compared to other websites I've had the displeasure of using. Main problem is low robustness in the RPC layer.

[-] binwiederhier@discuss.ntfy.sh 15 points 2 years ago

I maintain and host ntfy.sh, an open source push notification service. I have a constant 9-12k WebSocket and HTTP stream connections going, and I host it on a two core machine with an average load average of less than 1. So I can happily tell you that it's not WebSockets. Hehe.

My money would be on the federation. Having to blast/copy every single comment to every single connected instance seems like a lot.

load more comments (3 replies)
load more comments (2 replies)
[-] yourstruly@dataterm.digital 13 points 2 years ago* (last edited 2 years ago)

Hear me out, but I believe that using Rust holds Lemmy back.

Writing Rust code is difficult, and fairly time consuming. It's difficult to get right, and as other commenters have noted, Lemmy code seems to do a lot of things for the "hype factor" (like Websockets). It's difficult to find enough devs as well.

The article about Discord switching to Rust from Go in the top comment is misleading in my opinion. They totally rearchitected their service while rewriting it, so it's an apples to oranges comparision.

[-] argv_minus_one@beehaw.org 15 points 2 years ago* (last edited 2 years ago)

Rust isn't difficult to get right. JavaScript is difficult to get right. That's because Rust tells you when you get it wrong, and JavaScript doesn't.

[-] frostphunk@lemmy.world 13 points 2 years ago* (last edited 2 years ago)

Learning rust is hard, yeah. But I’ve found that once you get the hang of it, actually using rust and writing code day to day is as fast or faster than using other languages (Jacascript, Pythong, etc). Rust tells you exactly what you did wrong and why it’s dangerous, this is incredibly useful in avoiding bugs and speeds up productivity. Therefore, I have no doubt the main creators of Lemmy have issues with writing rust code.

I agree that finding developers can be hard, though. Especially since rust is still a relatively new language. I also agree that new programmers who recently came to lemmy and want to help it succeed by contributing code will take a good bit of time to get to that point of ‘being comfortable enough with rust that it increases productivity’. However, I also think we have to consider the inverse, that the influx of new users will also see experienced rust programmers who want to help contribute to Lemmy and be successful at it (and in fact I think we’ve already seen this if I take Lemmy PR activity as any indication). Indeed even the rust subreddit stickied protest post hinted that rust developers are among the few who can help enact new alternative platforms to Reddit, and there are definitely OG and amazing programmers in that community.

That’s also not mentioning that rust has continued to gain traction and is continually one of the most favorite languages for devs (according to stackoverflow)

TL;DR: I’m a rust optimist and have faith in the rust community to help out Lemmy where it matters.

Source: been rust dev for 4+ years, do it for day job. Also considering helping out Lemmy when my life becomes a little bit less crazy

Lemmyrs/rustlang for rust instance btw

[-] terebat@programming.dev 12 points 2 years ago

On the other hand, Rust is fairly resilient. The issues Lemmy is experiencing wouldn't be fixed in Python vs Java, it's more of an architectural constraint. Those issues, experienced devs can fix mostly regardless of language.

[-] SomeGuyNamedMy@vlemmy.net 10 points 2 years ago

If your college educated in cs, and your main issue with a codebase is the language its writen in, i have some serious questions as to how the hell you graduated

load more comments (3 replies)
[-] TheGreenGolem@lemmy.dbzer0.com 5 points 2 years ago

Ha! They are ditching Websockets in the next release. Good riddance.

load more comments (5 replies)
[-] dragontamer@lemmy.world 13 points 2 years ago

Its not code that seems to have difficulty scaling, it seems to be user curation and user trust that is being difficult to scale.

[-] Buttons@programming.dev 12 points 2 years ago

The lemmy.ml post that asks people to go elsewhere is still stickied and the first thing it mentions is that they have upgraded the server. So that does suggest that compute scaling is an issue.

load more comments (1 replies)
[-] ruckblack@sh.itjust.works 6 points 2 years ago

I mean, with all the server issues we've seen with the influx of users, it does seem like the code is difficult to scale

[-] spaduf 12 points 2 years ago

At this point I think the bottleneck is largely that every server has needed to upgrade for this wave of the reddit migration.

load more comments (1 replies)
[-] JackbyDev@programming.dev 12 points 2 years ago

When it comes to architecture I'm not a genius, but I think a lot of how federation works is tough.

Plus, even if you have a perfectly efficient backend it can only handle so much based on the hardware you're running on. "Struggling to scale" can just mean "struggling to afford better hardware."

[-] TeeTwoLee@lemmy.world 10 points 2 years ago

I think threads like this one: https://news.ycombinator.com/item?id=28440742 help explain the complexity of performance. There's considerations from the development process, existing technology, simplicity, and more.

[-] DiagnosedADHD@kbin.social 10 points 2 years ago* (last edited 2 years ago)

I'm not convinced using rust is all that useful for web development. There are already plenty of other mature well optimized solutions for backend web development that includes a lot of security, qol features ootb like mature orm's with optimized database access which regardless of how fast your code is can bottleneck your site much faster if you don't have smart database access.

Yeah, theoretically rust can be faster than ruby/python/node, but it's harder to optimize it enough to get to that point and you will have a much harder time finding devs to work on such a project because the amount of backend devs that have enough rust experience is so small, they're like on the opposite sides of the ven diagram of languages for web developers.

On top of all that, languages which are heavily used for web development often get low level optimizations baked into the frameworks/languages so you can get pretty amazing performance uplifts over time to bring it to the level of lower level languages.

[-] AnonymousLlama@kbin.social 7 points 2 years ago

Whatever the performance bottlenecks are, I hope they can get them sorted so that any exponential growth caused by Reddit at the end of the month can be capitalized on.

I've got a feeling that other redditors who want to flee / branch out will look at these sites and so long as they see posts, comments and that it has a decent UI and loads well then people will stick around and build communities here.

[-] knoland@kbin.social 7 points 2 years ago

The real benefit as I see it for using rust for backends is memory safety.

[-] loren@sh.itjust.works 6 points 2 years ago

All the major languages for web backends are memory safe. Java, C#, etc

[-] C8H10N4O2@kbin.social 7 points 2 years ago

These are garbage collected languages and come with the overhead of such a process. Rust has no GC process and instead relies on reference counters to statically track live memory.

[-] eddythompson@kbin.social 16 points 2 years ago

"GC overhead" only matter for extreme realtime applications, like emulators, games, drivers, simulators, etc. a 10msec (or even a 100msec) pause in a request processing isn't gonna even be noticed when your network, database and disk IO are literally orders of magnitude higher. Use Rust for web services if you like the language, comfortable with it, etc. Don't use it because you think it'll give you "more performance" or "reduce GC overhead".

Java, C#, Python, Node, or even PHP as languages will never be your web backend bottleneck. Large scale web services performance tuning is entirely architectural. What caches you keep, how you organize your data, how many network operation does 1 user interaction translate to, stateful vs stateless components etc.

load more comments (9 replies)
load more comments (2 replies)
[-] TauZero@mander.xyz 7 points 2 years ago

I agree, hearing about scaling issues so early into adoption is concerning. Lemmy advocates say "horizontal scaling is already built in! just add more instances!", but that doesn't explain the problem.

It's all just text! By my guess too, handling text alone a server should easily support a thousand concurrent users, and hundreds of thousands of daily users. A RasPI should handle thousands. I've heard the bottleneck is the database? In that case Rust is not to blame, Postgres is.

But my fear is that the data structures are implemented in a trivial way. If you have a good reddit-sized thread with a thousand comments, but you store each comment as a separate database entry, then every pageview will trigger a thousand database lookups! The way I imagined making a reddit clone is that I would store the comments as a flat list with some tree data on top, such that serving a single page with 1000 comments is no different that streaming a 100K text file. I'll go take a look how Lemmy does it currently once I get the courage!

[-] heartlessevil@lemmy.one 6 points 2 years ago

If you have a good reddit-sized thread with a thousand comments, but you store each comment as a separate database entry, then every pageview will trigger a thousand database lookups!

No it wouldn't, that's called the N+1 query problem and it can be avoided by writing more efficient queries

load more comments (10 replies)
[-] Kaldo@kbin.social 5 points 2 years ago

But my fear is that the data structures are implemented in a trivial way. If you have a good reddit-sized thread with a thousand comments, but you store each comment as a separate database entry, then every pageview will trigger a thousand database lookups!

I mean, I hope that surely can't be the case. There is no reason why they wouldn't get all of those comments in a single request even if it were separate rows in the db. I initially thought that they'd have to query each individual instance participating in a thread but even that isn't the case because the fediverse protocol makes each instance mirror the content from others, meaning that your local instance should already have the necessary data in one place, making it easy to load it to the users.

load more comments
view more: next ›
this post was submitted on 15 Jun 2023
165 points (100.0% liked)

Programming

17670 readers
125 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Rules

  • Follow the programming.dev instance rules
  • Keep content related to programming in some way
  • If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev



founded 2 years ago
MODERATORS