107
SearNGX should be a federated search engine
(github.com)
A community to talk about the Fediverse and all it's related services using ActivityPub (Mastodon, Lemmy, KBin, etc).
If you wanted to get help with moderating your own community then head over to !moderators@lemmy.world!
Learn more at these websites: Join The Fediverse Wiki, Fediverse.info, Wikipedia Page, The Federation Info (Stats), FediDB (Stats), Sub Rehab (Reddit Migration)
I really want to use this, but from what I read it basically requires a minimum of 20-30GB of RAM to be performant. Also the documentation appears to be a mess and highly outdated. I'd also want to cluster it internally and connect with outside peers still which seems possible, but with the large resource requirement not as feasible with my setup.
I've run it in containers, never used that many resources. The whole server (running a few dozen containers) was 32gb, and it wasn't impacted in any sensible way.
That is misinformation. It doesn't need anywhere close to that amount of RAM. It's pretty much like other webapps and I used to run it on an old computer. It'll fill up your harddisk, though. If you allow it to do that.
There also seems to be a lot of settings so perhaps they had it misconfigured. It also is Java so I wouldn't put it past it for such a monolith of a Java program to require so much to be performant. Perhaps I'll try a cluster of them then and see how it fares.
Well initial setup was definitely interesting. I didn't want to expose 8090 and wanted it behind a web proxy and I finally got that working and actually received my first remote crawl overnight. I had to change to 80/443 internally so it would map correctly for p2p connections, public port setting doesn't apparently cut it. I kinda dislike the whole setup with it micromanaging CPU load, but otherwise it doesn't seem atrocious for a new peer at least, I guess this and the web proxy problems are likely awkward due to the age of the software.