submitted 5 days ago by anzo@programming.dev to c/selfhost@lemmy.ml

cross-posted from: https://programming.dev/post/51407459

Check which models you can run and at what rate, in tokens per second. It has examples for many models and quantization levels. Huge resource!

CallMeAl@piefed.zip 4 points 5 days ago

I don't know why you are spamming this across the Threadiverse, but this is a data-harvesting site with no respect for privacy that loudly announces how important privacy is. No thanks.

anzo@programming.dev 2 points 4 days ago

I regret having made so many crossposts. It was an impulse, as I wasn't sure where to post... I was also interested in seeing what fellow lemmings' take was, and it's been enriching for me in that regard. I wondered how much people were really self-hosting LLMs... Apparently not that much.

YodaDaCoda@aussie.zone 1 points 5 days ago

How do you figure it's a data harvesting site? It's actually suggested a model that works pretty well for me.

CallMeAl@piefed.zip 2 points 5 days ago

Because I looked at the source code on the site. Being a data harvester doesn't mean they don't give real answers. Just that there is no such thing as a free lunch. You got a suggestion and they got your data.

rob951951@piefed.social 1 points 4 days ago

I really like the layout and the suggestions on this site. Great work!

shellington@piefed.zip 1 points 5 days ago

Thank you, it suggested a model for my GPU that is far better than what I was using and still has a decent output speed.

this post was submitted on 03 Jun 2026