11
submitted 5 days ago by anzo@programming.dev to c/selfhost@lemmy.ml

cross-posted from: https://programming.dev/post/51407459

Check what can you use and at what rate of token per seconds would it be... It has examples of many models and quantization levels. Huge resource!

you are viewing a single comment's thread
view the rest of the comments
[-] shellington@piefed.zip 1 points 5 days ago

Thank you it suggested a model for my GPU that is far better than what i was using and still has a decent output speed.

this post was submitted on 03 Jun 2026
11 points (100.0% liked)

Self Hosted - Self-hosting your services.

19876 readers
1 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

Rules

Important

Cross-posting

If you see a rule-breaker please DM the mods!

founded 5 years ago
MODERATORS