this post was submitted on 02 Feb 2025
360 points (100.0% liked)
Technology
I also have a 3060. Can you detail which framework (SGLang, Ollama, etc.) you're using and how you got that speed? I'm having trouble reaching that level of performance. Thanks.
Ollama, latest version. I have it set up with Open-WebUI (though that shouldn't matter). The 14B is around 9GB, which easily fits in the 12GB.
I'm repeating the 28 t/s from memory, but even if I'm wrong it's easily above 20.
Specifically, I'm running this model: https://ollama.com/library/deepseek-r1:14b-qwen-distill-q4_K_M
Edit: I confirmed I do get 27.9 t/s, using default ollama settings.
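For anyone wanting to reproduce the measurement: `ollama run <model> --verbose` prints an `eval rate` in tokens/s after each response, and the HTTP API's `/api/generate` response includes `eval_count` (tokens) and `eval_duration` (nanoseconds), from which the same figure follows. A minimal sketch of that arithmetic (the 279-token example values are illustrative, not from my run):

```python
# Decode throughput from the timing fields Ollama returns:
# eval_count is generated tokens, eval_duration is in nanoseconds.
def tokens_per_second(eval_count: int, eval_duration_ns: int) -> float:
    return eval_count / (eval_duration_ns / 1e9)

# e.g. 279 tokens generated in 10 s of decode time:
print(round(tokens_per_second(279, 10_000_000_000), 1))  # → 27.9
```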
Ty. I'll try Ollama with the q4_K_M quantization. I wouldn't expect to see a difference between Ollama and SGLang.
Thanks for the additional information; that helped me decide to get the 3060 12GB instead of the 4060 8GB. They're almost the same price, but for my use cases the 3060 12GB seems the better fit even though it's a generation older: the memory bus is wider and it has more VRAM. Both video editing and the smaller LLMs should work well enough.