submitted 6 months ago by Lugh@futurology.today to c/futurology@futurology.today

pennomi@lemmy.world 3 points 6 months ago

It depends. A lot of LLMs are memory-constrained. If you’re constantly thrashing GPU memory, it can be both slower and less efficient.
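To put rough numbers on the memory point, here is a back-of-envelope sketch in Python. It assumes a deliberately simplified model: every weight is read once per generated token, weights that fit in VRAM are read at GPU memory bandwidth, and anything that doesn't fit is streamed from host memory over a much slower link. The function name `decode_seconds_per_token` and all the figures (model sizes, 24 GB of VRAM, 1000 GB/s and 32 GB/s bandwidths) are made-up illustrations, not measurements, and the sketch ignores KV cache, activations, batching, and compute time.

```python
def decode_seconds_per_token(
    params_billion: float,
    bytes_per_param: float,
    vram_gb: float,
    vram_bw_gb_s: float,
    host_bw_gb_s: float,
) -> float:
    """Rough lower bound on per-token time, assuming each weight is read once per token.

    All inputs are hypothetical hardware/model figures supplied by the caller.
    """
    # 1 billion params * N bytes each is N GB of weights.
    weight_gb = params_billion * bytes_per_param
    # Whatever fits stays in VRAM; the rest has to be streamed in every token.
    resident_gb = min(weight_gb, vram_gb)
    spilled_gb = weight_gb - resident_gb
    return resident_gb / vram_bw_gb_s + spilled_gb / host_bw_gb_s


# Hypothetical 7B model at 2 bytes/param: fits entirely in 24 GB of VRAM.
fits = decode_seconds_per_token(7, 2, vram_gb=24, vram_bw_gb_s=1000, host_bw_gb_s=32)

# Hypothetical 70B model at 2 bytes/param: most weights spill out of VRAM.
spills = decode_seconds_per_token(70, 2, vram_gb=24, vram_bw_gb_s=1000, host_bw_gb_s=32)

print(f"fits in VRAM:   {fits:.3f} s/token")
print(f"spills to host: {spills:.3f} s/token")
```

Under these assumed numbers the model that spills spends almost all of its time waiting on the slow link, which is the "thrashing" case: more hardware time and more energy per token without doing more useful work.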

this post was submitted on 01 Dec 2024
