66
submitted 9 months ago by ylai@lemmy.ml to c/fosai@lemmy.world
you are viewing a single comment's thread
view the rest of the comments
[-] bassomitron@lemmy.world 2 points 9 months ago

Do you know if there are any plans to quantize it? I'd love to test it, but my 3090 can't handle 70b models without quantization, unfortunately.

[-] midnight@kbin.social 3 points 9 months ago* (last edited 9 months ago)

There are quantized versions on hugging face. There's a q2 version, but idk how well that performs

[-] fhein@lemmy.world 2 points 9 months ago

Only quantized versions of the model were leaked. If you see any unquantized version of it then it's something which was recreated from these, and not the original model. People have also requanted it from GGUF to EXL2 and probably other formats too.

this post was submitted on 01 Feb 2024
66 points (100.0% liked)

Free Open-Source Artificial Intelligence

2886 readers
1 users here now

Welcome to Free Open-Source Artificial Intelligence!

We are a community dedicated to forwarding the availability and access to:

Free Open Source Artificial Intelligence (F.O.S.A.I.)

More AI Communities

LLM Leaderboards

Developer Resources

GitHub Projects

FOSAI Time Capsule

founded 1 year ago
MODERATORS