[-] vivendi@programming.dev 3 points 4 months ago* (last edited 4 months ago)

That's for quanting a model yourself. You can instead (read that as "should") download an already quantized model. You can find quantized models from the HuggingFace page of your model of choice. (Pro tip: quants by Bartowski, Unsloth and Mradermacher are high quality)

And then you just run it.

You can also use Kobold.cpp or OpenWebUI as friendly front ends for llama.cpp

Also, to answer your question, yes.

[-] vivendi@programming.dev 3 points 5 months ago

YES, BASED

IN THIS HOUSE WE COMMIT TO MASTER

[-] vivendi@programming.dev 3 points 6 months ago

You need multi-shot prompting when it comes to math. Either the motherfucker gets it right, or you will not be able to course correct it in a lot of cases. When a token is in the context, it's in the context and you're fucked.

Alternatively you could edit the context, correct the parameters and then run it again.

On the other side of the shit aisle

Shoutout to my man Mistral Small 24B who is so insecure, it will talk itself out of correct answers. It's so much like me in not having any self worth or confidence.

[-] vivendi@programming.dev 3 points 6 months ago

There is a HUGE compilation on the subreddit for cyberpunk2077, but basically we got promised a vast, in depth RPG and instead got something mechanically on part with GTA Vice City and Call of Duty

[-] vivendi@programming.dev 3 points 6 months ago

Had a friend who was a freak ass like that actually

People's dicks really are putting in overtime

[-] vivendi@programming.dev 3 points 6 months ago

Exactly, a lot of the "AI Panic" is from people using ClosedAI's dogshit system, non-finetuned model and Instruct format.

[-] vivendi@programming.dev 3 points 7 months ago* (last edited 7 months ago)

I will cite the scientific article later when I find it, but essentially you're wrong.

[-] vivendi@programming.dev 3 points 7 months ago* (last edited 7 months ago)

Yeah this is on me, I've never done web dev in my life. Always been low level shit for me. (The project is a highly networked system with isolates, async/await, futures, etc)

But damn man comparatively JS/TS seems a lot easier ¯\_ (ツ) _/¯

[-] vivendi@programming.dev 3 points 7 months ago

Honestly? They're based for being so easy to make

For the record, I am a C/Dart/Rust native dev 2+ years deep in a pretty big project full of highly async code. This shit would've been done a year ago if the stack was web based instead of 100% native code

[-] vivendi@programming.dev 4 points 7 months ago

AI would do a better job

[-] vivendi@programming.dev 3 points 7 months ago

Another great thing about BTRFS is that it can detect hardware problems sooner: if your BTRFS drive keeps losing data to corruption; that's because it has detected a corruption that other FS's would silently work with

view more: ‹ prev next ›

vivendi

joined 8 months ago