[-] morrowind@lemm.ee 3 points 1 day ago

I want to clarify something. Reranker is a general term that can refer to any model used for reranking. It is independent of implementation.

What you refer to

because reranker models look at the two pieces of content simultaneously and can be fine tuned to the domain in question. They shouldn't be used for the initial retrieval because the evaluation time is O(n²) as each combination of input

Is a specific implementation known as CrossEncoder that is common for reranking models but not retrieval ones for the reasons you described. But you can also use any other architecture

19
11
[-] morrowind@lemm.ee 1 points 4 days ago

autotracers can't generate svgs from text

[-] morrowind@lemm.ee 3 points 6 days ago

Claude frequently draws svgs to illustrate things for me (I'm guessing it's in the prompt) but even though it's better at it than all the other models, it still kinda sucks. It's just fudamentally dumb task to do for a purely language model, similar to the arc-agi benchmark , just makes more sense for a vision model and trying to get an llm to do is a waste

19
[-] morrowind@lemm.ee 1 points 1 week ago

what is the license? The link on hf just 404s

4
[-] morrowind@lemm.ee 2 points 2 weeks ago

Very similar to chain of draft but seems more thorough

12
5
18
[-] morrowind@lemm.ee 3 points 3 weeks ago

It matches R1 in the given benchmarks. R1 has 671B params (36 activated) while this only has 32

[-] morrowind@lemm.ee 2 points 3 weeks ago

insane, absolutely insane

8
13

morrowind

joined 1 month ago