494
Apple study exposes deep cracks in LLMs’ “reasoning” capabilities
(arstechnica.com)
Real headline: Apple research presents possible improvements in benchmarking LLMs.
The part of the study where they talk about how they determined the flawed mathematical formula it used to calculate the glue-on-pizza response was mindblowing.
^(I^ ^did^ ^not^ ^read^ ^the^ ^study.)^
This is a most excellent place for technology news and articles.