140
Chinese AI startup DeepSeek overtakes ChatGPT on Apple App Store
(www.reuters.com)
This is a most excellent place for technology news and articles.
My point was a mixture of Experts model could suffer from generalization. Although in reading more I'm not sure if it's the newer R model that had the MoE element.