Plain English Papers
Chain-of-thought is so hot right now. You shouldn't always use it. When does CoT prompting make AI worse? Just because everyone else is using CoT prompting doesn't mean you should! You have to consider the USE CASE!
All LLMs use tokenization. Are we doing it totally wrong?
Slashing model size by 85% while redefining how we build adaptable, efficient LLMs