Home
Creators
Models
Notes
Advertise
🎉 Support my work
Sign in
Subscribe
Mike Young
Bye tokens, hello patches
Meta announces a better way to scale LLMs
Input prompt, output a playable world
Google built a world model that actually works
ChatGPT naturally colludes to raise prices
Making AI See Better
Chain-of-thought is so hot right now. You shouldn't always use it.
Get ready to lose to Transformers on Lichess
They can hit 2895 Elo … without memorizing patterns
Long Context Compression with Activation Beacon