AIModels.fyi

Latest

Teaching AI to see websites like a human made it more capable

Tencent's AI can now complete the majority of its tasks on Google, Amazon, and Wikipedia
aimodels-fyi Jan 28, 2024
"Flow engineering" doubles code generation accuracy (19% vs 44%)

The authors of a new paper present an approach that "intensifies" code generation.
aimodels-fyi Jan 20, 2024
Google's new LLM doctor is right way more often than a real doctor

The LLM's differential diagnosis list had the correct diagnosis 59% of the time, vs. 34% for human doctors.
aimodels-fyi Jan 13, 2024
All LLM improvements are just task contamination?

Current benchmarks are probably overestimating the true capabilities of LLMs
aimodels-fyi Jan 4, 2024
Prompting with unified diffs makes GPT-4 write much better code

A developer of an open-source pair-programming tool discovered the trick
aimodels-fyi Dec 30, 2023
Finally, a model to paint 3D meshes with high-res UV texture maps

Finally, we have an AI tool that can paint high-res UV maps on 3D meshes using text or image prompts.
aimodels-fyi Dec 22, 2023
Google DeepMind says AI has discovered new solutions to 2 famous math problems

This announcement is hot on the heels of another DeepMind paper that used AI to discover hundreds of new materials.
aimodels-fyi Dec 16, 2023

AIModels.fyi © 2025. Powered by Ghost