Latest
Meta traded its biggest community asset for a commerce engine
Introduction to Stable Diffusion Img2Img: Shaping the Future of Image Generation
Netflix's VOID shows video editing has finally learned the laws of physics
Turning sound to sight with the Audio-To-Waveform AI Model
Simplifying transformer blocks
Transforming Spaces with Artificial Intelligence
Plain English Papers
Netflix's VOID shows video editing has finally learned the laws of physics
By treating object removal as a causal simulation rather than a pixel-patching job, VOID eliminates "ghost" physics from edited scenes
Simplifying transformer blocks
SmolDocling: An Ultra-Compact VLM for Document Understanding
What's the best AI model to handle $1 million in freelance software engineering?
Creating artificial doubt significantly improves AI math accuracy
Step-by-step reasoning can fix madman logic in vision AI
All LLMs use tokenization. Are we doing it totally wrong?
Guides
Turning sound to sight with the Audio-To-Waveform AI Model
Turn spoken audio or music into a waveform using a simple AI model
From AI Models to AI Products: Turning Intelligence into Impact
How to improve your semantic search with hypothetical document embeddings
From chaos to clarity with AI-driven categorization
Swap Faces Seamlessly with the Faceswap Model
Build Your Own Epik-inspired App: Transform Selfies into '90s Yearbook Photos with Node.js and AI
One-Shot Face Stylization with JoJoGAN
Build In Public
Transforming Spaces with Artificial Intelligence
How sites like Arlington Avenue use AI-Powered Interior Design
Different now
How to improve your semantic search with hypothetical document embeddings
You're invited to submit your AI tool to AIModels.fyi
GPT-4 Doesn’t Know It’s Wrong: An Analysis of Iterative Prompting for Reasoning Problems
How Sibylline built a cybersecurity offering while handling foundational LLM quirks