AIModels.fyi

Latest

Long Context Compression with Activation Beacon

Differential Transformers

LLMs will lie forever

What's (actually) up with o1

AI can (kinda) generate novel ideas

AI agents can collude using hidden messages!

🥇Top ML papers of the week

Guides

How to improve your semantic search with hypothetical document embeddings

How to use a simple LLM call to dramatically improve the quality of your semantic search results

From chaos to clarity with AI-driven categorization

Swap Faces Seamlessly with the Faceswap Model

Build Your Own Epik-inspired App: Transform Selfies into '90s Yearbook Photos with Node.js and AI

One-Shot Face Stylization with JoJoGAN

DiffAE: How to use AI to make your friends look bald, happy, young, blond, old - you name it!

How to turn text into music with Facebook's MusicGen

Plain English Papers

Long Context Compression with Activation Beacon

Differential Transformers

LLMs will lie forever

AI can (kinda) generate novel ideas

AI agents can collude using hidden messages!

🥇Top ML papers of the week

Training on code improves LLM performance on non-coding tasks

Build In Public

Different now

The week the internet actually changed forever

AI and Video Game Development (with the DataScienceAtHome podcast!)

How to improve your semantic search with hypothetical document embeddings

You're invited to submit your AI tool to AIModels.fyi

Architecture

GPT-4 Doesn’t Know It’s Wrong: An Analysis of Iterative Prompting for Reasoning Problems

How Sibylline built a cybersecurity offering while handling foundational LLM quirks

AIModels Release Notes: Search, Share, and Deep Dives - All the New Features You Asked For

News

AI and Video Game Development (with the DataScienceAtHome podcast!)

ChatGPT can now remember custom instructions across prompts. Here's how to use it.

New study validates user rumors of degraded GPT-4 performance

AIModels.fyi update - New additions to the site

AIModels.fyi - June 2023 Monthly AI Model Roundup

LangChain

A Plain English Guide to Reverse-Engineering Reddit's Source Code with LangChain, Activeloop, and GPT-4

Getting Started with the Vercel AI SDK: Building Powerful AI Apps

A Plain English Guide to Reverse-Engineering the Twitter Algorithm with LangChain, Activeloop, and DeepInfra

A Beginner's Guide to Unstructured Data Analysis with LangChain and DeepInfra

Building a Customer Support Chatbot with LangChain and DeepInfra: A Step-by-Step Guide

OpenAI

What's (actually) up with o1

The GPT store is stupid and dead

How Sora (actually) works

ChatGPT can now remember custom instructions across prompts. Here's how to use it.

New study validates user rumors of degraded GPT-4 performance