Sign in Subscribe

Mike Young

Results from the study.

Can Large Language Models Self-Correct Their Own Reasoning? Probably Not.

A new paper takes a critical look at the promise and limits of self-correction

PIT diagram

Enabling Language Models to Implicitly Learn Self-Improvement

Rather than manually distilling criteria into prompts, implicit information in preference data can be leveraged.

Diagram from the paper

LLMs can be extended to infinite sequence lengths without fine-tuning

LLMs trained with a finite attention window can be extended to infinite sequence lengths without any fine-tuning.

Example architecture from the paper

Tool-Integrated Reasoning: A New Approach for Math-Savvy LLMs

TORA combines both rationale-based and program-based reasoning to deliver results to math problems that were previously too difficult for LLMs to solve

Showing the result of adding explicit registers

Researchers discover explicit registers eliminate vision transformer attention spikes

When visualizing the inner workings of vision transformers (ViTs), researchers noticed weird spikes of attention on random background patches. Here's how they fixed them.

Examples

Marrying Pixel and Latent Diffusion Models for Efficient and High-Quality Text-to-Video Generation

A new paper proposes Show-1, a hybrid model that combines pixel and latent diffusion for efficient high-quality text-to-video generation.

VideoDirectorGPT diagram showing how the tool works

UNC Researchers Present VideoDirectorGPT: Using AI to Generate Multi-Scene Videos from Text

The key innovation proposed is decomposing multi-scene video generation into two steps: a director step and a "film" step.