Sign in Subscribe

Latest

Turning images into 3D models in minutes, not hours

Turning images into 3D models in minutes, not hours

A new method that extracts accurate and editable meshes from 3D Gaussian Splatting representations within minutes on a single GPU

The Chosen One: Generating Consistent Characters from Text Descriptions Using AI

The Chosen One: Generating Consistent Characters from Text Descriptions Using AI

Among a large set of images from the same text prompt, some will naturally share common visual features.

Architecture for the research

Researchers tried to build an autonomous scientist with AI. How'd it go?

Researchers decided to see if GPT-4 could generate and test hypotheses without human guidance. What happened?

Meta unveils Emu Video: Text-to-Video Generation through Image Conditioning

Meta unveils Emu Video: Text-to-Video Generation through Image Conditioning

The new approach uses "explicit image conditioning" for higher quality videos

Emu Edit Example

Meta unveils Emu Edit: Precise image editing via text instructions

Existing systems struggle to interpret edit instructions correctly. Emu Edit tackles this through multi-task training.

Latent space symptom visuals

You can predict disease progression by modeling health data in latent space

Forecasting personalized disease progression by modeling clinical data in a latent space

Researchers taught GPT-4V to use an iPhone and buy things on the Amazon app

Researchers taught GPT-4V to use an iPhone and buy things on the Amazon app

It's still early, but a GPT-4V agent can navigate smartphone GUIs using a combination of image processing and text-based reasoning.