Turning images into 3D models in minutes, not hours A new method that extracts accurate and editable meshes from 3D Gaussian Splatting representations within minutes on a single GPU
The Chosen One: Generating Consistent Characters from Text Descriptions Using AI Among a large set of images from the same text prompt, some will naturally share common visual features.
Researchers tried to build an autonomous scientist with AI. How'd it go? Researchers decided to see if GPT-4 could generate and test hypotheses without human guidance. What happened?
Meta unveils Emu Video: Text-to-Video Generation through Image Conditioning The new approach uses "explicit image conditioning" for higher quality videos
Meta unveils Emu Edit: Precise image editing via text instructions Existing systems struggle to interpret edit instructions correctly. Emu Edit tackles this through multi-task training.
You can predict disease progression by modeling health data in latent space Forecasting personalized disease progression by modeling clinical data in a latent space
Researchers taught GPT-4V to use an iPhone and buy things on the Amazon app It's still early, but a GPT-4V agent can navigate smartphone GUIs using a combination of image processing and text-based reasoning.