Follow my blog to learn about artificial intelligence and stay up-to-date with the most interesting news in the field!
You can find all my previous articles on my Medium account: https://whats-ai.medium.com/membership
The hottest image editing AI, InstructPix2Pix, explained!
The VALL-E Model explained.
An interview with the Director of Perception at Zoox, Ruijie (RJ) He, with the goal of demystifying what makes a good profile for landing an ML engineer job and how to perform well in the interviews.
Disney's New Model Explained
What is a prompt engineer, and how can you become a better one…
OpenAI's most recent conversational AI explained
Efficient NeRFs for Real-Time Portrait Synthesis (RAD-NeRF)
Text Embedding Explained
Galactica, Meta AI's most recent model: The AI Scientist
Generate infinite new frames as if you were flying into your image!
NVIDIA's new model has better results, more control and more fidelity than DALLE and Stable Diffusion!
Here's every vision application where diffusion models were a game changer in 2022: image, text, video, 3D, and more!
Imagic: Manipulate images using pre-trained image generator models!
How AI generates 3D models from only text!
A good transcription tool that accurately understands what you say and writes it down
Generates videos from text!
What does such a model understand when it sees such a picture or, even more complex, a video?
Personalizing Text-to-Image Generation using Textual Inversion
A New Challenging Task for AI: panoptic scene graph generation
A High-Resolution Image Synthesis Architecture: Latent Diffusion
Create deformable 3D models from pictures with BANMo!
"Make-A-Scene": a fantastic blend between text and sketch-conditioned image generation.
DALL·E 2 Pre-Training Mitigations
They reconstruct sound using cameras and a laser beam on any vibrating surface, allowing them to isolate musical instruments, focus on a specific speaker, remove ambient noise, and enable many more amazing applications.