ByteDance announces OmniHuman-1: AI tool that creates lifelike videos

OmniHuman-1 was trained on over 18,700 hours of human videos and uses multiple input types

Tech Desk - Feb 08, 2025

ByteDance, the parent company of TikTok, has introduced OmniHuman-1, a powerful new AI tool that can generate realistic videos of people from just a single photo.

This AI can make a still image appear to talk, play instruments, and even move naturally, making it more advanced than existing tools.

How OmniHuman-1 works?

According to ByteDance, OmniHuman-1 significantly improves upon current AI video tools by creating lifelike human videos using minimal input, such as a simple photo and an audio clip.

It can animate portraits, half-body, and full-body images, ensuring natural facial expressions, hand gestures, and body movements.

A research paper published on arXiv highlights that this AI tool outperforms competitors, delivering high-quality results in various scenarios. Unlike other AI models that mainly modify facial expressions, OmniHuman-1 can generate full-body animations and even animate animals.

What makes OmniHuman-1 unique?

Researchers shared sample videos on Beehiiv, showcasing OmniHuman-1's ability to create realistic movements from multiple angles. One example features a black-and-white video of Albert Einstein, where the AI makes him speak and gesture naturally in front of a blackboard.

Moreover, ByteDance says OmniHuman-1 was trained on over 18,700 hours of human videos and uses multiple input types, including text, audio, and physical movements. This deep learning approach allows it to generate hyper-realistic videos.