Speaking Portrait

Enabling photorealistic avatars,
using just text or audio as input

Speaking Portrait allows users to create a realistic video of a human presenter, without any video production. Simply input an image and either text or an audio file, and a video is automagically created by our AI-based reenactment technology.

The technology enables companies to easily transform articles, websites, and corporate marketing materials into videos, at scale, without the need for costly productions and studios, and without actually filming an actor.
Speaking Portrait is just one element of D-ID’s AI Face Platform, which also includes Live Portrait and Face Lit. Our reenactment-based products offer groundbreaking capabilities, enabling the creation of highly personalized media using AI, specifically in e-learning, corporate training, marcoms, AI assistants, history and the Metaverse.

How It Works

1 Upload Image
2 Upload Audio or Text
3 Create Magic



Create training videos

Make text more engaging by turning it into video. Enhance learning by letting audiences experience history in a more emotional, human way.


Educational video maker

Convert text or audio into a video presenter for faster and cheaper production of marketing, training, and customer service videos.

Video production

Enable AI-based media creation for faster and cheaper video production.