D-ID’s API now supports synchronistic generation of videos from audio files. With a rendering time of 100 FPS, it’s 4X faster than real-time! Handling tens of thousands of requests in parallel, over 150 million videos have been generated to date.
Step 1: Add a face
A single image is all it takes to create a talking head video. Use any image of a face and make it talk with a simple API request. Use them to make business content more cost-effective, engaging and human.
Give your AI Presenter a voice by choosing from hundreds of available text-to-speech options or uploading an audio recording of your own. D-ID’s software lets you personalize video, at scale, in over 100 languages, and with zero technical knowledge.
Real-time video streaming opens up a new world of possibilities
D-ID’s API enables synchronistic generation of video of digital people from an image and an audio file. Integrate it with your AI chatbot to create face-to-face CX conversations, use it to create real-time video call avatars or add it to your character-based online game. The possibilities are endless.