Tech publication Fast Company sat down with our CEO Gil Perry and heard from him about the new features in the Creative Reality studio and the wide range of use cases this technology will enable. Perry also offers a glimpse of what’s to come down the pipeline. Spoilers… Talking avatars enter the streaming space.
Like Lensa, D-ID’s new web app uses Stable Diffusion to generate synthetic humans—only instead of resulting in static avatars, D-ID creates animated upper torsos and heads generated via prompts like, “blonde woman with elvish ears and green skin.”
After you pick your avatar from whatever Stable Diffusion spits out, D-ID’s interface has a field in which you can write a script or ask OpenAI’s GPT3 to write one for you. You can enter “five reasons why you should never put chorizo in paella,” for instance, and the AI will generate a script right then and there. D-ID’s app lets you pick the language, the voice, and intonation your avatar should use. After that, it’s just a matter of clicking create. This is where the system uses text-to-speech AI by Amazon and D-ID’s proprietary animation algorithm to produce the final output: a Harry Potter-esque animated portrait that can say whatever you want.
To read the full article, click the Read More button