Microsoft has recently introduced an exciting new feature called Azure AI Speech text, which allows users to create lifelike talking video avatars and interactive bots. This text-to-speech feature with vision capabilities enables users to input text and generate synthetic videos of photorealistic avatars speaking in real-time.
During the ‘Microsoft Ignite’ event, the company revealed that their Neural text-to-speech Avatar models are trained using deep neural networks based on human video recording samples. The voices of these avatars are provided by text-to-speech voice models. With this new feature, users can now create more engaging digital interactions by utilizing avatars to build conversational agents, virtual assistants, chatbots, and more.
The introduction of text-to-speech avatars by Microsoft aims to protect the rights of individuals and society, promote transparent human-computer interaction, and combat the proliferation of harmful deepfakes and misleading content. To ensure these goals are met, the custom avatar feature is currently available through a limited access registration process that is only open to specific use cases.
Microsoft offers two versions of their text-to-speech avatars: prebuilt avatars and custom avatars. The prebuilt avatars are available as out-of-the-box products on Azure, where subscribers can select avatars that can speak various languages and voices based on the text input. These avatars can be used to create video content and interactive applications with real-time avatar responses.
In order to utilize the custom text-to-speech avatar feature for business applications, users must apply for access through the registration process specified by Microsoft. By adhering to these measures, Microsoft seeks to ensure responsible and ethical use of their technology.
The introduction of Azure AI Speech text brings an exciting new dimension to digital interactions, allowing users to create lifelike talking videos and interactive bots. With its vision capabilities and ability to generate real-time synthetic videos, this feature opens up a world of possibilities for businesses and individuals alike. It enables them to engage with their audience in a more dynamic and interactive way, while also safeguarding against the risks associated with deepfakes and misleading content.
Microsoft’s commitment to creating a responsible and transparent framework for the use of text-to-speech avatars is evident through their limited access registration process. This ensures that the technology is used in a manner that aligns with ethical standards and protects the rights of individuals. By offering prebuilt avatars and custom avatars, Microsoft caters to the varied needs and preferences of their subscribers, further enhancing the versatility and applicability of this innovative feature.
Overall, the introduction of Azure AI Speech text marks another important milestone in the field of artificial intelligence. Microsoft continues to push the boundaries of what is possible, revolutionizing the way we interact with technology and paving the way for a more immersive and engaging digital future.