The New Voice Capabilities of Copilot Studio: A Comprehensive Overview
This article is the second in a mini-series. In the first part, I discussed integrating the voice channel for Dynamics 365 Contact Center and Dynamics 365 Customer Service Premium.
Introduction
In the rapidly evolving landscape of artificial intelligence, Copilot Studio has consistently been at the forefront, pushing the boundaries of what is possible. The latest addition to its impressive suite of features is the introduction of new voice capabilities. This innovative feature has the potential to revolutionize the way we interact with AI, making it more natural, intuitive, and user-friendly.
Advanced Speech Recognition and DTMF Support
The cornerstone of these new voice capabilities is an advanced speech recognition system. This cutting-edge technology accurately transcribes spoken words into text. It’s designed to work effectively even in noisy environments or when dealing with various accents, demonstrating the robustness and versatility of the technology that powers Copilot Studio.
In addition to speech, Copilot Studio also supports Dual-Tone Multi-Frequency (DTMF) input. This feature allows users to interact with the AI by pressing keys on their phone keypad. This dual mode of interaction - voice, and keypad - ensures that users have multiple ways to communicate with the AI, enhancing accessibility and convenience.
Multilingual Support
One of the most impressive features of the new voice capabilities is the multilingual support. Copilot Studio can understand and respond to commands in multiple languages. This makes it a truly global AI companion, capable of serving users from around the world. By breaking down language barriers, this feature makes AI technology accessible to a much broader audience.
Real-Time Responses and Error Handling
Another noteworthy feature is the ability to provide real-time responses. As soon as a user finishes speaking, Copilot Studio can immediately process the command and provide a response. This real-time interaction significantly enhances the user experience, making the conversation with the AI feel more like a conversation with a human.
Copilot Studio is equipped with a comprehensive error handling mechanism in case of any issues. It provides detailed error codes to help troubleshoot and diagnose runtime errors, ensuring a smooth and seamless user experience.
Getting Started with Voice Capabilities
Users must configure their Copilots for Voice in Copilot Studio to get started with the new voice capabilities. This involves creating a new copilot, configuring the voice-related settings, and connecting the copilot to Omnichannel for Customer Service. Users also have the option to build a voice-enabled copilot from a template, which includes specific voice system topics for handling voice scenarios.
Create a new Copilot
create a new bot on https://copilotstudio.microsoft.com
Configure Voice Settings
go to Settings > Copilot Details > Voice
Mark "Use voice as a primary author mode"
Configure Telefony Channel
- Go to Channels > Channels, click "Telephony" and activate it.
Configure Handover to Dynamics 365 Customer Service or Dynamics 365 Contact Center
go to Channels > Customer Engagement Hub > Dynamics 365 Customer Service
If you have activated it, the result should look similar to the picture below.
Navigate to the Dynamics 365 "Customer Service Admin Center" or "Contact Center Admin Center," proceed to your voice workstream, and click 'Add bot.'
Then, select the bot you have recently created. The outcome should resemble the screenshot provided.
Greet your calling customer by name
To greet a caller by name, you can utilize the caller ID and an associated contact in Dynamics Customer Service by adding a variable to a text node.
Testing Your Voice-Enabled Copilot
Once the copilot is set up, users can test it using text inputs that simulate a user’s input through speech or DTMF. This allows users to see the speech response in the text output and verify voice features such as Barge-in and DTMF in their text input.
Conclusion
The new voice capabilities of Copilot Studio mark a significant advancement in AI technology. They offer enhanced speech recognition, multilingual support, and real-time responses for contact centers. These features make interactions with AI more natural and intuitive and broaden accessibility for users worldwide. Following the steps outlined in this article, companies can easily configure and test their voice-enabled copilots, ensuring a seamless and efficient experience. As AI continues to evolve, innovations like these will undoubtedly play a crucial role in shaping the future of human-computer interaction.
Subscribe to my newsletter
Read articles from Holger Imbery directly inside your inbox. Subscribe to the newsletter, and don't miss out.
Written by
Holger Imbery
Holger Imbery
I talk and write about conversational AI, robotic process automation, and things that keep an architect awake at night. Your feedback on my articles is highly appreciated, and I would like to invite you to a professional discussion. I'm a Principal Architect @NTTDATA focusing on Microsoft Power Platform and Azure.