RPA speech to text

HPB-RPA
Level 2

Hello, is it possible to build an RPA solution that enters certain data into an application based on a telephone conversation? As far as I have researched, this is not possible without a speech-to-text application. Am I right, or is there a way?

1 BEST ANSWER

Helpful Answers

steven.boggs
Staff

You're correct that Blue Prism has no built-in "voice to text" functionality. However, it should be possible to automate a third-party solution that performs this function using Blue Prism. You could research and prototype the widely available voice-to-text solutions and work out how best to interface with them using Blue Prism code stages. Most of the big vendors (Google, Microsoft, etc.) expose this functionality through very usable APIs and/or web services, so interfacing with them from Blue Prism should be straightforward. As an example, Microsoft offers the following service, which you can probably call from Blue Prism: Speech-to-text overview - Speech service - Azure Cognitive Services | Microsoft Learn
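To make the "call a vendor API" idea concrete, here is a minimal Python sketch of what a request to the Azure Speech short-audio REST endpoint might look like. It is an illustration under assumptions, not a Blue Prism implementation: the function names, the region, and the key are hypothetical, the endpoint path, headers, and response fields reflect my understanding of the Azure documentation and should be checked against it, and a Blue Prism code stage would be written in C# or VB.NET rather than Python. The request shape (audio in the body, key in a header, JSON back) is the part that should carry over.

```python
import json
import urllib.parse
import urllib.request


def build_stt_request(region, key, language="en-US"):
    """Return (url, headers) for a short-audio recognition call.

    Assumption: endpoint path and header names follow the Azure Speech
    REST API for short audio; verify them against the current docs.
    """
    query = urllib.parse.urlencode({"language": language})
    url = (
        f"https://{region}.stt.speech.microsoft.com"
        f"/speech/recognition/conversation/cognitiveservices/v1?{query}"
    )
    headers = {
        "Ocp-Apim-Subscription-Key": key,
        "Content-Type": "audio/wav; codecs=audio/pcm; samplerate=16000",
        "Accept": "application/json",
    }
    return url, headers


def parse_transcript(response_body):
    """Extract the recognized text from the JSON reply, or '' if none."""
    data = json.loads(response_body)
    if data.get("RecognitionStatus") != "Success":
        return ""
    return data.get("DisplayText", "")


def transcribe(wav_bytes, region, key):
    """Send WAV audio to the service and return the transcript (needs network)."""
    url, headers = build_stt_request(region, key)
    request = urllib.request.Request(url, data=wav_bytes, headers=headers, method="POST")
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_transcript(response.read().decode("utf-8"))
```

The returned transcript is plain text, so the rest of the process (writing the values into the target application) can stay ordinary Blue Prism automation.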

2 REPLIES

Nandhakumar
Level 6

Hi IPrimorac,

Yes, steven.boggs is right.

To give you some insight, here is our Blue Prism Gen AI Hackathon use case, which relates to your query.

Our use case was to read the content of an audio recording (a telephone call) and book an appointment based on the information given by the caller.

We used Google Speech-to-Text (Speech-to-Text | Google Cloud) to extract the details from the telephone conversation and fed the transcript to Gen AI to understand the context. This helped the hospital with booking appointments with the correct specialist, rescheduling, and other services.
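The two steps described above (transcribe the call, then hand the transcript to a generative model) could be sketched roughly as follows. This is not the hackathon code. The helper names and the prompt wording are hypothetical, and the JSON shapes assume the Google Cloud Speech-to-Text v1 `speech:recognize` REST method, which as far as I know is meant for short clips, so longer calls would likely need the long-running variant.

```python
import base64
import json


def build_recognize_body(audio_bytes, sample_rate_hz=8000, language="en-US"):
    """Build the JSON body for a speech:recognize request.

    Assumption: 16-bit linear PCM audio at a telephony sample rate.
    """
    config = {
        "encoding": "LINEAR16",
        "sampleRateHertz": sample_rate_hz,
        "languageCode": language,
    }
    content = base64.b64encode(audio_bytes).decode("ascii")
    return json.dumps({"config": config, "audio": {"content": content}})


def join_transcript(response_body):
    """Concatenate the top alternative of each result into one transcript."""
    data = json.loads(response_body)
    parts = [
        result["alternatives"][0]["transcript"].strip()
        for result in data.get("results", [])
        if result.get("alternatives")
    ]
    return " ".join(parts)


def build_booking_prompt(transcript):
    """Hypothetical prompt asking a generative model for booking fields."""
    return (
        "Extract the caller name, requested specialty, and preferred date "
        "from this call transcript and answer as JSON:\n" + transcript
    )
```

The model's structured answer would then be what the automation enters into the booking application.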

Please connect with me if you have more queries on this 🙂