TrustRadius: an HG Insights company

Google Cloud Speech-to-Text Reviews and Ratings

Rating: 6.8 out of 10

46 Reviews

Turning your words into insights is easier with Google speech to text

Rating: 9 out of 10
Incentivized

Use Cases and Deployment Scope

Previously, converting speech to text was very time-consuming. The team often needs quick access to information from calls, and real-time transcription enables faster decision-making and keeps the process smooth. At times it is hard to analyze a large volume of data. Once the audio is converted into text, we can easily search for any keyword and perform data analysis, which helps improve our reports. As a technical support team, we use this tool daily to convert customer conversations into text for quality checks and sentiment analysis. We also use this tool to transform the audio of our field offers into text.

Pros

  • Provides high-speed real-time streaming transcription, such as live captioning and automatic note capturing during meetings
  • It supports more than 120 languages, which makes this product globally recognized. It helps multilingual call centers that rely heavily on Google Speech-to-Text.
  • The transcription is formatted very clearly with proper punctuation, commas, and question marks; therefore, no human intervention is needed to correct the data

Cons

  • Real-time transcription needs high-quality audio
  • Cost is high for large-scale operations
  • Integration seems complex; for specialized vocabulary, there is no dedicated GUI that lets nontechnical users make corrections

Likelihood to Recommend

Our real-time field service agents use this heavily, as it converts audio into text, handles moderate background noise, and supports more than 120 languages. Code switching is also handled easily. It suits voice-based data entry inside internal applications and CRM systems. It does not work well with heavy background noise, as accuracy drops in loud environments. Certain highly technical terms cannot be added automatically, as it does not have the capacity to recognize them.

Transforming voice into the text is easier with Google Cloud Speech-to-Text

Rating: 9 out of 10
Incentivized

Use Cases and Deployment Scope

Earlier, we relied entirely on a notepad or scribbled notebook during calls to capture the important discussion points, which was hectic and time-consuming. Documentation itself is a very big task, and we found a solution in Google Cloud Speech-to-Text, which has great features such as capturing audio and converting it to text. It also makes our documentation and knowledge management easier, so we can share the same information across different teams without manual effort. The business problems addressed by Google Cloud Speech-to-Text include manual transcription overhead and improving customer experience.

Pros

  • It supports more than 125 languages and dialects, which helps customers all over the globe
  • Also integrates seamlessly with analytics and AI workflows
  • High-accuracy transcription in noisy environments
  • Works well with long-form audio

Cons

  • We observed inconsistent accuracy on domain-specific jargon; recognition is not guaranteed, and it requires trial-and-error tuning
  • There is limited support for advanced structures such as headings and paraphrasing
  • The pricing model is confusing, with several different pricing tiers
  • Uploads take longer to process depending on the audio files

Likelihood to Recommend

Real-time meeting notes for smaller groups. Strong language coverage of more than 125 languages. Handles mobile phone recordings and environmental noise effectively. Fast transcription turnaround. It also supports phrases, which improves recognition of industry-specific terminology. Generating QA/compliance audit logs. It also builds sentences with accurate punctuation and sentence boundaries. It has vast global support centers whose primary focus is resolving customer issues and helping multinational engineering teams build great products.

Disappointed in Google Cloud Speech-to-Text

Rating: 2 out of 10
Incentivized

Use Cases and Deployment Scope

As a pastor, I preach sermons on a regular basis. While I prepare a manuscript before preaching, I often incorporate elements in my public sermons that are extemporaneous. In order to document these verbal emendations, I had hoped to use Google Cloud Speech-to-Text to efficiently transcribe my recorded sermons after preaching.

Pros

  • Supposedly helpfully transcribes audio files
  • Presents a professional front in its interface
  • Stores digital transcriptions in the cloud

Cons

  • Interface is very confusing
  • Instructions are not clear on how to upload files
  • Full scope of the purposes of this program is not succinctly stated

Likelihood to Recommend

Google Cloud Speech-to-Text would appear to be well suited to the tech-savvy pastor who wishes to keep an accurate transcription of his weekly sermons. This would help ensure such a pastor had a reliable manuscript if he ever desired to preach those same sermons in the future. However, based on my personal experience, Google Cloud Speech-to-Text seems to be less appropriate for a pastor such as myself who is not intuitively adept with programs such as this.

Vetted Review

Google Cloud Speech to Text - Proving Google Is The AI Leader

Rating: 9 out of 10
Incentivized

Use Cases and Deployment Scope

I use Google Cloud Speech-to-Text during any and all brainstorming sessions, and also while leaving sales-related voicemails. I do this so that no ideas fall behind or between the cracks, and so that I can improve future voicemails that I leave. I can't record conversations unless the other party is aware, so voicemails allow me to practice and listen back to what the decision maker is hearing from me.

Pros

  • Accurate
  • Doesn't skip a beat
  • Has great hearing

Cons

  • Specific words can have funny output
  • It sometimes stops recording too quickly
  • Poor grammar at times

Likelihood to Recommend

It is well suited for stream-of-consciousness vibe creating, and has certainly helped alleviate the carpal tunnel syndrome I have suffered from for years. It is not well suited to loud environments or to situations where people are talking over one another, such as a work meeting. Sometimes the words get garbled.

Vetted Review
Google Cloud Speech-to-Text
1 year of experience

Great Product that is Plug and Play Ready

Rating: 8 out of 10
Incentivized

Use Cases and Deployment Scope

We use it to transcribe audio recordings from meetings, phone calls, reviews, and such. We then use it in connection with a notetaker to organize our thoughts and keep better track of meeting points and action items. The product is pretty accurate with spoken words. Plus, it plugs into other applications pretty easily.

Pros

  • capturing speech
  • plug and play into other applications
  • keeping track of notes

Cons

  • low volume recording
  • time limit
  • the start/stop action

Likelihood to Recommend

Great product that is easy to use. It's easy to add this product to other applications and teach the team how to quickly utilize it. The option for translation when working with partners who speak a different language makes this product great. It is quick and easy to start talking and have the note taker capture the speech.

Vetted Review
Google Cloud Speech-to-Text
2 years of experience

Google Speech to Text Your gateway to connect the world.

Rating: 8 out of 10
Incentivized

Use Cases and Deployment Scope

I prefer Google Cloud Speech to Text for translating people's queries because my team members are from different countries, and I need to communicate with them effectively. So, it's good to understand their language and speak with them. Apart from that, I implemented its API in my various Python scripts to automate my virtual assistant in different languages. Its custom models and phrase hints improve the accuracy and maintain the process well. Sometimes I also used it for my YouTube video subtitles and podcasts. We can use it in many ways and enhance our capability to work in extreme conditions.
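
The phrase hints mentioned above can be illustrated with a small sketch. It only builds the configuration portion of a recognition request as a plain dictionary and does not call the service. The field names (`languageCode`, `enableAutomaticPunctuation`, `speechContexts`, `phrases`) follow the v1 REST API as commonly documented and should be checked against the current reference; the helper name `build_config` and the example phrases are hypothetical and not taken from the review.

```python
import json


def build_config(language_code, phrases):
    """Return a recognition config dict with phrase hints attached.

    Assumption: field names mirror the v1 REST RecognitionConfig;
    verify them against the official reference before relying on them.
    """
    return {
        "languageCode": language_code,
        "enableAutomaticPunctuation": True,
        # Phrase hints: words or phrases the recognizer should favor.
        "speechContexts": [{"phrases": list(phrases)}],
    }


# Hypothetical domain terms for a virtual assistant.
config = build_config("en-US", ["turn on the lights", "set a timer"])
print(json.dumps(config, indent=2))
```

Keeping the hints in one place like this makes it easier to tune them per language or per project when recognition of specific terms is inconsistent.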

Pros

  • So, first of all it gives the answer or translates in real time which is awesome.
  • It has speaker diarization, which detects who spoke each segment. This is a great feature because it can track the number of people as well.
  • It has an automatic punctuation system that detects each punctuation mark, such as a dot and a comma, and places it in the text.
  • Lastly, it offers a variety of language translations, providing a global platform for interaction with people from different countries.
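
As a rough illustration of how speaker diarization output might be consumed, the sketch below groups consecutive words by speaker. It assumes each word in a response carries a `word` string and an integer `speakerTag`, which is how diarized words are commonly represented in the v1 REST response; the function name `speaker_turns` and the sample data are hypothetical.

```python
def speaker_turns(words):
    """Group consecutive words that share a speaker tag into turns.

    `words` is a list of dicts with "word" and "speakerTag" keys
    (assumed response shape; words without a tag are grouped under 0).
    """
    turns = []
    for item in words:
        tag = item.get("speakerTag", 0)
        if turns and turns[-1][0] == tag:
            turns[-1][1].append(item["word"])
        else:
            turns.append((tag, [item["word"]]))
    return [(tag, " ".join(parts)) for tag, parts in turns]


# Made-up sample standing in for a diarized response.
sample_words = [
    {"word": "Hello", "speakerTag": 1},
    {"word": "there", "speakerTag": 1},
    {"word": "Hi", "speakerTag": 2},
    {"word": "again", "speakerTag": 1},
]
for tag, text in speaker_turns(sample_words):
    print(f"Speaker {tag}: {text}")
```

Counting the distinct tags in the result gives the number of speakers the service detected, which is one way to notice when diarization misses participants in a larger group.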

Cons

  • It has limited accuracy in noisy environments and with accents, so this can be improved.
  • If there are 5+ people in a conversation, then the speaker diarization will fail. So, this can be enhanced.
  • There are limited emotions for voice, so these can be enhanced. We can add more emotions to the models and train them.

Likelihood to Recommend

So, I've had scenarios like when I collaborate with a team where the people are from around the world. So, I used it there, and we spoke to each other in their native language. That boosts everyone's confidence in our collaborative efforts. I've also utilized its model and the API in my projects, including a Virtual assistant and a multilingual application that allows us to learn languages from around the world. We tested it with a group of 12 people, and that's when it failed. I mean, it's not a failure, but it can't detect every person.

A nice advantage to your workflow

Rating: 7 out of 10

Use Cases and Deployment Scope

I do a lot of writing, and I do a lot of speaking. I want to keep records of both just in case I need to edit later, and with this Google product it is like carrying an old-fashioned dictaphone with you. Is this a bad thing? Nope - it's just another app that can solve a need without carrying a lot of equipment with you, and the lag time is good - meaning that there isn't a lot of lag.

Pros

  • deciphers tougher words
  • keeps up with my speech speed and patterns
  • maintains an accurate record of what is spoken

Cons

  • It could be faster - there is lag
  • I would like to see a different interface - just a personal thing
  • It would be better if it were closer to real time

Likelihood to Recommend

I think in settings where the speech or conversations need to be recorded it is effective. I think as the venue gets larger, this gets harder and harder to both record and accurately hear - this is one thing that I didn't try - I haven't had a need for yet.

Making your audio commands to text easier with Google Cloud Speech-to-Text

Rating: 9 out of 10
Incentivized

Use Cases and Deployment Scope

Transcribing customer support calls for quality analysis was made easier with Google Cloud Speech-to-Text, which transcribes the conversation and helps the business run smoothly. We set configuration parameters such as language, model, and speaker, and send the audio data; as soon as it is sent, the API returns the transcribed text. That way we reduce manual effort and increase productivity. Earlier, creating captions for real-time meetings was very hard. If we wanted to clarify any information after a meeting, no captions were available and we relied entirely on manual notebook entries. Now we can recheck the captions and fetch any information we need. Transcripts are easy to copy and store securely.
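
The request flow described above (set language, model, and speaker options, send audio, read back text) might look roughly like the following sketch. It builds a request body and parses a response without contacting the service. The field names follow the v1 REST `speech:recognize` format as commonly documented and should be verified against the current reference; the helper names, the 8 kHz LINEAR16 encoding, and the default values are assumptions made for illustration.

```python
import base64


def build_recognize_request(audio_bytes, language_code="en-US",
                            model="phone_call", max_speakers=2):
    """Build a JSON-serializable body for a recognize call.

    Assumptions: 8 kHz LINEAR16 audio and v1 REST field names.
    """
    return {
        "config": {
            "encoding": "LINEAR16",
            "sampleRateHertz": 8000,
            "languageCode": language_code,
            "model": model,
            "diarizationConfig": {
                "enableSpeakerDiarization": True,
                "minSpeakerCount": 1,
                "maxSpeakerCount": max_speakers,
            },
        },
        # Inline audio is sent as base64-encoded text.
        "audio": {"content": base64.b64encode(audio_bytes).decode("ascii")},
    }


def extract_transcript(response):
    """Join the top alternative of every result into one string."""
    pieces = [
        result["alternatives"][0]["transcript"].strip()
        for result in response.get("results", [])
        if result.get("alternatives")
    ]
    return " ".join(pieces)


body = build_recognize_request(b"\x00\x01", language_code="en-IN")
print(body["config"]["languageCode"], body["config"]["model"])
```

Separating request construction from response parsing keeps both pieces testable offline; the actual HTTP call and authentication are deliberately left out of this sketch.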

Pros

  • Transcribing customer support calls for quality analysis
  • Creating real-time captions for meetings and webinars
  • Automating documentation based on the speech APIs
  • Streaming real-time transcription using the streaming APIs
  • Converting audio in different languages to text is also easier

Cons

  • Integration outside of the Google ecosystem is challenging.
  • Google Cloud Speech-to-Text works only with an active internet connection. If bandwidth is low, it affects the transcription process and can lead to inaccurate data.
  • Pricing is also in the higher range, which not every company can afford. A small-scale organisation that would like to use the tool will weigh the price before deciding. Reducing the price could increase product usage.

Likelihood to Recommend

In our real-time meetings or webinars where a larger audience is expected, we enable the captions option with Google Cloud Speech-to-Text, which transcribes the complete audio conversation into a neat text format. We also use this tool during interviews to make sure we adhere to certain rules; senior management checks the transcripts to confirm that the required questions were asked, for quality analysis. During customer calls, we use this tool to make sure the two-way communication is transcribed so that senior management can review it later if there is an escalation.

Great for converting speech to text

Rating: 8 out of 10
Incentivized

Use Cases and Deployment Scope

We use it as an assistant while transcribing our customer interviews into text, which helps us save time and energy on transcriptions and allows us to focus more on complex and interesting tasks. We have also tried using the text-to-speech function to add audio to our interfaces and we found it very convenient.

Pros

  • Transcribe speech into text
  • Transcribe text into speech
  • Share transcriptions among the team members

Cons

  • It is very expensive when you start working with big files
  • It has some trouble with accents
  • Doesn't work well when several people speak simultaneously

Likelihood to Recommend

Google Cloud Speech-to-Text works well in situations where you have audio files and need to quickly extract information from them, convert it into text, and share it with your colleagues. I conduct interviews with customers where they share their experiences, and it's very convenient to quickly distribute the information to my team without making them watch the videos or listen to the audio files.

A Reliable Tool for Real-Time Transcription and Automation

Rating: 8 out of 10
Incentivized

Use Cases and Deployment Scope

We use Google Cloud Speech-to-Text in our company mainly to convert voice recordings, like meetings, customer calls, and voice notes, into written text. It is also capable of converting various sorts of audio sources to text, which is convenient for those who may have trouble hearing or are not present.

Pros

  • Speech to text
  • Accuracy
  • Text format can be seen by all people in the meeting.

Cons

  • A feature that focuses on only the speaker.
  • Pricing is a bit on the higher side.
  • Depending on your accent, recognition can be hard, but rarely

Likelihood to Recommend

It helps us save time and multitask accurately. The multi-language support is great for diverse teams.