{"id":16004,"date":"2022-07-07T23:28:31","date_gmt":"2022-07-07T23:28:31","guid":{"rendered":"https:\/\/www.transcribeme.com\/?p=16004"},"modified":"2024-07-03T04:32:53","modified_gmt":"2024-07-03T04:32:53","slug":"evaluating-automatic-speech-recognition-technology","status":"publish","type":"post","link":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/","title":{"rendered":"Evaluating Automatic Speech Recognition Technology"},"content":{"rendered":"[vc_row type=&#8221;full_width_background&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; top_padding=&#8221;5%&#8221; constrain_group_1=&#8221;yes&#8221; bottom_padding=&#8221;5%&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; advanced_gradient_angle=&#8221;0&#8243; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221; gradient_type=&#8221;default&#8221; shape_type=&#8221;&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_spacing=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;2\/3&#8243; tablet_width_inherit=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text]Let\u2019s talk about Automatic Speech Recognition (ASR) technology: the state of the art; user\/customer expectations; ASR output vs user\/customer expectations; and ask whether there is one ASR engine that meets all requirements? Spoiler alert\u2013the answer to that last question is, no. There&#8217;s no single ASR Engine that can satisfy all industry needs. Why not? We will dive into that answer in a bit.<\/p>\n<p>Here\u2019s another question to ask about ASR Technology. Why should I, the consumer, look beyond the big three\u2013Google, Apple, Microsoft\u2026make that four, IBM to assist me in meeting all my ASR Requirements? Obviously they have the biggest R&amp;D budgets and attract the best talent so their technology should be the best, right?<\/p>\n<blockquote><p>The answer is, it depends.<\/p><\/blockquote>\n[\/vc_column_text][\/vc_column][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_spacing=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/3&#8243; tablet_width_inherit=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][image_with_animation image_url=&#8221;16005&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;Fade In&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;&#8221; border_radius=&#8221;15px&#8221; box_shadow=&#8221;small_depth&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;125%&#8221; max_width_mobile=&#8221;default&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full_width_background&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; equal_height=&#8221;yes&#8221; content_placement=&#8221;middle&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; bg_color=&#8221;#f7f7f7&#8243; scene_position=&#8221;center&#8221; top_padding=&#8221;5%&#8221; constrain_group_1=&#8221;yes&#8221; bottom_padding=&#8221;5%&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; advanced_gradient_angle=&#8221;0&#8243; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221; gradient_type=&#8221;default&#8221; shape_type=&#8221;&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_spacing=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;2\/3&#8243; tablet_width_inherit=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text]\n<h2>ASR Technology Hits and Misses<\/h2>\n<p>For example, you want Google to turn on the lights\u2013\u201dGoogle! Turn on the driveway lights.\u201d Or, \u201cSiri! Play my, I\u2019m really depressed mix.\u201d Or \u201cAlexa, I need a Vegan pizza, light pepperoni and cheese.\u201d All of these technologies that use ASR to pick up on voice commands work pretty well.<\/p>\n<p>However, there are a number of cases where these ASR technologies have challenges. One pretty simple example is when I use the speech to text feature on a phone. Between auto correct and incorrect words, it\u2019s definitely not perfect. In fact, what is most frustrating is that it doesn\u2019t learn. I always have to correct my daughters\u2019 names as well as that of my engineering VP\u2013EVERY TIME! This is a slightly different use case than query response, but it\u2019s similar. Typically short sentences, real time transcription and the errors are because the context is free form so there isn\u2019t the possibility of comprehensive training.[\/vc_column_text][\/vc_column][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_spacing=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/3&#8243; tablet_width_inherit=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][image_with_animation image_url=&#8221;16009&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;Fade In&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;&#8221; border_radius=&#8221;none&#8221; box_shadow=&#8221;none&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;125%&#8221; max_width_mobile=&#8221;default&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full_width_background&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; bg_color=&#8221;#ffffff&#8221; scene_position=&#8221;center&#8221; top_padding=&#8221;5%&#8221; constrain_group_1=&#8221;yes&#8221; bottom_padding=&#8221;5%&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; advanced_gradient_angle=&#8221;0&#8243; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221; gradient_type=&#8221;default&#8221; shape_type=&#8221;&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_spacing=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;2\/3&#8243; tablet_width_inherit=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text]\n<h2>How TranscribeMe Uses ASR<\/h2>\n<p>The TranscribeMe use case for ASR is neither of these. Eg, \u201cOk Google! Listen to this one hour audio file and transcribe it with timestamps for every speaker change.\u201d As they say colloquially, \u201cthat dog don\u2019t hunt.\u201d Why not? That\u2019s not the use case for Google.<\/p>\n<p>Simplistically, the ASR industry breaks down into two use cases, query\/response and audio to text transcription. TranscribeMe continually tests vendors&#8217; speech engines and the big 3 or 4 are never at the top of the list in terms of word error rate for our use case\u2013and that makes sense\u2013audio to text, where audio, not \u2018spoken speech\u2019 is not their design target.<\/p>\n<p>An example of a TranscribeMe virtual request might be, \u201cTranscribe this six hour legal deposition with five speakers using the state of Iowa output format and include speaker IDs and speaker change timestamps.\u201d Well, truth time, no ASR engine is going to get that right. But some may be better than others.<\/p>\n<p>So that\u2019s where ASR analysis becomes more sophisticated. We\u2019re not simply looking at word error rates but at other factors, such as which engine punctuates or capitalizes best? Which works best w\/ crosstalk? Which is stellar with single speaker or multichannel vs that which can handle multiple speakers on a single channel?<\/p>\n<p>Why do these qualifications matter? Because the speech engine is not going to produce the final output that will be acceptable to the customer. Maybe it will produce output that\u2019s 90% correct\u2013that\u2019s pretty good. What if your car worked 90% of the time\u2013pretty good or totally unacceptable?[\/vc_column_text][\/vc_column][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_spacing=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; advanced_gradient_angle=&#8221;0&#8243; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/3&#8243; tablet_width_inherit=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221; gradient_type=&#8221;default&#8221;][image_with_animation image_url=&#8221;604&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;Fade In&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;&#8221; border_radius=&#8221;none&#8221; box_shadow=&#8221;none&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;125%&#8221; max_width_mobile=&#8221;default&#8221;][\/vc_column][\/vc_row][vc_row type=&#8221;full_width_background&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; bg_color=&#8221;#f7f7f7&#8243; scene_position=&#8221;center&#8221; top_padding=&#8221;5%&#8221; constrain_group_1=&#8221;yes&#8221; bottom_padding=&#8221;5%&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; advanced_gradient_angle=&#8221;0&#8243; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221; gradient_type=&#8221;default&#8221; shape_type=&#8221;&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_spacing=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;2\/3&#8243; tablet_width_inherit=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221;][vc_column_text]\n<h2>No ASR Engine\u2019s Output is Perfect<\/h2>\n<p>No ASR engine with a few caveats can produce output that will be acceptable to the customer as a finished product. The ASR engine produces an output that then requires human review and correction for completion. And that human in the loop dictates which engine we use for various customer and use cases\u2013those distinctions I mentioned above: dial up the ASR that excels at single speaker clear audio; or dial up the ASR that accurately timestamps speaker changes; or we need the engine that doesn\u2019t insert gibberish when it doesn\u2019t understand the audio.<\/p>\n<p>In summary, the TranscribeMe use case requires different engines for different types\/qualities of audio and for specific use cases. Since we don\u2019t build our own ASR we can shop and use any vendor that fits our needs and provides the best output for human review and correction.<\/p>\n<p>I mentioned a caveat where there are cases that a one pass ASR output can satisfy customer requirements and in our case we have a customer who does further analytics on the ASR output. That analysis may be keyword spotting or sentiment analysis, or other.<\/p>\n<p>As an aside, be wary of any company using their own home grown ASR to process files\u2013one size does not fit all and companies that do produce their own ASRs continually narrow the niches where they play.<\/p>\n<p>Do you have some examples of ways where you have found ASR technology challenging with a project you have worked on? We\u2019d love to know. Are you looking for a company like TranscribeMe to help you with any of your <a style=\"font-weight: bold; text-decoration: underline; color: #FF875A;\" href=\"https:\/\/www.transcribeme.com\/transcription-services\/\">Transcription<\/a> or <a style=\"font-weight: bold; text-decoration: underline; color: #FF875A;\" href=\"https:\/\/www.transcribeme.com\/ai-machine-learning\/\">AI Datasets and Machine Learning<\/a> needs?<\/p>\n<p><a style=\"font-weight: bold; text-decoration: underline; color: #FF875A;\" href=\"https:\/\/www.transcribeme.com\/hipaa-compliance\/\">Contact us today!<\/a>[\/vc_column_text][\/vc_column][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_spacing=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243; column_shadow=&#8221;none&#8221; column_border_radius=&#8221;none&#8221; column_link_target=&#8221;_self&#8221; column_position=&#8221;default&#8221; advanced_gradient_angle=&#8221;0&#8243; gradient_direction=&#8221;left_to_right&#8221; overlay_strength=&#8221;0.3&#8243; width=&#8221;1\/3&#8243; tablet_width_inherit=&#8221;default&#8221; tablet_text_alignment=&#8221;default&#8221; phone_text_alignment=&#8221;default&#8221; animation_type=&#8221;default&#8221; bg_image_animation=&#8221;none&#8221; border_type=&#8221;simple&#8221; column_border_width=&#8221;none&#8221; column_border_style=&#8221;solid&#8221; gradient_type=&#8221;default&#8221;][image_with_animation image_url=&#8221;16010&#8243; image_size=&#8221;full&#8221; animation_type=&#8221;entrance&#8221; animation=&#8221;Fade In&#8221; hover_animation=&#8221;none&#8221; alignment=&#8221;&#8221; border_radius=&#8221;none&#8221; box_shadow=&#8221;none&#8221; image_loading=&#8221;default&#8221; max_width=&#8221;125%&#8221; max_width_mobile=&#8221;default&#8221;][\/vc_column][\/vc_row]\n","protected":false},"excerpt":{"rendered":"<p>[vc_row type=&#8221;full_width_background&#8221; full_screen_row_position=&#8221;middle&#8221; column_margin=&#8221;default&#8221; column_direction=&#8221;default&#8221; column_direction_tablet=&#8221;default&#8221; column_direction_phone=&#8221;default&#8221; scene_position=&#8221;center&#8221; top_padding=&#8221;5%&#8221; constrain_group_1=&#8221;yes&#8221; bottom_padding=&#8221;5%&#8221; text_color=&#8221;dark&#8221; text_align=&#8221;left&#8221; row_border_radius=&#8221;none&#8221; row_border_radius_applies=&#8221;bg&#8221; overflow=&#8221;visible&#8221; advanced_gradient_angle=&#8221;0&#8243; overlay_strength=&#8221;0.3&#8243; gradient_direction=&#8221;left_to_right&#8221; shape_divider_position=&#8221;bottom&#8221; bg_image_animation=&#8221;none&#8221; gradient_type=&#8221;default&#8221; shape_type=&#8221;&#8221;][vc_column column_padding=&#8221;no-extra-padding&#8221; column_padding_tablet=&#8221;inherit&#8221; column_padding_phone=&#8221;inherit&#8221; column_padding_position=&#8221;all&#8221; column_element_spacing=&#8221;default&#8221; background_color_opacity=&#8221;1&#8243; background_hover_color_opacity=&#8221;1&#8243;&#8230;<\/p>\n","protected":false},"author":7,"featured_media":16005,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[1028,4],"tags":[],"class_list":{"0":"post-16004","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-asr","8":"category-blog"},"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v23.3 (Yoast SEO v24.7) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Evaluating Automatic Speech Recognition Technology | TranscribeMe<\/title>\n<meta name=\"description\" content=\"Even the foremost innovators in ASR Technology still struggle to meet all the requirements necessary for its users of it. We evaluate a few of the reasons why.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Evaluating Automatic Speech Recognition Technology\" \/>\n<meta property=\"og:description\" content=\"Even the foremost innovators in ASR Technology still struggle to meet all the requirements necessary for its users of it. We evaluate a few of the reasons why.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/\" \/>\n<meta property=\"og:site_name\" content=\"TranscribeMe\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/TranscribeMe\/\" \/>\n<meta property=\"article:published_time\" content=\"2022-07-07T23:28:31+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-07-03T04:32:53+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png\" \/>\n\t<meta property=\"og:image:width\" content=\"801\" \/>\n\t<meta property=\"og:image:height\" content=\"601\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Transcribe Me\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@transcribeme\" \/>\n<meta name=\"twitter:site\" content=\"@transcribeme\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Transcribe Me\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/\"},\"author\":{\"name\":\"Transcribe Me\",\"@id\":\"https:\/\/www.transcribeme.com\/#\/schema\/person\/632cda4e18ad799c64ebcfa85ca09c22\"},\"headline\":\"Evaluating Automatic Speech Recognition Technology\",\"datePublished\":\"2022-07-07T23:28:31+00:00\",\"dateModified\":\"2024-07-03T04:32:53+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/\"},\"wordCount\":2009,\"publisher\":{\"@id\":\"https:\/\/www.transcribeme.com\/#organization\"},\"image\":{\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png\",\"articleSection\":[\"ASR\",\"Blog\"],\"inLanguage\":\"en\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/\",\"url\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/\",\"name\":\"Evaluating Automatic Speech Recognition Technology | TranscribeMe\",\"isPartOf\":{\"@id\":\"https:\/\/www.transcribeme.com\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png\",\"datePublished\":\"2022-07-07T23:28:31+00:00\",\"dateModified\":\"2024-07-03T04:32:53+00:00\",\"description\":\"Even the foremost innovators in ASR Technology still struggle to meet all the requirements necessary for its users of it. We evaluate a few of the reasons why.\",\"breadcrumb\":{\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#breadcrumb\"},\"inLanguage\":\"en\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en\",\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#primaryimage\",\"url\":\"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png\",\"contentUrl\":\"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png\",\"width\":801,\"height\":601,\"caption\":\"TranscribeMe - ASR\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.transcribeme.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Evaluating Automatic Speech Recognition Technology\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.transcribeme.com\/#website\",\"url\":\"https:\/\/www.transcribeme.com\/\",\"name\":\"TranscribeMe\",\"description\":\"The most accurate transcription starting at $0.79 per minute\",\"publisher\":{\"@id\":\"https:\/\/www.transcribeme.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.transcribeme.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.transcribeme.com\/#organization\",\"name\":\"TranscribeMe.com\",\"url\":\"https:\/\/www.transcribeme.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en\",\"@id\":\"https:\/\/www.transcribeme.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2021\/09\/featured-image-thumb.jpg\",\"contentUrl\":\"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2021\/09\/featured-image-thumb.jpg\",\"width\":512,\"height\":512,\"caption\":\"TranscribeMe.com\"},\"image\":{\"@id\":\"https:\/\/www.transcribeme.com\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/TranscribeMe\/\",\"https:\/\/x.com\/transcribeme\",\"https:\/\/www.linkedin.com\/company\/transcribeme\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.transcribeme.com\/#\/schema\/person\/632cda4e18ad799c64ebcfa85ca09c22\",\"name\":\"Transcribe Me\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en\",\"@id\":\"https:\/\/www.transcribeme.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/edb71dcbf6cd2a48f0eb4e9030185de7d39db37c0c53f317d6aadf73b387973b?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/edb71dcbf6cd2a48f0eb4e9030185de7d39db37c0c53f317d6aadf73b387973b?s=96&d=mm&r=g\",\"caption\":\"Transcribe Me\"}}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Evaluating Automatic Speech Recognition Technology | TranscribeMe","description":"Even the foremost innovators in ASR Technology still struggle to meet all the requirements necessary for its users of it. We evaluate a few of the reasons why.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/","og_locale":"en_US","og_type":"article","og_title":"Evaluating Automatic Speech Recognition Technology","og_description":"Even the foremost innovators in ASR Technology still struggle to meet all the requirements necessary for its users of it. We evaluate a few of the reasons why.","og_url":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/","og_site_name":"TranscribeMe","article_publisher":"https:\/\/www.facebook.com\/TranscribeMe\/","article_published_time":"2022-07-07T23:28:31+00:00","article_modified_time":"2024-07-03T04:32:53+00:00","og_image":[{"width":801,"height":601,"url":"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png","type":"image\/png"}],"author":"Transcribe Me","twitter_card":"summary_large_image","twitter_creator":"@transcribeme","twitter_site":"@transcribeme","twitter_misc":{"Written by":"Transcribe Me","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#article","isPartOf":{"@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/"},"author":{"name":"Transcribe Me","@id":"https:\/\/www.transcribeme.com\/#\/schema\/person\/632cda4e18ad799c64ebcfa85ca09c22"},"headline":"Evaluating Automatic Speech Recognition Technology","datePublished":"2022-07-07T23:28:31+00:00","dateModified":"2024-07-03T04:32:53+00:00","mainEntityOfPage":{"@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/"},"wordCount":2009,"publisher":{"@id":"https:\/\/www.transcribeme.com\/#organization"},"image":{"@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#primaryimage"},"thumbnailUrl":"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png","articleSection":["ASR","Blog"],"inLanguage":"en"},{"@type":"WebPage","@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/","url":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/","name":"Evaluating Automatic Speech Recognition Technology | TranscribeMe","isPartOf":{"@id":"https:\/\/www.transcribeme.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#primaryimage"},"image":{"@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#primaryimage"},"thumbnailUrl":"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png","datePublished":"2022-07-07T23:28:31+00:00","dateModified":"2024-07-03T04:32:53+00:00","description":"Even the foremost innovators in ASR Technology still struggle to meet all the requirements necessary for its users of it. We evaluate a few of the reasons why.","breadcrumb":{"@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#breadcrumb"},"inLanguage":"en","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/"]}]},{"@type":"ImageObject","inLanguage":"en","@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#primaryimage","url":"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png","contentUrl":"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2022\/07\/asr.png","width":801,"height":601,"caption":"TranscribeMe - ASR"},{"@type":"BreadcrumbList","@id":"https:\/\/www.transcribeme.com\/blog\/evaluating-automatic-speech-recognition-technology\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.transcribeme.com\/"},{"@type":"ListItem","position":2,"name":"Evaluating Automatic Speech Recognition Technology"}]},{"@type":"WebSite","@id":"https:\/\/www.transcribeme.com\/#website","url":"https:\/\/www.transcribeme.com\/","name":"TranscribeMe","description":"The most accurate transcription starting at $0.79 per minute","publisher":{"@id":"https:\/\/www.transcribeme.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.transcribeme.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en"},{"@type":"Organization","@id":"https:\/\/www.transcribeme.com\/#organization","name":"TranscribeMe.com","url":"https:\/\/www.transcribeme.com\/","logo":{"@type":"ImageObject","inLanguage":"en","@id":"https:\/\/www.transcribeme.com\/#\/schema\/logo\/image\/","url":"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2021\/09\/featured-image-thumb.jpg","contentUrl":"https:\/\/www.transcribeme.com\/wp-content\/uploads\/2021\/09\/featured-image-thumb.jpg","width":512,"height":512,"caption":"TranscribeMe.com"},"image":{"@id":"https:\/\/www.transcribeme.com\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/TranscribeMe\/","https:\/\/x.com\/transcribeme","https:\/\/www.linkedin.com\/company\/transcribeme"]},{"@type":"Person","@id":"https:\/\/www.transcribeme.com\/#\/schema\/person\/632cda4e18ad799c64ebcfa85ca09c22","name":"Transcribe Me","image":{"@type":"ImageObject","inLanguage":"en","@id":"https:\/\/www.transcribeme.com\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/edb71dcbf6cd2a48f0eb4e9030185de7d39db37c0c53f317d6aadf73b387973b?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/edb71dcbf6cd2a48f0eb4e9030185de7d39db37c0c53f317d6aadf73b387973b?s=96&d=mm&r=g","caption":"Transcribe Me"}}]}},"_links":{"self":[{"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/posts\/16004","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/comments?post=16004"}],"version-history":[{"count":0,"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/posts\/16004\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/media\/16005"}],"wp:attachment":[{"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/media?parent=16004"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/categories?post=16004"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.transcribeme.com\/wp-json\/wp\/v2\/tags?post=16004"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}