ABOUT THE AUTHOR


Mike Bergelson
Mike Bergelson is responsible for developing new product and business model strategies for Cisco's Unified Communications portfolio. Prior to this...
Read Full Bio >>
SHARE



Mike Bergelson | July 19, 2010 |

 
   

The State of Transcription for UC: Part 2.2: Meeting Transcription

The State of Transcription for UC: Part 2.2: Meeting Transcription audio and video meetings involve, by their nature, multiple speakers. This complicates the task of transcription considerably

audio and video meetings involve, by their nature, multiple speakers. This complicates the task of transcription considerably

This is a continuation of my blog post from last week, part of a series of posts on the application of transcription in unified communications.

In this, last week's and next week's posts, I discuss the state of transcription today. In the final post in the series, I'll address where I believe the market may be going and some key areas of innovation that can help us derive more benefit from recorded audio and video content.

Transcribing meetings is, today anyway, a fairly labor-intensive process. Medical and voicemail transcriptions are both complex in their own rights (think esoteric nomenclature and poor reception and background noises, respectively), yet they also benefit from the advantage of a single speaker; in contrast, audio and video meetings involve, by their nature, multiple speakers. This complicates the task of transcription considerably, as it necessitates the reliable tracking of who said what and handling of situations where people talk over each other.

To be effective, meeting transcriptions must be accurate, and they therefore require (at least today) some human involvement in the transcription.

This is, of course, expensive and time consuming. If meeting transcription was offered as a basic service to employees (as audio conferencing or email often are today), I estimate this capability could easily approach 10-20% of an employee's salary at the standard rates of $1–2 per transcribed minute. These rates are based on "typical" turn-around times of three to five days--a span that could reduce the value of the transcription to begin with. Next day transcriptions typically cost three to four times as much.

The costs are understandable; human operators typically transcribe around 30–40 words per minute. Assuming we speak at 100–150 WPM, it can take three to ten minutes of transcriptionist time for every minute of discussion, depending on the complexity of the dialogue, clarity of the speakers' voices and number of meeting participants and the uniqueness of their voices.

There are, however, some innovative approaches to reducing the costs. For example, Cogi allows users to highlight sections of a conference call in real-time to avoid transcribing uninteresting segments (such as, "We've almost got quorum, folks. Can someone give Bob a call on his cell to see if he's going to join?"). I've also seen some creative early uses of Amazon's Mechanical Turk service to accomplish lower cost, faster turn-around transcriptions.

In my final blog post in this series, I'll explore these approaches in more detail and suggest some other areas where innovation can help lead to lower cost, higher accuracy, faster-turn-around, secure transcriptions.

Given the productivity benefits to come from the availability of transcriptions of recorded audio and video content in the enterprise, it's our responsibility as widgeteers, analysts and consultants to ensure the general availability of these capabilities to our customers at a reasonable total burden of ownership*.

* Total burden of ownership considers not just hard and soft costs but end user complexity, an oft-overlooked expense of adopting new technologies.



COMMENTS




Currently we allow the following HTML tags in comments:

Single tags

These tags can be used alone and don't need an ending tag.

<br> Defines a single line break

<hr> Defines a horizontal line

Matching tags

These require an ending tag - e.g. <i>italic text</i>

<a> Defines an anchor

<b> Defines bold text

<big> Defines big text

<blockquote> Defines a long quotation

<caption> Defines a table caption

<cite> Defines a citation

<code> Defines computer code text

<em> Defines emphasized text

<fieldset> Defines a border around elements in a form

<h1> This is heading 1

<h2> This is heading 2

<h3> This is heading 3

<h4> This is heading 4

<h5> This is heading 5

<h6> This is heading 6

<i> Defines italic text

<p> Defines a paragraph

<pre> Defines preformatted text

<q> Defines a short quotation

<samp> Defines sample computer code text

<small> Defines small text

<span> Defines a section in a document

<s> Defines strikethrough text

<strike> Defines strikethrough text

<strong> Defines strong text

<sub> Defines subscripted text

<sup> Defines superscripted text

<u> Defines underlined text

Did you know you can style comments using HTML tags and upload your avatar photo? To upload your avatar photo, first complete your Disqus profile. Once your profile is complete, you may add your avatar photo. (Hide this hint)
Enterprise Connect Tour 2012
In response to the booming demand for SIP Trunks—and for information about SIP Trunks—Enterprise Connect is launching a four-city “road show” on this vital topic.
May 22: San Francisco
June 6: Chicago
June 27: New York
Enterprise Connect 2012 Roundup
Read blog posts and watch videos captured live at the industry's leading event, and catch up on all the post-show analysis too.
Enterprise Connect Webinar
Forrester Research principal analyst Art Schoeller will provide an overview of trends in enterprise SIP Trunking and unified communications adoption.
Enterprise Connect Orlando 2013
Enterprise Connect Orlando 2013 takes place March 18-21, 2013 at the Gaylord Palms Hotel. We'll be opening registration shortly. Stay tuned!
Trending Now
Upcoming Events
May 23, 2012
The explosion of new hosted and cloud communications offerings can be confounding. With the potential for cost savings and productivity enhancements, adopting the right Hosted PBX solution can make a ...
May 9, 2012
SIP Trunking and unified communications strategies are important components of enterprise telecommunication strategies. Enterprise Session Border Controllers (E-SBCs) play a critical role in maximizin...
April 25, 2012
Unified Communications (UC) is becoming mainstream in the enterprise, enabling real-time, collaborative communications via a host of new media and applications. But this transition will bring challeng...

Sign up to the No Jitter email newsletters

  • Catch up with the blogs, features and columns from No Jitter, the online community for the IP communications industry. Each Thursday, we'll send you a synopsis of the high-impact articles, podcasts and other material posted to No Jitter that week, with links for quick access.

  • A quick hit of original analysis by the experts who bring you Enterprise Connect, the leading event in Enterprise Communications & Collaboration. Each Wednesday, this enewsletter delivers to your email box a thought-provoking, objective take on the latest news and trends in the industry.

Your email address is required for membership. For details about the user information, please read the UBM Privacy Statement

As an added benefit, would you like to receive relevant 3rd party offers about new products/services and discounted offers via email? Yes

* = Required Field