An International Symposium
Technology and Deaf Education
TechSym

Exploring Instructional and Access Technologies


Captions

(M10E)

Meaning-for-Meaning Speech-to-Text Services: A Better Understanding

Cindy Camp



ROUGH EDITED COPY

RIT/NTID

An International Symposium

Technology and Deaf Education

"Meaning-for-Meaning Speech-to-Text Services:

A Better Understanding"

Presenters:

Cindy Camp and Pam Francis

Session M10E

June 23, 2008

CAPTIONING PROVIDED BY:

ALTERNATIVE COMMUNICATION SERVICES, LLC

P.O. BOX 278

LOMBARD, IL 60148

* * * * *

This is being provided in a rough-draft format.

Communication Access Realtime Translation (CART) is provided

in order to facilitate communication accessibility and may

not be a totally verbatim record of the proceedings.

* * * * *

>>PAM FRANCIS: Good morning. Thank you for coming this

morning. Do any members of our audience need an

interpreter? Yes? So all right. Well, then, thank you for

being here, and I would ask, if you would like to introduce

yourselves.

>> My name is Miriam.

>> My name is Erin Jacobs.

>>PAM FRANCIS: Thank you. I am Pam Francis, and I am one

of your presenters today and this is Cindy Camp and we are

here to talk about meaning-for-meaning, speech-to-text

services and to try to shed a little bit of light and share

some information with you. We do welcome questions during

the presentation, and we're a little short on time already,

so we'll -- there may be some sections we speed through a

little bit.

Briefly, we're going to talk just about a definition, a

general definition, some of the benefits of

meaning-for-meaning, give some examples, and make an

association with some -- a process that might be familiar to

some, and some strategies that service providers can use to

improve their skills. And we'll talk briefly about analysis

and evaluation.

So if I was to ask the members of the audience what would be

your description of meaning-for-meaning, do you have

something in mind? Perhaps a summary of the spoken English

content. But that's not really what we feel is an accurate

description. A translation or paraphrase of the spoken

English content. We went a little bit further. And I'll

give you a chance to read that.

So we'd like to offer this as a definition that you can take

away with you today of what meaning-for-meaning

speech-to-text services are. And where did we come up with

that, or where -- what do we think is important as part of

that definition?

Well, we use the word translation, and we also use the word

paraphrase. And again, I'd like to give you a minute to

read the slide.

So as we move through our presentation today, you'll have --

you'll be able to associate what we're saying with the

definition that we have provided.

What are some of the benefits of a meaning-for-meaning

representation? Well, it is a model of written English. So

what does that mean? Well, if a speaker -- and I'll use

myself as a good example -- a speaker doesn't necessarily

speak in correct English, and a meaning-for-meaning service

provider represents the information in clear grammar, clear

format, clear English of what written -- a model of written

English rather than spoken English. Succinct delivery of

spoken information. Well, that might mean if something is

repeated, then the information might be bolded or it might

be underlined rather than put down twice.

A manageable transcript. For an hour class, rather than

receiving 25 to 30 pages of notes, they might review ten or

12.

Includes pragmatics of spoken information. And we'll talk

about that a little bit more. But the words don't always

represent what a speaker's meaning is.

And then it's visually accessible, meaning the information

can be bolded, it can be underlined, it can be emphasized.

There are many different ways to manipulate the information.

So, let's talk a little bit more about words versus intent.

As I said, linguistic meaning does not always reflect a

speaker's meaning. So let's look at your examples.

If I were to type, "Can you see Mary? " That might not give

a clear indication of what I'm trying to say.

So I may choose to use a different word. It depends how the

speaker is saying that information, what meaning they want

to convey. Because can you see Mary is very different than

"Can you see Mary? " Or "Can you see Mary? Can you imagine

what she's doing? So one might be a question as to whether

you can actually see her, and one might be a question as to

can you imagine what she was doing?

Again, another example, sincerity versus sarcasm. The door

is over there, I might be directing someone, whereas I might

be telling someone where the door is because I'd like them

to leave. How that information is conveyed is important.

And that's where the pragmatics come in. Words do not

always equal intent.

>>CINDY CAMP: How many of you, from being here, you've seen

what CART or verbatim transcripts look like. How many of

you have experience with meaning-for-meaning transcripts?

Okay. A few of you.

What happens is if someone has seen a service provider who

is not very skilled, then they come away with the impression

that verbatim is always best, and meaning-for-meaning is a

substandard service. And I'd like to challenge you today to

think outside the box and realize that these are two

different services. And it's very similar to thinking about

an ASL interpreter as opposed to a signed English

interpreter. They both have their place. It depends on the

client who's using the service, and the presenter, and the

situation.

I'm going to give you just a minute to read this transcript.

It's a verbatim transcript of a person talking about

transition services.

This is a very good example for our hearing audience because

when you read this and there is no voice saying the words,

much of the meaning is lost.

If you were hearing the information, it would make a bit

more sense. Still not perfect, but better.

And this is a very good example of when someone says, oh,

verbatim, that's always what we need, if a student's sitting

in a classroom and reading this, how much information do you

think they're actually getting?

It doesn't make a lot of sense. This is exactly how the

speaker talks, but as we've discussed, speakers don't always

use proper grammar or complete sentences.

So, my point is, just like ASL is not bad English,

meaning-for-meaning is not bad verbatim.

In fact, the mental processes that a meaning-for-meaning

service provider uses is very similar to those that an

interpreter into sign language uses.

If someone is using a verbatim method to transcribe, there's

actually very little mental processing that occurs in the

use of the language.

You may process homophones such as T-H-E-R-E, T-H-E-I-R,

words that sound the same, you would need to choose which is

the appropriate word, but other than that, it's basically,

the sound comes in, the words go back out on the computer.

With meaning-for-meaning systems, such as C-Print and

TypeWell, we're processing more at the phrasal and sentence

level. So instead of hearing a word and typing that word,

we listen for a concept, a complete idea, and then process

that whole sentence, that whole concept at once.

Looking at some interpreting models can really help us at

this point understand how meaning-for-meaning is processed.

These are three quite famous models, if you know anything

about interpreting. And we're going to briefly look at each

one of them and see how they apply to a meaning-for-meaning

service provider.

The Colonomos Model, I've simplified it for you. But

basically, she says that the interpreter listens to the

source language, mentally processes it, and decides how to

output that into the target language. And she broke it down

into steps.

This is exactly the same thing that meaning-for-meaning

service provider would do, listen for an entire concept,

mentally process it, think about what the best

representation would be. You can't always give out a word

for word translation because our students may not understand

that. The meaning may not be clear.

Pam gave us some examples of words that -- of sentences that

look exactly the same, but have very different meanings. So

we have to think about that.

Also, meaning-for-meaning service providers include the

environmental cues, tone of voice and inflection and how

those influence the output.

I have a really good example. There was a student who was

taking a class, and the instructor had a reputation for

being very difficult. So right before the midterm test, all

of the students in the class were scared to death. This

teacher's really hard, our midterm's coming up, we're all

going to fail.

The teacher REALIZING that they're all stressed out says,

oh, yeah, you all need to be worried. Everybody in here's

going to fail.

All of the hearing students went, whew, it's okay.

The deaf student in the class who had a CART service

provider saw those words, left the class and dropped because

they didn't get the tone. They didn't understand that it

was a joke to make everybody feel better.

The second model I want to talk about is the Gish Model, and

this one is actually one I find most helpful because what

happens with most service providers doesn't matter what

language they're going with, they get stuck down here on the

detail level.

1492, Columbus! What about it? If you don't have those

connectors that Columbus discovered America in 1492, it

really doesn't matter that you got the year and you've got

the name.

So it's important to realize that we need the goal of the

speaker, and we always need to keep that in the back of our

heads.

What is the goal that this speaker wants to convey today?

Then we go down, there's going to be a theme. You'll have

several objectives. Then you get down into the units and

the details.

If you always keep going back up to that main goal, you're

pretty safe. I joke about if you're interpreting in church

or captioning in church and you lose the content, what's the

overall goal in church? God is good. Everyone should go to

church. You go back up to that main goal.

Another example from a classroom, a captionist went in, and

the teacher said, we're going to show a video today. Does

everybody love trying to interpret or caption for videos?

No.

It's impossible.

So the captionist went to the instructor and said, what is

the goal? What do you want them to learn from this video?

I'm not going to be able to keep up with the dialogue and

get it all, I'm going to have to summarize, so if you let me

know what you want them to learn, then I can focus on that.

The teacher said, the sound effects.

The captionist said, the sound effects?

Yes. It was a war movie, and this was a class where then

they were going to talk about how those sound effects

enhanced the plot and so on.

So the captionist dropped all the dialogue and focused on

the sound effects.

At the end, when the class was discussing the movie, the

student was able to fully participate, whereas if the

captionist had not asked what the goal was, she would have

focused on the dialogue, and then the follow-up

conversation, the student would have been totally lost.

So even had she been able to get verbatim of the dialogue,

that still wouldn't have helped. So that's why I find this

model very helpful.

This is the third model created by Vance Cokely, and I'm not

going to go into it at all because it's a very detailed

model of the mental processes involved in interpreting. But

I want to tell you that this is very similar to what a

speech-to-text person experiences, so this is your brain on

speech-to-text, and meaning-for-meaning. It's not an easy

thing.

The second part of the Cokely Model talks about how the

service provider influences the message sometimes

intentionally and sometimes not. And we can talk about how

things can be misrepresented. And again, this is the same

in meaning-for-meaning.

There are things that are omitted, there are things that

could be added that the speaker did not intend.

Sometimes substitutions with words that are not appropriate,

sometimes environmental intrusions into the service

provider's mind.

There's a cell phone ringing, there's a student who keeps

tapping their foot and the service provider misses

information because of that. And then sometimes there are

just those weird things that show up in the transcript that

we didn't intend.

In meaning-for-meaning, that can be a word that did not

expand correctly in the software. Just things like that.

When you're looking at a verbatim transcript you're going to

get basically the same thing. But I wanted to show you,

with meaning-for-meaning you're going to get variation, but

that's not necessarily bad.

Some service providers may provide more detail than others,

depending on skill level, depending on the content.

But I have an example of the verbatim of a lecture, and then

two different meaning-for-meaning transcripts. So you can

see what you might get. And I'll give you a minute to read

those.

Okay. After seeing an example of a meaning-for-meaning

transcript, what's your impression?

Do you think it's worse than verbatim? Do you think it's

appropriate in some situations? Everybody's still asleep

this morning. But I see some nodding.

Word-for-word transcript. Every single word. Everything

that's said, it's that.

Okay. Hopefully now you have an understanding of the

difference between verbatim and meaning-for-meaning, and

hopefully you can see that each has its place. If you have

an instructor who does not speak in complete sentences, and

rambles, then perhaps the student would prefer

meaning-for-meaning.

We had a situation where the instructor very clearly

outlined everything and was very organized, and the student

normally had a meaning-for-meaning service provider. We had

a substitute person come in who used CART, and the student

said they really preferred meaning-for-meaning in that

situation because of the formatting that could be shown.

In meaning-for-meaning systems, you can use bulleted lists,

numbered lists, bold, italics. It's very visual. And when

the instructor isn't -- speaks in an organized fashion and

uses those formatting tips, then the student was able to

follow along.

Whereas the CART provider used a paragraph format and didn't

have all of the formatting available to them. So I

challenge you to look at the situation and see what is going

to accomplish the goal of getting the information across.

Yes? We have a question?

>> I'm just wondering, with those two examples, do you have

any comment about the quality of one over the other?

>>CINDY CAMP: I think they are both very good examples. I

did not -- we have some examples later of where information

was missing and where things were lost. These are just

meant to show that there can be differences.

This is just a list of some skills that someone who was

providing a meaning-for-meaning service would need to have.

Very strong short term memory because it's not going in and

out. It's going in, process, then being put out. At the

same time you're listening for the next chunk of

information.

You need an expansive vocabulary. If there's a word that

could be confused with another word, what do you have in

your vocabulary to replace that?

Very good comprehension of the English language,

understanding idioms and knowing that those are probably not

going to translate well for someone who has English as a

second language.

A good knowledge of deaf culture and syntactical references.

Yes?

>> Is there a case where you may have the service provider

provide the idiom and the true meaning so they still get

exposed to that group of vocabulary, or in context of what

the speaker was saying?

>>CINDY CAMP: It's going to depend on the student and the

service provider. The majority of the time, yes, they will

give both. However, I've had situations where it disrupted

class greatly because the idiom was given and as soon as the

student read it, they burst out laughing because they had

never been exposed to it, and then the instructor thought

they were laughing at him, and so you have to be very

careful about how you do that.

You need a broad knowledge base. If you don't understand

the content that's being expressed, there's no way you can

translate it into another language. And trust me, written

English is a different language than spoken English.

If you've ever sat in a classroom and heard the instructor

lecture, then they read from a book, the syntax, the rate of

speech, the structure, it's all very different. So you have

to understand the concept to be able to express it

appropriately. You have to be very strong at summarization.

And summarization doesn't mean contents left out. It means

it's presented in a clear and concise fashion.

And then obviously you need to be a fast typer.

I'm going to turn it back over to Pam now and she's going to

talk a little bit about how you could evaluate a service

provider that you're seeing or at your institution, and then

we'll have a few minutes at the end for questions,

hopefully.

>>PAM FRANCIS: The evaluation of meaning-for-meaning could

take -- the discussion could take a few hours. So we won't

dive very deeply into it, but we do want to briefly address

it. There are existing systems out there that evaluate

non-verbatim representations of information.

For example, foreign language certifications, that might --

that's one example.

And there are systems that are working, they're in court,

but to get there is a very, very complicated process and

time intensive.

So what do those certifications systems, those evaluation

systems have in common? A specified unit of analysis. For

example, idea units, certain chunk of information broken

down. Objective scoring criteria, very, very important

because as Cindy said, you can put five people in a room and

they can transcribe the information, and what you get at the

end looks different from all five people. Okay. So you

have to have objective scoring criteria.

Consensus scoring. For these systems you need something

that more than one person evaluates the output. That is

very important, and it's very important to really reach an

objective evaluation. And they want the final outcome, the

final outcome needs to be valid and reliable. This is a

quantitative system. It's not -- it's much more objective.

It's much more precise.

But to get there takes quite a bit. Five minutes? Okay.

To get there takes quite a bit. And this is just really a

representation of what it might take to develop one system.

Preparation of the source material. Getting the lectures or

getting a piece of spoken information that has a certain

level of content, and then recording it, so on and so forth.

Identifying the units of analysis. So breaking it down into

idea units. That's a huge task. If you're talking about a

one-hour lecture, you can end up with 300 to 400 pieces of

information or idea units.

Training judges, training people who will take that

information and evaluate that transcript. And that's not

just one judge. There may be three. There may be five. It

depends on the system.

Performing the analysis. Actually having the judges go

through idea unit by idea unit and deciding whether or not

it's appropriate. And then summarizing and reporting. So,

for one, okay, we're talking 38 to 42 hours. That's a lot

of time that most -- unless you have some system set up to

evaluate, for example, you have a company that's doing the

evaluation, you have some system that does evaluation,

that's a lot of time. And most places, especially campuses,

do not have that kind of time to devote to that evaluation.

There is an effort to try to develop some type of national

certification that is appropriate, but that -- to develop

the system, that is legally defensible, that is objective,

it will cost well over a million dollars. So what do you do

in the meantime? And I'm going to be very quick about this.

You have to look at the real time transcript and decide

whether it has the qualities that are needed.

Again, you have five different transcripts. You have to

look at them each individually. You have to decide whether

things are clearly worded, whether the information is

complete and accurate, and whether it's easy and quick to

read. And that doesn't mean that it's at a lower

educational level. That means that the reader can look at

it, get the information and move on, that it's not

convoluted in any way, short, quick sentences representing

the information.

I apologize that I'm running along.

So, how do you do that? You get the unedited transcript.

Okay? And you review it.

It would be great to be in the classroom or in the situation

where you're hearing the lecture and then you can go back

and review it. You don't always have that option. Some

things you can do is get notes from a notetaker. You can

review the transcript with a consent expert or the

instructor. There are a few things you can do.

Okay. Check the transcript out. This is just a few

examples. One thing that drives me crazy is when I see a

transcript of just solid text. There's no visual breaks.

There's no -- the information is not clearly represented

where a thought starts and stops. So really look at it and

make sure that the content is good and the representation is

good.

Once that transcript is reviewed, there are things that can

be gleaned from that, and information -- assistance can be

offered to the service provider.

For example, depending on how they're representing the

information, perhaps giving them suggestions, and we passed

out that strategies for service providers sheet. On how

they might include their wording or their vocabulary, how

they might better capture the information.

So really, what is gleaned from that review is very

important, and that information can be shared with the

service provider, and their skills can be improved.

This is a very basic one-hour, two-hour process. It's -- at

least for the review.

It's not meant to be a legal process or even a process that

can be used to hire or retain a person unless it's done at a

very -- much deeper level. But the point is, is the

interview -- or the transcript can be reviewed and

information can be provided to the service providers so that

their skills can improve.

As far as the certification or evaluation system, that is

widely available. That takes time. It's not easy to take a

meaning-for-meaning transcript and objectively say, you have

all the information, you have 60% of the information, you

have 30%, you have 90%. It's very difficult to do. And

it's time intensive. But there are things in the interim

that can be done. So what did we talk about today?

We gave a brief definition. We talked about some benefits.

Cindy talked about interpreting and the interpreting models

and things that might be familiar, and some skills that

service providers, meaning-for-meaning service providers

need. And a very brief discussion about analysis.

Are there any questions at all?

I think what's -- oh, go ahead.

>> I'm interested to see the meaning-for-meaning. In the

U.K., it is in the training course for notetakers, but I

found that it takes a lot longer for people to learn how to

do it, especially if they have fast typing speeds they tend

not to be good at summarizing. Is this part of your initial

training for a notetaker or is it an add-on?

>>PAM FRANCIS: Well, one of the things that is an issue

here is that the meaning-for-meaning service providers are

very concerned, and it's not notetaking. If they're

accurate and they -- and they get all of the information,

it's not really notetaking. And depending on the skills of

the student -- bless you -- depending on the skills of the

student, if the student needs notetaking and the information

that a meaning-for-meaning service provider gets is too

much, if they're overwhelmed, then they have to cut back.

So what we do, at least in the training that we -- that we

have done is we provide some skill exercises, we provide

information about listening, so on and so forth.

I think one of the very difficult parts is that

meaning-for-meaning service providers, one of the reasons

that they were established is because they wanted, and I

apologize for talking very quickly -- they wanted a training

that was quick, and the people could get it into the

classroom.

The problem with that is that you cannot develop the

processing skills that are necessary in three weeks or four

weeks or five weeks, in six months.

Okay. So that is the problem or a main issue is developing

the processing skills to be able to take the information and

put it back out. Many meaning-for-meaning service providers

don't come in with that. It takes time. There are many

people who are cross-trained. The best meaning-for-meaning

service providers are people who have those skills, like

interpreters.

>> Thank you, Pam and Cindy! Thank you very much for your

presentation! I'm sure there are many of you with

questions, and you can approach Pam and Cindy during the

rest of the conference.

We are very interested in your feedback. We are encouraging

you to fill out evaluations online, and the learning center

is available for that.

I have some hard copy evaluation forms if that is your

preference. Each session is named for the day and the time.

This session is M10, Monday at 10:00, and that's the most

important thing when you're providing feedback. If you

would like to provide written feedback, please take one of

these forms and complete that. Thank you so much for

coming!

>>CINDY CAMP: I have some brochures on the speech-to-text

services network, which is an organization for both

meaning-for-meaning and verbatim. I don't have enough for

everyone, but you're welcome to take them and go online and

learn more.



Close