Chapter 10











Speachreading

Speechreading at IPVR, Uni Stuttgart

Computer Lipreading - Overview

Computer Lipreading - Simulation Results

Lipreading for Robust Automatic Speech Recognition

Center for Automation Research (CfAR)

At the time 2001 was filmed, work was well underway on giving computers the ability to "see" the world by analyzing images. A computer as advanced as HAL would be expected to have well-developed visual capabilities. Dave certainly takes it for granted that HAL can "see" and appreciate his sketches ("May I see them? . . . That's a very nice rendering") and is not surprised when he discovers -- too late -- that HAL can speechread ("I could see your lips move".)

Can a computer really be programmed to do such things? The field of computer vision deals with methods a computer can use to obtain information about objects and events in a scene by analysing images of the scene. These methods need not resemble those used by humans (or animals) to see the world as long as they yield correct results. In this chapter, we first discuss how a computer might distinguish objects in a scene -- for example, how it might distinguish a human head from its background or differentiate parts of the head (e.g., hair, eyes) from each other. We then explore how information about the parts can be used for recognition -- for example, in identifying a particular person's face in a collection of images. Finally, we consider how variations in the appearance of parts of a face over a sequence of images can be used to recognize a person's facial expressions.









top of pageauthor infoorderfurther reading