Abstract
Automating the task of lip synchronization has long been an interesting yet challenging problem, especially for animation that is needed "on the fly," in real time. This thesis presents a method for creating a computer program that simplifies the process of generating real-time lip sync animation while keeping the resulting animation as believable as possible. Using a single audio voice track or live microphone input, the implemented program extracts distinguishing features from the audio signal, specifically LPC cepstral coefficients, gain, and zero-crossing rate. These features are used as input to a trained three-layer feedforward back-propagation neural network that performs phonetic classification for each frame of animation. The training of the neural network is speaker-dependent: classification is accurate only for the speaker who provided the sound samples in the training set.
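Of the features named in the abstract, gain and zero-crossing rate are straightforward to compute per analysis frame. The sketch below is illustrative only and is not the thesis's implementation: it computes log-energy gain and zero-crossing rate with NumPy, using assumed frame and hop sizes (512 and 256 samples); the LPC cepstral coefficients, which require a separate LPC analysis step, are omitted.

```python
import numpy as np

def frame_features(signal, frame_len=512, hop=256):
    """Per-frame gain (log energy) and zero-crossing rate.

    Illustrative sketch only: the thesis also uses LPC cepstral
    coefficients, which are not computed here. Frame and hop sizes
    are assumed values, not taken from the thesis.
    """
    feats = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        # Gain: log of the frame's energy (small epsilon avoids log(0)).
        gain = np.log(np.sum(frame ** 2) + 1e-10)
        # Zero-crossing rate: sign changes per sample in the frame.
        zcr = np.mean(np.abs(np.diff(np.sign(frame)))) / 2
        feats.append((gain, zcr))
    return np.array(feats)

# Example: a 1 kHz sine sampled at 8 kHz crosses zero twice per cycle,
# so its zero-crossing rate is about 2 * 1000 / 8000 = 0.25.
t = np.arange(8000) / 8000.0
sig = np.sin(2 * np.pi * 1000 * t)
features = frame_features(sig)
```

Each row of the resulting feature matrix (one row per animation frame) could then serve as an input vector to a classifier such as the three-layer feedforward network the abstract describes.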
Haaser, Christina Marie (2002). Automatic real-time lip synchronization using LPC analysis and neural networks. Master's thesis, Texas A&M University. Available electronically from https://hdl.handle.net/1969.1/ETD-TAMU-2002-THESIS-H18.