Show simple item record

dc.contributor.advisorGibson, Jerry D.
dc.creatorLee, Insung
dc.date.accessioned2024-02-09T20:48:36Z
dc.date.available2024-02-09T20:48:36Z
dc.date.issued1992
dc.identifier.urihttps://hdl.handle.net/1969.1/DISSERTATIONS-1374767
dc.descriptionVitaen
dc.descriptionMajor subject: Electrical Engineeringen
dc.description.abstractEfficient speech coders using tree coding combined with harmonic scaling are designed at the rates of 6.4 kilobits/sec (kbps) and 4.8 kbps. A time domain harmonic scaling algorithm (TDHS) is used to compress input speech by a factor of two. This process allows the tree coder to have 2 bits/sample for 6.4 kbps and 1.5 bits/sample for 4.8 kbps in the case of a 6.4 kHz sampling rate. Various window functions and pitch extraction algorithms are compared in the TDHS algorithm. In the backward adaptive tree coder, there are three components of the code generator, including a hybrid adaptive quantizer, a short-term predictor and a pitch predictor. The short-term predictor with a pole-zero structure improves the performance of the tree coder in the error free channel over an all-pole structure. The robustness of the tree coder is achieved by carefully choosing the input of the short-term predictor adaptation. The residual signal shaped by an all-zero filter is used as the input for adaptation. The all-zero filter is obtained by truncating the infinite impulse response of the pole-zero synthesis filter. Another method to improve the robustness of the tree coder is the inclusion of a smoother in the pitch predictor. The coefficients of the smoother with three taps are computed from the estimated autocorrelation function of the pitch synthesizer output. The inclusion of a smoother provides a small improvement in the error free channel and robustness to transmission errors. Subjectively, tree coding combined with TDHS achieves very good quality speech at the rate of 6.4 kbps and good quality speech at 4.8 kbps. The performance of the coding system is evaluated for real speech by means of signal-to-noise ratio (SNR), segmental SNR, frequency domain SNR, spectrograms, and informal listening tests.en
dc.format.extentxv, 148 leavesen
dc.format.mediumelectronicen
dc.format.mimetypeapplication/pdf
dc.language.isoeng
dc.rightsThis thesis was part of a retrospective digitization project authorized by the Texas A&M University Libraries. Copyright remains vested with the author(s). It is the user's responsibility to secure permission from the copyright holder(s) for re-use of the work beyond the provision of Fair Use.en
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/
dc.subjectMajor electrical engineeringen
dc.subject.classification1992 Thesis L4783
dc.subject.lcshSpeech processing systemsen
dc.subject.lcshSignal processingen
dc.subject.lcshDigital techniquesen
dc.subject.lcshCoding theoryen
dc.titleRobust tree coding combined with harmonic scaling of speechen
dc.typeThesisen
thesis.degree.disciplineElectrical Engineeringen
thesis.degree.grantorTexas A&M Universityen
thesis.degree.nameDoctor of Philosophyen
thesis.degree.namePh. Den
thesis.degree.levelDoctorialen
dc.contributor.committeeMemberGeorghiades, Costas N.
dc.contributor.committeeMemberGriswold, Norman C.
dc.contributor.committeeMemberHsing, Tailen
dc.type.genredissertationsen
dc.type.materialtexten
dc.format.digitalOriginreformatted digitalen
dc.publisher.digitalTexas A&M University. Libraries
dc.identifier.oclc30130138


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

This item and its contents are restricted. If this is your thesis or dissertation, you can make it open-access. This will allow all visitors to view the contents of the thesis.

Request Open Access