Speech Recognition in Modulated Noise and Temporal Resolution

Speech Recognition in Modulated Noise and Temporal Resolution PDF Author: Timothy Daniel Trine
Publisher:
ISBN:
Category :
Languages : en
Pages : 194

Book Description


Speech Perception in Gated Noise

Speech Perception in Gated Noise PDF Author: Su-Hyun Jin
Publisher:
ISBN:
Category :
Languages : en
Pages : 268

Book Description


Dynamics of Speech Production and Perception

Dynamics of Speech Production and Perception PDF Author: P.L. Divenyi
Publisher: IOS Press
ISBN: 1607502038
Category : Language Arts & Disciplines
Languages : en
Pages : 388

Book Description
The idea that speech is a dynamic process is a tautology: whether from the standpoint of the talker, the listener, or the engineer, speech is an action, a sound, or a signal continuously changing in time. Yet, because phonetics and speech science are offspring of classical phonology, speech has been viewed as a sequence of discrete events-positions of the articulatory apparatus, waveform segments, and phonemes. Although this perspective has been mockingly referred to as "beads on a string", from the time of Henry Sweet's 19th century treatise almost up to our days specialists of speech science and speech technology have continued to conceptualize the speech signal as a sequence of static states interleaved with transitional elements reflecting the quasi-continuous nature of vocal production. This book, a collection of papers of which each looks at speech as a dynamic process and highlights one of its particularities, is dedicated to the memory of Ludmilla Andreevna Chistovich. At the outset, it was planned to be a Chistovich festschrift but, sadly, she passed away a few months before the book went to press. The 24 chapters of this volume testify to the enormous influence that she and her colleagues have had over the four decades since the publication of their 1965 monograph.

Speech Processing in the Auditory System

Speech Processing in the Auditory System PDF Author: Steven Greenberg
Publisher: Springer Science & Business Media
ISBN: 0387215751
Category : Science
Languages : en
Pages : 487

Book Description
Although speech is the primary behavioral medium by which humans communicate, its auditory basis is poorly understood, having profound implications on efforts to ameliorate the behavioral consequences of hearing impairment and on the development of robust algorithms for computer speech recognition. In this volume, the authors provide an up-to-date synthesis of recent research in the area of speech processing in the auditory system, bringing together a diverse range of scientists to present the subject from an interdisciplinary perspective. Of particular concern is the ability to understand speech in uncertain, potentially adverse acoustic environments, currently the bane of both hearing aid and speech recognition technology. There is increasing evidence that the perceptual stability characteristic of speech understanding is due, at least in part, to elegant transformations of the acoustic signal performed by auditory mechanisms. As a comprehensive review of speech's auditory basis, this book will interest physiologists, anatomists, psychologists, phoneticians, computer scientists, biomedical and electrical engineers, and clinicians.

The Frequency-Following Response

The Frequency-Following Response PDF Author: Nina Kraus
Publisher: Springer
ISBN: 331947944X
Category : Medical
Languages : en
Pages : 306

Book Description
This volume will cover a variety of topics, including child language development; hearing loss; listening in noise; statistical learning; poverty; auditory processing disorder; cochlear neuropathy; attention; and aging. It will appeal broadly to auditory scientists—and in fact, any scientist interested in the biology of human communication and learning. The range of the book highlights the interdisciplinary series of questions that are pursued using the auditory frequency-following response and will accordingly attract a wide and diverse readership, while remaining a lasting resource for the field.

Advances in Non-Linear Modeling for Speech Processing

Advances in Non-Linear Modeling for Speech Processing PDF Author: Raghunath S. Holambe
Publisher: Springer Science & Business Media
ISBN: 1461415055
Category : Technology & Engineering
Languages : en
Pages : 109

Book Description
Advances in Non-Linear Modeling for Speech Processing includes advanced topics in non-linear estimation and modeling techniques along with their applications to speaker recognition. Non-linear aeroacoustic modeling approach is used to estimate the important fine-structure speech events, which are not revealed by the short time Fourier transform (STFT). This aeroacostic modeling approach provides the impetus for the high resolution Teager energy operator (TEO). This operator is characterized by a time resolution that can track rapid signal energy changes within a glottal cycle. The cepstral features like linear prediction cepstral coefficients (LPCC) and mel frequency cepstral coefficients (MFCC) are computed from the magnitude spectrum of the speech frame and the phase spectra is neglected. To overcome the problem of neglecting the phase spectra, the speech production system can be represented as an amplitude modulation-frequency modulation (AM-FM) model. To demodulate the speech signal, to estimation the amplitude envelope and instantaneous frequency components, the energy separation algorithm (ESA) and the Hilbert transform demodulation (HTD) algorithm are discussed. Different features derived using above non-linear modeling techniques are used to develop a speaker identification system. Finally, it is shown that, the fusion of speech production and speech perception mechanisms can lead to a robust feature set.

Quick Screen of Phonology

Quick Screen of Phonology PDF Author: Nicholas W. Bankson
Publisher:
ISBN: 9781559900393
Category : Articulation disorders in children
Languages : en
Pages : 30

Book Description
Designed to provide a quick estimate of a young child's phonological development, referenced to age norms.

The SAGE Encyclopedia of Human Communication Sciences and Disorders

The SAGE Encyclopedia of Human Communication Sciences and Disorders PDF Author: Jack S. Damico
Publisher: SAGE Publications
ISBN: 1483380823
Category : Education
Languages : en
Pages : 2354

Book Description
The SAGE Encyclopedia of Human Communication Sciences and Disorders is an in-depth encyclopedia aimed at students interested in interdisciplinary perspectives on human communication—both normal and disordered—across the lifespan. This timely and unique set will look at the spectrum of communication disorders, from causation and prevention to testing and assessment; through rehabilitation, intervention, and education. Examples of the interdisciplinary reach of this encyclopedia: A strong focus on health issues, with topics such as Asperger's syndrome, fetal alcohol syndrome, anatomy of the human larynx, dementia, etc. Including core psychology and cognitive sciences topics, such as social development, stigma, language acquisition, self-help groups, memory, depression, memory, Behaviorism, and cognitive development Education is covered in topics such as cooperative learning, special education, classroom-based service delivery The editors have recruited top researchers and clinicians across multiple fields to contribute to approximately 640 signed entries across four volumes.

Communication Acoustics

Communication Acoustics PDF Author: Ville Pulkki
Publisher: John Wiley & Sons
ISBN: 1118866541
Category : Technology & Engineering
Languages : en
Pages : 454

Book Description
In communication acoustics, the communication channel consists of a sound source, a channel (acoustic and/or electric) and finally the receiver: the human auditory system, a complex and intricate system that shapes the way sound is heard. Thus, when developing techniques in communication acoustics, such as in speech, audio and aided hearing, it is important to understand the time–frequency–space resolution of hearing. This book facilitates the reader’s understanding and development of speech and audio techniques based on our knowledge of the auditory perceptual mechanisms by introducing the physical, signal-processing and psychophysical background to communication acoustics. It then provides a detailed explanation of sound technologies where a human listener is involved, including audio and speech techniques, sound quality measurement, hearing aids and audiology. Key features: Explains perceptually-based audio: the authors take a detailed but accessible engineering perspective on sound and hearing with a focus on the human place in the audio communications signal chain, from psychoacoustics and audiology to optimizing digital signal processing for human listening. Presents a wide overview of speech, from the human production of speech sounds and basics of phonetics to major speech technologies, recognition and synthesis of speech and methods for speech quality evaluation. Includes MATLAB examples that serve as an excellent basis for the reader’s own investigations into communication acoustics interaction schemes which intuitively combine touch, vision and voice for lifelike interactions.

Acoustical and Environmental Robustness in Automatic Speech Recognition

Acoustical and Environmental Robustness in Automatic Speech Recognition PDF Author: A. Acero
Publisher: Springer Science & Business Media
ISBN: 1461531225
Category : Technology & Engineering
Languages : en
Pages : 197

Book Description
The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in recent years, as more systems are finding their way into practical applications. Although the issue of environmental robustness has received only a small fraction of the attention devoted to speaker independence, even speech recognition systems that are designed to be speaker independent frequently perform very poorly when they are tested using a different type of microphone or acoustical environment from the one with which they were trained. The use of microphones other than a "close talking" headset also tends to severely degrade speech recognition -performance. Even in relatively quiet office environments, speech is degraded by additive noise from fans, slamming doors, and other conversations, as well as by the effects of unknown linear filtering arising reverberation from surface reflections in a room, or spectral shaping by microphones or the vocal tracts of individual speakers. Speech-recognition systems designed for long-distance telephone lines, or applications deployed in more adverse acoustical environments such as motor vehicles, factory floors, oroutdoors demand far greaterdegrees ofenvironmental robustness. There are several different ways of building acoustical robustness into speech recognition systems. Arrays of microphones can be used to develop a directionally-sensitive system that resists intelference from competing talkers and other noise sources that are spatially separated from the source of the desired speech signal.