Author: Nishant Balan
Publisher:
ISBN:
Category : Automatic speech recognition
Languages : en
Pages : 192
Book Description
Analysis and Evaluation of Factors Affecting Speech Driven Facial Animation
Author: Nishant Balan
Publisher:
ISBN:
Category : Automatic speech recognition
Languages : en
Pages : 192
Book Description
Publisher:
ISBN:
Category : Automatic speech recognition
Languages : en
Pages : 192
Book Description
Data-Driven 3D Facial Animation
Author: Zhigang Deng
Publisher: Springer Science & Business Media
ISBN: 1846289068
Category : Computers
Languages : en
Pages : 303
Book Description
Data-Driven 3D Facial Animation systematically describes the important techniques developed over the last ten years or so. Comprehensive in scope, the book provides an up-to-date reference source for those working in the facial animation field.
Publisher: Springer Science & Business Media
ISBN: 1846289068
Category : Computers
Languages : en
Pages : 303
Book Description
Data-Driven 3D Facial Animation systematically describes the important techniques developed over the last ten years or so. Comprehensive in scope, the book provides an up-to-date reference source for those working in the facial animation field.
Expressive Speech-driven Facial Animation
Visual Prosody in Speech-driven Facial Animation
Author: Marco Enrique Zavala Chmelicka
Publisher:
ISBN:
Category :
Languages : en
Pages :
Book Description
Facial animations capable of articulating accurate movements in synchrony with a speech track have become a subject of much research during the past decade. Most of these efforts have focused on articulation of lip and tongue movements, since these are the primary sources of information in speech reading. However, a wealth of paralinguistic information is implicitly conveyed through visual prosody (e.g., head and eyebrow movements). In contrast with lip/tongue movements, however, for which the articulation rules are fairly well known (i.e., viseme-phoneme mappings, coarticulation), little is known about the generation of visual prosody. The objective of this thesis is to explore the perceptual contributions of visual prosody in speech-driven facial avatars. Our main hypothesis is that visual prosody driven by acoustics of the speech signal, as opposed to random or no visual prosody, results in more realistic, coherent and convincing facial animations. To test this hypothesis, we have developed an audio-visual system capable of capturing synchronized speech and facial motion from a speaker using infrared illumination and retro-reflective markers. In order to elicit natural visual prosody, a story-telling experiment was designed in which the actors were shown a short cartoon video, and subsequently asked to narrate the episode. From this audio-visual data, four different facial animations were generated, articulating no visual prosody, Perlin-noise, speech-driven movements, and ground truth movements. Speech-driven movements were driven by acoustic features of the speech signal (e.g., fundamental frequency and energy) using rule-based heuristics and autoregressive models. A pair-wise perceptual evaluation shows that subjects can clearly discriminate among the four visual prosody animations. It also shows that speech-driven movements and Perlin-noise, in that order, approach the performance of veridical motion. The results are quite promising and suggest that speech-driven motion could outperform Perlin-noise if more powerful motion prediction models are used. In addition, our results also show that exaggeration can bias the viewer to perceive a computer generated character to be more realistic motion-wise.
Publisher:
ISBN:
Category :
Languages : en
Pages :
Book Description
Facial animations capable of articulating accurate movements in synchrony with a speech track have become a subject of much research during the past decade. Most of these efforts have focused on articulation of lip and tongue movements, since these are the primary sources of information in speech reading. However, a wealth of paralinguistic information is implicitly conveyed through visual prosody (e.g., head and eyebrow movements). In contrast with lip/tongue movements, however, for which the articulation rules are fairly well known (i.e., viseme-phoneme mappings, coarticulation), little is known about the generation of visual prosody. The objective of this thesis is to explore the perceptual contributions of visual prosody in speech-driven facial avatars. Our main hypothesis is that visual prosody driven by acoustics of the speech signal, as opposed to random or no visual prosody, results in more realistic, coherent and convincing facial animations. To test this hypothesis, we have developed an audio-visual system capable of capturing synchronized speech and facial motion from a speaker using infrared illumination and retro-reflective markers. In order to elicit natural visual prosody, a story-telling experiment was designed in which the actors were shown a short cartoon video, and subsequently asked to narrate the episode. From this audio-visual data, four different facial animations were generated, articulating no visual prosody, Perlin-noise, speech-driven movements, and ground truth movements. Speech-driven movements were driven by acoustic features of the speech signal (e.g., fundamental frequency and energy) using rule-based heuristics and autoregressive models. A pair-wise perceptual evaluation shows that subjects can clearly discriminate among the four visual prosody animations. It also shows that speech-driven movements and Perlin-noise, in that order, approach the performance of veridical motion. The results are quite promising and suggest that speech-driven motion could outperform Perlin-noise if more powerful motion prediction models are used. In addition, our results also show that exaggeration can bias the viewer to perceive a computer generated character to be more realistic motion-wise.
An Integrated Framework for Face Modeling, Facial Motion Analysis and Synthesis
Perceptual Analysis and Modeling of Facial Animation
Author: Xiaohan Ma
Publisher:
ISBN:
Category : Computer animation
Languages : en
Pages : 302
Book Description
Publisher:
ISBN:
Category : Computer animation
Languages : en
Pages : 302
Book Description
Data-Driven 3D Facial Animation
Author: Zhigang Deng
Publisher: Springer
ISBN: 9781848006416
Category : Computers
Languages : en
Pages : 296
Book Description
Data-Driven 3D Facial Animation systematically describes the important techniques developed over the last ten years or so. Although 3D facial animation is used more and more in the entertainment industries, to date there have been very few books that address the techniques involved. Comprehensive in scope, the book covers not only traditional lip-sync (speech animation), but also expressive facial motion, facial gestures, facial modeling, editing and sketching, and facial animation transferring. It provides an up-to-date reference source for academic research and for professionals working in the facial animation field.
Publisher: Springer
ISBN: 9781848006416
Category : Computers
Languages : en
Pages : 296
Book Description
Data-Driven 3D Facial Animation systematically describes the important techniques developed over the last ten years or so. Although 3D facial animation is used more and more in the entertainment industries, to date there have been very few books that address the techniques involved. Comprehensive in scope, the book covers not only traditional lip-sync (speech animation), but also expressive facial motion, facial gestures, facial modeling, editing and sketching, and facial animation transferring. It provides an up-to-date reference source for academic research and for professionals working in the facial animation field.
Data-driven 3D Facial Animation
Author: Zhigang Deng
Publisher:
ISBN: 9781826489064
Category : Computer animation
Languages : en
Pages : 296
Book Description
Publisher:
ISBN: 9781826489064
Category : Computer animation
Languages : en
Pages : 296
Book Description
Analysis and Comparison of Facial Animation Algorithms
Author: Anna Isabel Bellido Rivas
Publisher:
ISBN:
Category :
Languages : en
Pages :
Book Description
Publisher:
ISBN:
Category :
Languages : en
Pages :
Book Description
Animation of a Hierarchical Image Based Facial Model and Perceptual Analysis of Visual Speech
Author: Darren Cosker
Publisher:
ISBN:
Category : Computer animation
Languages : en
Pages : 0
Book Description
Publisher:
ISBN:
Category : Computer animation
Languages : en
Pages : 0
Book Description