Author: Gabriel Mittag
Publisher: Springer Nature
ISBN: 3030914798
Category : Technology & Engineering
Languages : en
Pages : 171
Book Description
This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitability of different deep learning architectures for this task. The author then shows how the resulting model outperforms traditional speech quality models and provides additional information about the cause of a quality impairment through the prediction of the speech quality dimensions of noisiness, coloration, discontinuity, and loudness.
Deep Learning Based Speech Quality Prediction
Author: Gabriel Mittag
Publisher: Springer Nature
ISBN: 3030914798
Category : Technology & Engineering
Languages : en
Pages : 171
Book Description
This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitability of different deep learning architectures for this task. The author then shows how the resulting model outperforms traditional speech quality models and provides additional information about the cause of a quality impairment through the prediction of the speech quality dimensions of noisiness, coloration, discontinuity, and loudness.
Publisher: Springer Nature
ISBN: 3030914798
Category : Technology & Engineering
Languages : en
Pages : 171
Book Description
This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitability of different deep learning architectures for this task. The author then shows how the resulting model outperforms traditional speech quality models and provides additional information about the cause of a quality impairment through the prediction of the speech quality dimensions of noisiness, coloration, discontinuity, and loudness.
Speech and Computer
Author: Alexey Karpov
Publisher: Springer Nature
ISBN: 303148312X
Category : Computers
Languages : en
Pages : 587
Book Description
The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.
Publisher: Springer Nature
ISBN: 303148312X
Category : Computers
Languages : en
Pages : 587
Book Description
The two-volume proceedings set LNAI 14338 and 14339 constitutes the refereed proceedings of the 25th International Conference on Speech and Computer, SPECOM 2023, held in Dharwad, India, during November 29–December 2, 2023. The 94 papers included in these proceedings were carefully reviewed and selected from 174 submissions. They focus on all aspects of speech science and technology: automatic speech recognition; computational paralinguistics; digital signal processing; speech prosody; natural language processing; child speech processing; speech processing for medicine; industrial speech and language technology; speech technology for under-resourced languages; speech analysis and synthesis; speaker and language identification, verification and diarization.
Simulating Conversations for the Prediction of Speech Quality
Author: Thilo Michael
Publisher: Springer Nature
ISBN: 3031318447
Category : Technology & Engineering
Languages : en
Pages : 157
Book Description
This book discusses the simulation of conversations through a novel approach of predicting speech quality based on the interactions of two simulated interlocutors. The author describes the setup of a simulation environment that is capable of simulating human dialogue on the speech level. The impact of delay and bursty packet loss on VoIP conversations is investigated and modeled for the use in the simulation. Based on parameters extracted from simulated conversations, the author proposes extensions to the E-model, a parametric model standardized by the International Telecommunications Union, in order to predict the quality of the simulated conversations. The author shows that predictions based on the simulated conversations outperform models that rely on the transmission parameters alone.
Publisher: Springer Nature
ISBN: 3031318447
Category : Technology & Engineering
Languages : en
Pages : 157
Book Description
This book discusses the simulation of conversations through a novel approach of predicting speech quality based on the interactions of two simulated interlocutors. The author describes the setup of a simulation environment that is capable of simulating human dialogue on the speech level. The impact of delay and bursty packet loss on VoIP conversations is investigated and modeled for the use in the simulation. Based on parameters extracted from simulated conversations, the author proposes extensions to the E-model, a parametric model standardized by the International Telecommunications Union, in order to predict the quality of the simulated conversations. The author shows that predictions based on the simulated conversations outperform models that rely on the transmission parameters alone.
Advances in Multimedia Modeling
Author: Susanne Boll
Publisher: Springer
ISBN: 364211301X
Category : Computers
Languages : en
Pages : 822
Book Description
The 16th international conference on Multimedia Modeling (MMM2010) was held in the famous mountain city Chongqing, China, January 6–8, 2010, and hosted by Southwest University. MMM is a leading international conference for researchersand industry practitioners to share their new ideas, original research results and practicaldevelopment experiences from all multimedia related areas. MMM2010attractedmorethan160regular,specialsession,anddemosession submissions from 21 countries/regions around the world. All submitted papers were reviewed by at least two PC members or external reviewers, and most of them were reviewed by three reviewers. The review process was very selective. From the total of 133 submissions to the main track, 43 (32. 3%) were accepted as regular papers, 22 (16. 5%) as short papers. In all, 15 papers were received for three special sessions, which is by invitation only, and 14 submissions were received for a demo session, with 9 being selected. Authors of accepted papers come from 16 countries/regions. This volume of the proceedings contains the abstracts of three invited talks and all the regular, short, special session and demo papers. The regular papers were categorized into nine sections: 3D mod- ing;advancedvideocodingandadaptation;face,gestureandapplications;image processing;imageretrieval;learningsemanticconcepts;mediaanalysisandm- eling; semantic video concepts; and tracking and motion analysis. Three special sessions were video analysis and event recognition, cross-X multimedia mining in large scale, and mobile computing and applications. The technical programfeatured three invited talks, paralleloral presentation of all the accepted regular and special session papers, and poster sessions for short and demo papers.
Publisher: Springer
ISBN: 364211301X
Category : Computers
Languages : en
Pages : 822
Book Description
The 16th international conference on Multimedia Modeling (MMM2010) was held in the famous mountain city Chongqing, China, January 6–8, 2010, and hosted by Southwest University. MMM is a leading international conference for researchersand industry practitioners to share their new ideas, original research results and practicaldevelopment experiences from all multimedia related areas. MMM2010attractedmorethan160regular,specialsession,anddemosession submissions from 21 countries/regions around the world. All submitted papers were reviewed by at least two PC members or external reviewers, and most of them were reviewed by three reviewers. The review process was very selective. From the total of 133 submissions to the main track, 43 (32. 3%) were accepted as regular papers, 22 (16. 5%) as short papers. In all, 15 papers were received for three special sessions, which is by invitation only, and 14 submissions were received for a demo session, with 9 being selected. Authors of accepted papers come from 16 countries/regions. This volume of the proceedings contains the abstracts of three invited talks and all the regular, short, special session and demo papers. The regular papers were categorized into nine sections: 3D mod- ing;advancedvideocodingandadaptation;face,gestureandapplications;image processing;imageretrieval;learningsemanticconcepts;mediaanalysisandm- eling; semantic video concepts; and tracking and motion analysis. Three special sessions were video analysis and event recognition, cross-X multimedia mining in large scale, and mobile computing and applications. The technical programfeatured three invited talks, paralleloral presentation of all the accepted regular and special session papers, and poster sessions for short and demo papers.
Computational Data and Social Networks
Author: Minh Hoàng Hà
Publisher: Springer Nature
ISBN: 9819706696
Category :
Languages : en
Pages : 440
Book Description
Publisher: Springer Nature
ISBN: 9819706696
Category :
Languages : en
Pages : 440
Book Description
Artificial Neural Networks and Machine Learning – ICANN 2023
Author: Lazaros Iliadis
Publisher: Springer Nature
ISBN: 3031441958
Category : Computers
Languages : en
Pages : 559
Book Description
The 10-volume set LNCS 14254-14263 constitutes the proceedings of the 32nd International Conference on Artificial Neural Networks and Machine Learning, ICANN 2023, which took place in Heraklion, Crete, Greece, during September 26–29, 2023. The 426 full papers, 9 short papers and 9 abstract papers included in these proceedings were carefully reviewed and selected from 947 submissions. ICANN is a dual-track conference, featuring tracks in brain inspired computing on the one hand, and machine learning on the other, with strong cross-disciplinary interactions and applications.
Publisher: Springer Nature
ISBN: 3031441958
Category : Computers
Languages : en
Pages : 559
Book Description
The 10-volume set LNCS 14254-14263 constitutes the proceedings of the 32nd International Conference on Artificial Neural Networks and Machine Learning, ICANN 2023, which took place in Heraklion, Crete, Greece, during September 26–29, 2023. The 426 full papers, 9 short papers and 9 abstract papers included in these proceedings were carefully reviewed and selected from 947 submissions. ICANN is a dual-track conference, featuring tracks in brain inspired computing on the one hand, and machine learning on the other, with strong cross-disciplinary interactions and applications.
Proceedings of the International Symposium on Intelligent Computing and Networking 2024
Author: Michel Kadoch
Publisher: Springer Nature
ISBN: 3031674472
Category :
Languages : en
Pages : 436
Book Description
Publisher: Springer Nature
ISBN: 3031674472
Category :
Languages : en
Pages : 436
Book Description
Speech Enhancement
Author: Philipos C. Loizou
Publisher: CRC Press
ISBN: 1466599227
Category : Technology & Engineering
Languages : en
Pages : 715
Book Description
With the proliferation of mobile devices and hearing devices, including hearing aids and cochlear implants, there is a growing and pressing need to design algorithms that can improve speech intelligibility without sacrificing quality. Responding to this need, Speech Enhancement: Theory and Practice, Second Edition introduces readers to the basic pr
Publisher: CRC Press
ISBN: 1466599227
Category : Technology & Engineering
Languages : en
Pages : 715
Book Description
With the proliferation of mobile devices and hearing devices, including hearing aids and cochlear implants, there is a growing and pressing need to design algorithms that can improve speech intelligibility without sacrificing quality. Responding to this need, Speech Enhancement: Theory and Practice, Second Edition introduces readers to the basic pr
The 10th International Conference on Computer Engineering and Networks
Author: Qi Liu
Publisher: Springer Nature
ISBN: 9811584621
Category : Technology & Engineering
Languages : en
Pages : 1770
Book Description
This book contains a collection of the papers accepted by the CENet2020 – the 10th International Conference on Computer Engineering and Networks held on October 16-18, 2020 in Xi’an, China. The topics focus but are not limited to Internet of Things and Smart Systems, Artificial Intelligence and Applications, Communication System Detection, Analysis and Application, and Medical Engineering and Information Systems. Each part can be used as an excellent reference by industry practitioners, university faculties, research fellows and undergraduates as well as graduate students who need to build a knowledge base of the most current advances and state-of-practice in the topics covered by this conference proceedings. This will enable them to produce, maintain, and manage systems with high levels of trustworthiness and complexity.
Publisher: Springer Nature
ISBN: 9811584621
Category : Technology & Engineering
Languages : en
Pages : 1770
Book Description
This book contains a collection of the papers accepted by the CENet2020 – the 10th International Conference on Computer Engineering and Networks held on October 16-18, 2020 in Xi’an, China. The topics focus but are not limited to Internet of Things and Smart Systems, Artificial Intelligence and Applications, Communication System Detection, Analysis and Application, and Medical Engineering and Information Systems. Each part can be used as an excellent reference by industry practitioners, university faculties, research fellows and undergraduates as well as graduate students who need to build a knowledge base of the most current advances and state-of-practice in the topics covered by this conference proceedings. This will enable them to produce, maintain, and manage systems with high levels of trustworthiness and complexity.
Text, Speech, and Dialogue
Author: Kamil Ekštein
Publisher: Springer Nature
ISBN: 303140498X
Category : Computers
Languages : en
Pages : 383
Book Description
This book constitutes the refereed proceedings of the 26th International Conference on Text, Speech, and Dialogue, TSD 2023, held in Pilsen, Czech Republic, during September 4–6, 2023. The 31 full papers presented together with the abstracts of 3 keynote talks were carefully reviewed and selected from 64 submissions. The conference attracts researchers not only from Central and Eastern Europe but also from other parts of the world. One of its goals has always been bringing together NLP researchers with various interests from different parts of the world and promoting their cooperation. One of the ambitions of the conference is, not only to deal with dialogue systems but also to improve dialogue among researchers in areas of NLP, i.e., among the “text” and the “speech” and the “dialogue” people.
Publisher: Springer Nature
ISBN: 303140498X
Category : Computers
Languages : en
Pages : 383
Book Description
This book constitutes the refereed proceedings of the 26th International Conference on Text, Speech, and Dialogue, TSD 2023, held in Pilsen, Czech Republic, during September 4–6, 2023. The 31 full papers presented together with the abstracts of 3 keynote talks were carefully reviewed and selected from 64 submissions. The conference attracts researchers not only from Central and Eastern Europe but also from other parts of the world. One of its goals has always been bringing together NLP researchers with various interests from different parts of the world and promoting their cooperation. One of the ambitions of the conference is, not only to deal with dialogue systems but also to improve dialogue among researchers in areas of NLP, i.e., among the “text” and the “speech” and the “dialogue” people.