Author: K.E. Manjunath
Publisher:
ISBN: 9783030807429
Category :
Languages : en
Pages : 0
Book Description
The book presents current research and developments in multilingual speech recognition. The author presents a Multilingual Phone Recognition System (Multi-PRS), developed using a common multilingual phone-set derived from the International Phonetic Alphabet (IPA) based transcription of six Indian languages - Kannada, Telugu, Bengali, Odia, Urdu, and Assamese. The author shows how the performance of Multi-PRS can be improved using tandem features. The book compares Monolingual Phone Recognition Systems (Mono-PRS) with Multi-PRS, and baseline systems with tandem systems. Methods are proposed to predict Articulatory Features (AFs) from spectral features using Deep Neural Networks (DNNs). Multitask learning is explored to improve the prediction accuracy of AFs. The AFs are then used to improve the performance of Multi-PRS through the lattice rescoring and tandem methods of combination. The author goes on to develop and evaluate two approaches: Language Identification followed by Monolingual phone recognition (LID-Mono), and multilingual phone recognition based on a common multilingual phone-set.
Multilingual Phone Recognition in Indian Languages
Author: K.E. Manjunath
Publisher: Springer Nature
ISBN: 303080741X
Category : Technology & Engineering
Languages : en
Pages : 113
Book Description
The book presents current research and developments in multilingual speech recognition. The author presents a Multilingual Phone Recognition System (Multi-PRS), developed using a common multilingual phone-set derived from the International Phonetic Alphabet (IPA) based transcription of six Indian languages - Kannada, Telugu, Bengali, Odia, Urdu, and Assamese. The author shows how the performance of Multi-PRS can be improved using tandem features. The book compares Monolingual Phone Recognition Systems (Mono-PRS) with Multi-PRS, and baseline systems with tandem systems. Methods are proposed to predict Articulatory Features (AFs) from spectral features using Deep Neural Networks (DNNs). Multitask learning is explored to improve the prediction accuracy of AFs. The AFs are then used to improve the performance of Multi-PRS through the lattice rescoring and tandem methods of combination. The author goes on to develop and evaluate two approaches: Language Identification followed by Monolingual phone recognition (LID-Mono), and multilingual phone recognition based on a common multilingual phone-set.
Advances in Signal Processing and Intelligent Recognition Systems
Author: Sabu M. Thampi
Publisher: Springer Nature
ISBN: 9811548285
Category : Computers
Languages : en
Pages : 414
Book Description
This book constitutes the refereed proceedings of the 5th International Symposium on Advances in Signal Processing and Intelligent Recognition Systems, SIRS 2019, held in Trivandrum, India, in December 2019. The 19 revised full papers and 8 revised short papers presented were carefully reviewed and selected from 63 submissions. The papers cover a wide range of research fields, including information retrieval, human-computer interaction (HCI), information extraction, and speech recognition.
Smart and Sustainable Intelligent Systems
Author: Namita Gupta
Publisher: John Wiley & Sons
ISBN: 111975058X
Category : Computers
Languages : en
Pages : 576
Book Description
The world is experiencing an unprecedented period of change and growth driven by electronic and technological developments, and everyone on the planet has been affected. What was once ‘science fiction’ is today a reality. This book explores many of these once-unthinkable advancements by explaining current technologies in great detail. Each chapter focuses on a different aspect - Machine Vision, Pattern Analysis and Image Processing - Advanced Trends in Computational Intelligence and Data Analytics - Futuristic Communication Technologies - Disruptive Technologies for Future Sustainability. The chapters cover topics spanning all areas of smart intelligent systems and computing, such as Data Mining with Soft Computing, Evolutionary Computing, Quantum Computing, Expert Systems, Next Generation Communication, Blockchain and Trust Management, Intelligent Biometrics, Multi-Valued Logical Systems, and Cloud Computing and Security. An extensive list of bibliographic references at the end of each chapter guides readers who wish to probe further into an application area of interest.
Automatic Speech Recognition and Translation for Low Resource Languages
Author: L. Ashok Kumar
Publisher: John Wiley & Sons
ISBN: 1394214170
Category : Computers
Languages : en
Pages : 428
Book Description
This book is a comprehensive exploration of the cutting-edge research, methodologies, and advancements that address the unique challenges of ASR and translation for low-resource languages. Automatic Speech Recognition and Translation for Low Resource Languages contains research from experts sharing innovative solutions to language challenges in low-resource environments. The book begins with the fundamental concepts of ASR and translation, providing readers with a solid foundation for understanding the subsequent chapters. It then explores the intricacies of low-resource languages, analyzing the factors that contribute to their challenges and the significance of developing tailored solutions to overcome them. The chapters encompass a wide range of topics, covering both the theoretical and practical aspects of ASR and translation for low-resource languages. The book discusses data augmentation techniques, transfer learning, and multilingual training approaches that leverage existing linguistic resources to improve accuracy and performance. Additionally, it investigates the possibilities offered by unsupervised and semi-supervised learning, as well as the benefits of active learning and crowdsourcing in enriching the training data. Throughout the book, emphasis is placed on the importance of considering the cultural and linguistic context of low-resource languages, recognizing the unique nuances and intricacies that influence accurate ASR and translation. Furthermore, the book explores the potential impact of these technologies in various domains, such as healthcare, education, and commerce, empowering individuals and communities by breaking down language barriers.
Audience: The book targets researchers and professionals in the fields of natural language processing, computational linguistics, and speech technology. It will also be of interest to engineers, linguists, and individuals in industries and organizations working on cross-lingual communication, accessibility, and global connectivity.
Proceedings of the Multi-Conference 2011
Author: Himanshu B. Soni
Publisher: Universal-Publishers
ISBN: 1612330029
Category :
Languages : en
Pages : 1074
Book Description
The International Conference on Signals, Systems and Automation (ICSSA 2011) aims to spread awareness in the research and academic community regarding cutting-edge technological advancements revolutionizing the world. The main emphasis of this conference is on dissemination of information, experience, and research results on the current topics of interest through in-depth discussions and participation of researchers from all over the world. The objective is to provide a platform to scientists, research scholars, and industrialists for interacting and exchanging ideas in a number of research areas. This will facilitate communication among researchers in different fields of Electronics and Communication Engineering. The International Conference on Intelligent System and Data Processing (ICISD 2011) is organized to address various issues that will foster the creation of intelligent solutions in the future. The primary goal of the conference is to bring together worldwide leading researchers, developers, practitioners, and educators interested in advancing the state of the art in computational intelligence and data processing for exchanging knowledge that encompasses a broad range of disciplines among various distinct communities. Another goal is to promote scientific information interchange between researchers, developers, engineers, students, and practitioners working in India and abroad.
Innovations in Bio-Inspired Computing and Applications
Author: Ajith Abraham
Publisher: Springer Nature
ISBN: 3031274997
Category : Technology & Engineering
Languages : en
Pages : 951
Book Description
This book highlights recent research on bio-inspired computing and its various innovative applications in information and communication technologies. It presents 85 high-quality papers from the 13th International Conference on Innovations in Bio-Inspired Computing and Applications (IBICA 2022) and the 12th World Congress on Information and Communication Technologies (WICT 2022), which were held online during 15–17 December 2022. As a premier conference, IBICA–WICT brings together researchers, engineers and practitioners whose work involves bio-inspired computing, computational intelligence and their applications in information security, real-world contexts, etc. Including contributions by authors from 25 countries, the book offers a valuable reference guide for all researchers, students and practitioners in the fields of Computer Science and Engineering.
Language Identification Using Spectral and Prosodic Features
Author: K. Sreenivasa Rao
Publisher: Springer
ISBN: 3319171631
Category : Technology & Engineering
Languages : en
Pages : 106
Book Description
This book discusses the impact of spectral features extracted from frame level, glottal closure regions, and pitch-synchronous analysis on the performance of language identification systems. In addition to spectral features, the authors explore prosodic features such as intonation, rhythm, and stress for discriminating between languages. They show how the proposed spectral and prosodic features capture language-specific information from two complementary aspects, and how developing a language identification (LID) system that combines spectral and prosodic features enhances the accuracy of identification and improves the robustness of the system. The book provides methods to extract the spectral and prosodic features at various levels, and also suggests appropriate models for developing robust LID systems according to specific spectral and prosodic features. Finally, the book discusses various combinations of spectral and prosodic features, and the models suited to enhancing the performance of LID systems.
Phoneme-Based Speech Segmentation using Hybrid Soft Computing Framework
Author: Mousmita Sarma
Publisher: Springer
ISBN: 8132218620
Category : Technology & Engineering
Languages : en
Pages : 199
Book Description
The book discusses intelligent system design using soft computing and similar systems and their interdisciplinary applications. It also focuses on the recent trends to use soft computing as a versatile tool for designing a host of decision support systems.
Advanced Machine Intelligence and Signal Processing
Author: Deepak Gupta
Publisher: Springer Nature
ISBN: 9811908400
Category : Technology & Engineering
Languages : en
Pages : 859
Book Description
This book covers the latest advancements in the areas of machine learning, computer vision, pattern recognition, computational learning theory, big data analytics, network intelligence, signal processing, and their applications in the real world. The machine learning topics include feature extraction, variants of the support vector machine (SVM), the extreme learning machine (ELM), the artificial neural network (ANN), and other areas of machine learning. The mathematical analysis of computer vision and pattern recognition involves the use of geometric techniques, scene understanding and modeling from video, 3D object recognition, localization and tracking, medical image analysis, and so on. Computational learning theory involves different kinds of learning, such as incremental, online, reinforcement, manifold, multitask, and semi-supervised learning. Further, the book covers the real-time challenges involved in big data analytics and stream processing with the integration of smart data computing services and interconnectivity. Additionally, it covers recent developments in network intelligence for analyzing network information and thereby adapting algorithms dynamically to improve efficiency. Finally, it includes progress in signal processing for normal and abnormal categories of real-world signals, for instance signals generated from IoT devices, smart systems, speech, and video, as well as biomedical signal processing: electrocardiogram (ECG), electroencephalogram (EEG), magnetoencephalography (MEG), and electromyogram (EMG).