Embedded Arabic Text Detection and Recognition in Videos PDF Download

Are you looking for read ebook online? Search for your book and save it on your Kindle device, PC, phones or tablets. Download Embedded Arabic Text Detection and Recognition in Videos PDF full book. Access full book title Embedded Arabic Text Detection and Recognition in Videos by Sonia Yousfi. Download full books in PDF and EPUB format.

Embedded Arabic Text Detection and Recognition in Videos

Embedded Arabic Text Detection and Recognition in Videos PDF Author: Sonia Yousfi
Publisher:
ISBN:
Category :
Languages : en
Pages : 129

Book Description
This thesis focuses on Arabic embedded text detection and recognition in videos. Different approaches robust to Arabic text variability (fonts, scales, sizes, etc.) as well as to environmental and acquisition condition challenges (contrasts, degradation, complex background, etc.) are proposed. We introduce different machine learning-based solutions for robust text detection without relying on any pre-processing. The first method is based on Convolutional Neural Networks (ConvNet) while the others use a specific boosting cascade to select relevant hand-crafted text features. For the text recognition, our methodology is segmentation-free. Text images are transformed into sequences of features using a multi-scale scanning scheme. Standing out from the dominant methodology of hand-crafted features, we propose to learn relevant text representations from data using different deep learning methods, namely Deep Auto-Encoders, ConvNets and unsupervised learning models. Each one leads to a specific OCR (Optical Character Recognition) solution. Sequence labeling is performed without any prior segmentation using a recurrent connectionist learning model. Proposed solutions are compared to other methods based on non-connectionist and hand-crafted features. In addition, we propose to enhance the recognition results using Recurrent Neural Network-based language models that are able to capture long-range linguistic dependencies. Both OCR and language model probabilities are incorporated in a joint decoding scheme where additional hyper-parameters are introduced to boost recognition results and reduce the response time. Given the lack of public multimedia Arabic datasets, we propose novel annotated datasets issued from Arabic videos. The OCR dataset, called ALIF, is publicly available for research purposes. As the best of our knowledge, it is first public dataset dedicated for Arabic video OCR. Our proposed solutions were extensively evaluated. Obtained results highlight the genericity and the efficiency of our approaches, reaching a word recognition rate of 88.63% on the ALIF dataset and outperforming well-known commercial OCR engine by more than 36%.

Embedded Arabic Text Detection and Recognition in Videos

Embedded Arabic Text Detection and Recognition in Videos PDF Author: Sonia Yousfi
Publisher:
ISBN:
Category :
Languages : en
Pages : 129

Book Description
This thesis focuses on Arabic embedded text detection and recognition in videos. Different approaches robust to Arabic text variability (fonts, scales, sizes, etc.) as well as to environmental and acquisition condition challenges (contrasts, degradation, complex background, etc.) are proposed. We introduce different machine learning-based solutions for robust text detection without relying on any pre-processing. The first method is based on Convolutional Neural Networks (ConvNet) while the others use a specific boosting cascade to select relevant hand-crafted text features. For the text recognition, our methodology is segmentation-free. Text images are transformed into sequences of features using a multi-scale scanning scheme. Standing out from the dominant methodology of hand-crafted features, we propose to learn relevant text representations from data using different deep learning methods, namely Deep Auto-Encoders, ConvNets and unsupervised learning models. Each one leads to a specific OCR (Optical Character Recognition) solution. Sequence labeling is performed without any prior segmentation using a recurrent connectionist learning model. Proposed solutions are compared to other methods based on non-connectionist and hand-crafted features. In addition, we propose to enhance the recognition results using Recurrent Neural Network-based language models that are able to capture long-range linguistic dependencies. Both OCR and language model probabilities are incorporated in a joint decoding scheme where additional hyper-parameters are introduced to boost recognition results and reduce the response time. Given the lack of public multimedia Arabic datasets, we propose novel annotated datasets issued from Arabic videos. The OCR dataset, called ALIF, is publicly available for research purposes. As the best of our knowledge, it is first public dataset dedicated for Arabic video OCR. Our proposed solutions were extensively evaluated. Obtained results highlight the genericity and the efficiency of our approaches, reaching a word recognition rate of 88.63% on the ALIF dataset and outperforming well-known commercial OCR engine by more than 36%.

Embedded Arabic Text Detection and Recognition in Videos

Embedded Arabic Text Detection and Recognition in Videos PDF Author: Sonia Yousfi
Publisher:
ISBN:
Category :
Languages : en
Pages : 0

Book Description
This thesis focuses on Arabic embedded text detection and recognition in videos. Different approaches robust to Arabic text variability (fonts, scales, sizes, etc.) as well as to environmental and acquisition condition challenges (contrasts, degradation, complex background, etc.) are proposed. We introduce different machine learning-based solutions for robust text detection without relying on any pre-processing. The first method is based on Convolutional Neural Networks (ConvNet) while the others use a specific boosting cascade to select relevant hand-crafted text features. For the text recognition, our methodology is segmentation-free. Text images are transformed into sequences of features using a multi-scale scanning scheme. Standing out from the dominant methodology of hand-crafted features, we propose to learn relevant text representations from data using different deep learning methods, namely Deep Auto-Encoders, ConvNets and unsupervised learning models. Each one leads to a specific OCR (Optical Character Recognition) solution. Sequence labeling is performed without any prior segmentation using a recurrent connectionist learning model. Proposed solutions are compared to other methods based on non-connectionist and hand-crafted features. In addition, we propose to enhance the recognition results using Recurrent Neural Network-based language models that are able to capture long-range linguistic dependencies. Both OCR and language model probabilities are incorporated in a joint decoding scheme where additional hyper-parameters are introduced to boost recognition results and reduce the response time. Given the lack of public multimedia Arabic datasets, we propose novel annotated datasets issued from Arabic videos. The OCR dataset, called ALIF, is publicly available for research purposes. As the best of our knowledge, it is first public dataset dedicated for Arabic video OCR. Our proposed solutions were extensively evaluated. Obtained results highlight the genericity and the efficiency of our approaches, reaching a word recognition rate of 88.63% on the ALIF dataset and outperforming well-known commercial OCR engine by more than 36%.

Cursive Script Text Recognition in Natural Scene Images

Cursive Script Text Recognition in Natural Scene Images PDF Author: Saad Bin Ahmed
Publisher: Springer Nature
ISBN: 9811512973
Category : Computers
Languages : en
Pages : 121

Book Description
This book offers a broad and structured overview of the state-of-the-art methods that could be applied for context-dependent languages like Arabic. It also provides guidelines on how to deal with Arabic scene data that appeared in an uncontrolled environment impacted by different font size, font styles, image resolution, and opacity of text. Being an intrinsic script, Arabic and Arabic-like languages attract attention from research community. There are a number of challenges associated with the detection and recognition of Arabic text from natural images. This book discusses these challenges and open problems and also provides insights into the complexities and issues that researchers encounter in the context of Arabic or Arabic-like text recognition in natural and document images. It sheds light on fundamental questions, such as a) How the complexity of Arabic as a cursive scripts can be demonstrated b) What the structure of Arabic text is and how to consider the features from a given text and c) What guidelines should be followed to address the context learning ability of classifiers existing in machine learning.

Document Image Processing

Document Image Processing PDF Author: Ergina Kavallieratou
Publisher: MDPI
ISBN: 3038971057
Category : Technology & Engineering
Languages : en
Pages : 217

Book Description
This book is a printed edition of the Special Issue "Document Image Processing" that was published in J. Imaging

Pattern Recognition. ICPR International Workshops and Challenges

Pattern Recognition. ICPR International Workshops and Challenges PDF Author: Alberto Del Bimbo
Publisher: Springer Nature
ISBN: 3030687937
Category : Computers
Languages : en
Pages : 502

Book Description
This 8-volumes set constitutes the refereed of the 25th International Conference on Pattern Recognition Workshops, ICPR 2020, held virtually in Milan, Italy and rescheduled to January 10 - 11, 2021 due to Covid-19 pandemic. The 416 full papers presented in these 8 volumes were carefully reviewed and selected from about 700 submissions. The 46 workshops cover a wide range of areas including machine learning, pattern analysis, healthcare, human behavior, environment, surveillance, forensics and biometrics, robotics and egovision, cultural heritage and document analysis, retrieval, and women at ICPR2020.

Video Text Detection

Video Text Detection PDF Author: Tong Lu
Publisher: Springer
ISBN: 1447165152
Category : Computers
Languages : en
Pages : 272

Book Description
This book presents a systematic introduction to the latest developments in video text detection. Opening with a discussion of the underlying theory and a brief history of video text detection, the text proceeds to cover pre-processing and post-processing techniques, character segmentation and recognition, identification of non-English scripts, techniques for multi-modal analysis and performance evaluation. The detection of text from both natural video scenes and artificially inserted captions is examined. Various applications of the technology are also reviewed, from license plate recognition and road navigation assistance, to sports analysis and video advertising systems. Features: explains the fundamental theory in a succinct manner, supplemented with references for further reading; highlights practical techniques to help the reader understand and develop their own video text detection systems and applications; serves as an easy-to-navigate reference, presenting the material in self-contained chapters.

Cognitively Inspired Video Text Processing

Cognitively Inspired Video Text Processing PDF Author: Palaiahnakote Shivakumara
Publisher: Springer Nature
ISBN: 9811670692
Category : Computers
Languages : en
Pages : 283

Book Description
As technologies are fast advancing, the importance of text detection and recognition is receiving special attention from the researchers. Thus, one can see several real-time applications of video text processing which requires cognitive-based methods to find a solution. The main applications are (1) retrieving and indexing video based on semantic of the content of the video, (2) machine translation to assist foreigners, (3) assisting blind people to walk on the road freely without aid, (4) automatic vehicle driving, (5) license plate tracing to catch vehicles which violate the traffic signals, (6) monitoring the images posted on social media based on text and content of the images, (7) identifying the location based on the address of the street and shops, etc., (8) tracing players in the sports based on the jersey/bib number or text, and (9) in the same way, tracing the bib number in case of marathon and other events. For the above-mentioned applications, text detection and recognition in video and natural scene images is an integral part of the system.

Guide to OCR for Arabic Scripts

Guide to OCR for Arabic Scripts PDF Author: Volker Märgner
Publisher: Springer Science & Business Media
ISBN: 1447140729
Category : Computers
Languages : en
Pages : 593

Book Description
This Guide to OCR for Arabic Scripts is the first book of its kind, specifically devoted to this emerging field. Topics and features: contains contributions from the leading researchers in the field; with a Foreword by Professor Bente Maegaard of the University of Copenhagen; presents a detailed overview of Arabic character recognition technology, covering a range of different aspects of pre-processing and feature extraction; reviews a broad selection of varying approaches, including HMM-based methods and a recognition system based on multidimensional recurrent neural networks; examines the evaluation of Arabic script recognition systems, discussing data collection and annotation, benchmarking strategies, and handwriting recognition competitions; describes numerous applications of Arabic script recognition technology, from historical Arabic manuscripts to online Arabic recognition.

Computer Networks and Inventive Communication Technologies

Computer Networks and Inventive Communication Technologies PDF Author: S. Smys
Publisher: Springer Nature
ISBN: 981193035X
Category : Technology & Engineering
Languages : en
Pages : 909

Book Description
This book is a collection of peer-reviewed best selected research papers presented at 5th International Conference on Computer Networks and Inventive Communication Technologies (ICCNCT 2022). The book covers new results in theory, methodology, and applications of computer networks and data communications. It includes original papers on computer networks, network protocols and wireless networks, data communication technologies, and network security. The proceedings of this conference is a valuable resource, dealing with both the important core and the specialized issues in the areas of next generation wireless network design, control, and management, as well as in the areas of protection, assurance, and trust in information security practice. It is a reference for researchers, instructors, students, scientists, engineers, managers, and industry practitioners for advance work in the area.

Advances in Cybersecurity, Cybercrimes, and Smart Emerging Technologies

Advances in Cybersecurity, Cybercrimes, and Smart Emerging Technologies PDF Author: Ahmed A. Abd El-Latif
Publisher: Springer Nature
ISBN: 3031211014
Category : Technology & Engineering
Languages : en
Pages : 336

Book Description
This book gathers the proceedings of the International conference on Cybersecurity, Cybercrimes, and Smart Emerging Technologies, held on May 10–11, 2022, in Riyadh, Saudi Arabia. The conference organized by the College of Computer Science of Prince Sultan University, Saudi Arabia. This book provides an opportunity to account for state-of-the-art works, future trends impacting cybersecurity, cybercrimes, and smart emerging technologies, that concern to organizations and individuals, thus creating new research opportunities, focusing on elucidating the challenges, opportunities, and inter-dependencies that are just around the corner. This book is helpful for students and researchers as well as practitioners. CCSET 2022 was devoted to advances in cybersecurity, cybercime, and smart emerging technologies. It was considered a meeting point for researchers and practitioners to implement advanced information technologies into various industries. There were 89 paper submissions from 25 countries. Each submission was reviewed by at least three chairs or PC members and 26 regular papers (30%) were accepted. Unfortunately, due to limitations of conference topics and edited volumes, the Program Committee was forced to reject some interesting papers, which did not satisfy these topics or publisher requirements.