The Role of Syntax and Semantics in Machine Translation and Quality Estimation of Machine-translated User-generated Content PDF Download

Are you looking for read ebook online? Search for your book and save it on your Kindle device, PC, phones or tablets. Download The Role of Syntax and Semantics in Machine Translation and Quality Estimation of Machine-translated User-generated Content PDF full book. Access full book title The Role of Syntax and Semantics in Machine Translation and Quality Estimation of Machine-translated User-generated Content by Rasoul Samad Zadeh Kaljahi. Download full books in PDF and EPUB format.

The Role of Syntax and Semantics in Machine Translation and Quality Estimation of Machine-translated User-generated Content

The Role of Syntax and Semantics in Machine Translation and Quality Estimation of Machine-translated User-generated Content PDF Author: Rasoul Samad Zadeh Kaljahi
Publisher:
ISBN:
Category :
Languages : en
Pages : 0

Book Description


The Role of Syntax and Semantics in Machine Translation and Quality Estimation of Machine-translated User-generated Content

The Role of Syntax and Semantics in Machine Translation and Quality Estimation of Machine-translated User-generated Content PDF Author: Rasoul Samad Zadeh Kaljahi
Publisher:
ISBN:
Category :
Languages : en
Pages : 0

Book Description


Quality Estimation for Machine Translation

Quality Estimation for Machine Translation PDF Author: Lucia Specia
Publisher: Springer Nature
ISBN: 3031021681
Category : Computers
Languages : en
Pages : 148

Book Description
Many applications within natural language processing involve performing text-to-text transformations, i.e., given a text in natural language as input, systems are required to produce a version of this text (e.g., a translation), also in natural language, as output. Automatically evaluating the output of such systems is an important component in developing text-to-text applications. Two approaches have been proposed for this problem: (i) to compare the system outputs against one or more reference outputs using string matching-based evaluation metrics and (ii) to build models based on human feedback to predict the quality of system outputs without reference texts. Despite their popularity, reference-based evaluation metrics are faced with the challenge that multiple good (and bad) quality outputs can be produced by text-to-text approaches for the same input. This variation is very hard to capture, even with multiple reference texts. In addition, reference-based metrics cannot be used in production (e.g., online machine translation systems), when systems are expected to produce outputs for any unseen input. In this book, we focus on the second set of metrics, so-called Quality Estimation (QE) metrics, where the goal is to provide an estimate on how good or reliable the texts produced by an application are without access to gold-standard outputs. QE enables different types of evaluation that can target different types of users and applications. Machine learning techniques are used to build QE models with various types of quality labels and explicit features or learnt representations, which can then predict the quality of unseen system outputs. This book describes the topic of QE for text-to-text applications, covering quality labels, features, algorithms, evaluation, uses, and state-of-the-art approaches. It focuses on machine translation as application, since this represents most of the QE work done to date. It also briefly describes QE for several other applications, including text simplification, text summarization, grammatical error correction, and natural language generation.

From Syntax to Semantics

From Syntax to Semantics PDF Author: Erich Steiner
Publisher: Intellect Books
ISBN:
Category : Computers
Languages : en
Pages : 280

Book Description
Machine translation is a central aspect of research in artifical intelligence. This book is written in the context of the Machine Translation (MT) project EUROTRA, a multi-lingual MT-project putting special emphasis on the definition of semantic representation.

Metataxis

Metataxis PDF Author: Klaus Schubert
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 311087606X
Category : Language Arts & Disciplines
Languages : en
Pages : 252

Book Description
No detailed description available for "Metataxis".

Machine Translation of User-generated Content

Machine Translation of User-generated Content PDF Author: Pintu Lohar
Publisher:
ISBN:
Category :
Languages : en
Pages : 0

Book Description
The world of social media has undergone huge evolution during the last few years. With the spread of social media and online forums, individual users actively participate in the generation of online content in different languages from all over the world. Sharing of online content has become much easier than before with the advent of popular websites such as Twitter, Facebook etc. Such content is referred to as 'User-Generated Content' (UGC). Some examples of UGC are user reviews, customer feedback, tweets etc. In general, UGC is informal and noisy in terms of linguistic norms. Such noise does not create significant problems for human to understand the content, but it can pose challenges for several natural language processing applications such as parsing, sentiment analysis, machine translation (MT), etc. An additional challenge for MT is sparseness of bilingual (translated) parallel UGC corpora. In this research, we explore the general issues in MT of UGC and set some research goals from our findings. One of our main goals is to exploit comparable corpora in order to extract parallel or semantically similar sentences. To accomplish this task, we design a document alignment system to extract semantically similar bilingual document pairs using the bilingual comparable corpora. We then apply strategies to extract parallel or semantically similar sentences from comparable corpora by transforming the document alignment system into a sentence alignment system. We seek to improve the quality of parallel data extraction for UGC translation and assemble the extracted data with the existing human translated resources. Another objective of this research is to demonstrate the usefulness of MT-based sentiment analysis. However, when using openly available systems such as Google Translate, the translation process may alter the sentiment in the target language. To cope with this phenomenon, we instead build fine-grained sentiment translation models that focus on sentiment preservation in the target language during translation.

Linguistic Issues in Machine Translation

Linguistic Issues in Machine Translation PDF Author: Frank Van Eydne
Publisher: Bloomsbury Publishing
ISBN: 1474246559
Category : Language Arts & Disciplines
Languages : en
Pages : 258

Book Description
The key assumption in this text is that machine translation is not merely a mechanical process but in fact requires a high level of linguistic sophistication, as the nuances of syntax, semantics and intonation cannot always be conveyed by modern technology. The increasing dependence on artificial communication by private and corporate users makes this research area an invaluable element when teaching linguistic theory.

Machine Translation

Machine Translation PDF Author: John Lehrberger
Publisher: John Benjamins Publishing
ISBN: 9027286205
Category : Language Arts & Disciplines
Languages : en
Pages : 257

Book Description
The use of the computer in translating natural languages ranges from that of a translator's aid for word processing and dictionary lookup to that of a full-fledged translator on its own. However the obstacles to translating by means of the computer are primarily linguistic. To overcome them it is necessary to resolve the ambiguities that pervade a natural language when words and sentences are viewed in isolation. The problem then is to formalize, in the computer, these aspects of natural language understanding. The authors show how, from a linguistic point of view, one may form some idea of what goes on inside a system's black box, given only the input (original text) and the raw output (translated text before post-editing). Many examples of English/French translation are used to illustrate the principles involved.

Syntax-based Statistical Machine Translation

Syntax-based Statistical Machine Translation PDF Author: Philip Williams
Publisher: Springer Nature
ISBN: 3031021649
Category : Computers
Languages : en
Pages : 190

Book Description
This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models. The book begins with an overview of synchronous-context free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, including search approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.

Translation Quality Assessment

Translation Quality Assessment PDF Author: Joss Moorkens
Publisher: Springer
ISBN: 3319912410
Category : Computers
Languages : en
Pages : 292

Book Description
This is the first volume that brings together research and practice from academic and industry settings and a combination of human and machine translation evaluation. Its comprehensive collection of papers by leading experts in human and machine translation quality and evaluation who situate current developments and chart future trends fills a clear gap in the literature. This is critical to the successful integration of translation technologies in the industry today, where the lines between human and machine are becoming increasingly blurred by technology: this affects the whole translation landscape, from students and trainers to project managers and professionals, including in-house and freelance translators, as well as, of course, translation scholars and researchers. The editors have broad experience in translation quality evaluation research, including investigations into professional practice with qualitative and quantitative studies, and the contributors are leading experts in their respective fields, providing a unique set of complementary perspectives on human and machine translation quality and evaluation, combining theoretical and applied approaches.

Machine Translation with Minimal Reliance on Parallel Resources

Machine Translation with Minimal Reliance on Parallel Resources PDF Author: George Tambouratzis
Publisher: Springer
ISBN: 3319631071
Category : Computers
Languages : en
Pages : 92

Book Description
This book provides a unified view on a new methodology for Machine Translation (MT). This methodology extracts information from widely available resources (extensive monolingual corpora) while only assuming the existence of a very limited parallel corpus, thus having a unique starting point to Statistical Machine Translation (SMT). In this book, a detailed presentation of the methodology principles and system architecture is followed by a series of experiments, where the proposed system is compared to other MT systems using a set of established metrics including BLEU, NIST, Meteor and TER. Additionally, a free-to-use code is available, that allows the creation of new MT systems. The volume is addressed to both language professionals and researchers. Prerequisites for the readers are very limited and include a basic understanding of the machine translation as well as of the basic tools of natural language processing.​