Author: K.R. Rao
Publisher: CRC Press
ISBN: 1000794636
Category : Technology & Engineering
Languages : en
Pages : 319
Book Description
High Efficiency Video Coding and Other Emerging Standards provides an overview of high efficiency video coding (HEVC) and all its extensions and profiles. There are nearly 300 projects and problems included, and about 400 references related to HEVC alone. Next generation video coding (NGVC) beyond HEVC is also described. Other video coding standards such as AVS2, DAALA, THOR, VP9 (Google), DIRAC, VC1, and AV1 are addressed, and image coding standards such as JPEG, JPEG-LS, JPEG2000, JPEG XR, JPEG XS, JPEG XT and JPEG-Pleno are also listed.Understanding of these standards and their implementation is facilitated by overview papers, standards documents, reference software, software manuals, test sequences, source codes, tutorials, keynote speakers, panel discussions, reflector and ftp/web sites – all in the public domain. Access to these categories is also provided.
High Efficiency Video Coding and Other Emerging Standards
Video coding standards
Author: K.R. Rao
Publisher: Springer Science & Business Media
ISBN: 9400767420
Category : Science
Languages : en
Pages : 515
Book Description
The requirements for multimedia (especially video and audio) communications increase rapidly in the last two decades in broad areas such as television, entertainment, interactive services, telecommunications, conference, medicine, security, business, traffic, defense and banking. Video and audio coding standards play most important roles in multimedia communications. In order to meet these requirements, series of video and audio coding standards have been developed such as MPEG-2, MPEG-4, MPEG-21 for audio and video by ISO/IEC, H.26x for video and G.72x for audio by ITU-T, Video Coder 1 (VC-1) for video by the Society of Motion Picture and Television Engineers (SMPTE) and RealVideo (RV) 9 for video by Real Networks. AVS China is the abbreviation for Audio Video Coding Standard of China. This new standard includes four main technical areas, which are systems, video, audio and digital copyright management (DRM), and some supporting documents such as consistency verification. The second part of the standard known as AVS1-P2 (Video - Jizhun) was approved as the national standard of China in 2006, and several final drafts of the standard have been completed, including AVS1-P1 (System - Broadcast), AVS1-P2 (Video - Zengqiang), AVS1-P3 (Audio - Double track), AVS1-P3 (Audio - 5.1), AVS1-P7 (Mobile Video), AVS-S-P2 (Video) and AVS-S-P3 (Audio). AVS China provides a technical solution for many applications such as digital broadcasting (SDTV and HDTV), high-density storage media, Internet streaming media, and will be used in the domestic IPTV, satellite and possibly the cable TV market. Comparing with other coding standards such as H.264 AVC, the advantages of AVS video standard include similar performance, lower complexity, lower implementation cost and licensing fees. This standard has attracted great deal of attention from industries related to television, multimedia communications and even chip manufacturing from around the world. Also many well known companies have joined the AVS Group to be Full Members or Observing Members. The 163 members of AVS Group include Texas Instruments (TI) Co., Agilent Technologies Co. Ltd., Envivio Inc., NDS, Philips Research East Asia, Aisino Corporation, LG, Alcatel Shanghai Bell Co. Ltd., Nokia (China) Investment (NCIC) Co. Ltd., Sony (China) Ltd., and Toshiba (China) Co. Ltd. as well as some high level universities in China. Thus there is a pressing need from the instructors, students, and engineers for a book dealing with the topic of AVS China and its performance comparisons with similar standards such as H.264, VC-1 and RV-9.
Publisher: Springer Science & Business Media
ISBN: 9400767420
Category : Science
Languages : en
Pages : 515
Book Description
The requirements for multimedia (especially video and audio) communications increase rapidly in the last two decades in broad areas such as television, entertainment, interactive services, telecommunications, conference, medicine, security, business, traffic, defense and banking. Video and audio coding standards play most important roles in multimedia communications. In order to meet these requirements, series of video and audio coding standards have been developed such as MPEG-2, MPEG-4, MPEG-21 for audio and video by ISO/IEC, H.26x for video and G.72x for audio by ITU-T, Video Coder 1 (VC-1) for video by the Society of Motion Picture and Television Engineers (SMPTE) and RealVideo (RV) 9 for video by Real Networks. AVS China is the abbreviation for Audio Video Coding Standard of China. This new standard includes four main technical areas, which are systems, video, audio and digital copyright management (DRM), and some supporting documents such as consistency verification. The second part of the standard known as AVS1-P2 (Video - Jizhun) was approved as the national standard of China in 2006, and several final drafts of the standard have been completed, including AVS1-P1 (System - Broadcast), AVS1-P2 (Video - Zengqiang), AVS1-P3 (Audio - Double track), AVS1-P3 (Audio - 5.1), AVS1-P7 (Mobile Video), AVS-S-P2 (Video) and AVS-S-P3 (Audio). AVS China provides a technical solution for many applications such as digital broadcasting (SDTV and HDTV), high-density storage media, Internet streaming media, and will be used in the domestic IPTV, satellite and possibly the cable TV market. Comparing with other coding standards such as H.264 AVC, the advantages of AVS video standard include similar performance, lower complexity, lower implementation cost and licensing fees. This standard has attracted great deal of attention from industries related to television, multimedia communications and even chip manufacturing from around the world. Also many well known companies have joined the AVS Group to be Full Members or Observing Members. The 163 members of AVS Group include Texas Instruments (TI) Co., Agilent Technologies Co. Ltd., Envivio Inc., NDS, Philips Research East Asia, Aisino Corporation, LG, Alcatel Shanghai Bell Co. Ltd., Nokia (China) Investment (NCIC) Co. Ltd., Sony (China) Ltd., and Toshiba (China) Co. Ltd. as well as some high level universities in China. Thus there is a pressing need from the instructors, students, and engineers for a book dealing with the topic of AVS China and its performance comparisons with similar standards such as H.264, VC-1 and RV-9.
Versatile Video Coding
Author: Humberto Ochoa Dominguez
Publisher: CRC Press
ISBN: 1000795055
Category : Technology & Engineering
Languages : en
Pages : 458
Book Description
Video is the main driver of bandwidth use, accounting for over 80 per cent of consumer Internet traffic. Video compression is a critical component of many of the available multimedia applications, it is necessary for storage or transmission of digital video over today's band-limited networks. The majority of this video is coded using international standards developed in collaboration with ITU-T Study Group and MPEG. The MPEG family of video coding standards begun on the early 1990s with MPEG-1, developed for video and audio storage on CD-ROMs, with support for progressive video. MPEG-2 was standardized in 1995 for applications of video on DVD, standard and high definition television, with support for interlaced and progressive video. MPEG-4 part 2, also known as MPEG-2 video, was standardized in 1999 for applications of low- bit rate multimedia on mobile platforms and the Internet, with the support of object-based or content based coding by modeling the scene as background and foreground. Since MPEG-1, the main video coding standards were based on the so-called macroblocks. However, research groups continued the work beyond the traditional video coding architectures and found that macroblocks could limit the performance of the compression when using high-resolution video. Therefore, in 2013 the high efficiency video coding (HEVC) also known and H.265, was released, with a structure similar to H.264/AVC but using coding units with more flexible partitions than the traditional macroblocks. HEVC has greater flexibility in prediction modes and transform block sizes, also it has a more sophisticated interpolation and de blocking filters. In 2006 the VC-1 was released. VC-1 is a video codec implemented by Microsoft and the Microsoft Windows Media Video (VMW) 9 and standardized by the Society of Motion Picture and Television Engineers (SMPTE). In 2017 the Joint Video Experts Team (JVET) released a call for proposals for a new video coding standard initially called Beyond the HEVC, Future Video Coding (FVC) or known as Versatile Video Coding (VVC). VVC is being built on top of HEVC for application on Standard Dynamic Range (SDR), High Dynamic Range (HDR) and 360° Video. The VVC is planned to be finalized by 2020. This book presents the new VVC, and updates on the HEVC. The book discusses the advances in lossless coding and covers the topic of screen content coding. Technical topics discussed include: Beyond the High Efficiency Video CodingHigh Efficiency Video Coding encoderScreen contentLossless and visually lossless coding algorithmsFast coding algorithmsVisual quality assessmentOther screen content coding algorithmsOverview of JPEG Series
Publisher: CRC Press
ISBN: 1000795055
Category : Technology & Engineering
Languages : en
Pages : 458
Book Description
Video is the main driver of bandwidth use, accounting for over 80 per cent of consumer Internet traffic. Video compression is a critical component of many of the available multimedia applications, it is necessary for storage or transmission of digital video over today's band-limited networks. The majority of this video is coded using international standards developed in collaboration with ITU-T Study Group and MPEG. The MPEG family of video coding standards begun on the early 1990s with MPEG-1, developed for video and audio storage on CD-ROMs, with support for progressive video. MPEG-2 was standardized in 1995 for applications of video on DVD, standard and high definition television, with support for interlaced and progressive video. MPEG-4 part 2, also known as MPEG-2 video, was standardized in 1999 for applications of low- bit rate multimedia on mobile platforms and the Internet, with the support of object-based or content based coding by modeling the scene as background and foreground. Since MPEG-1, the main video coding standards were based on the so-called macroblocks. However, research groups continued the work beyond the traditional video coding architectures and found that macroblocks could limit the performance of the compression when using high-resolution video. Therefore, in 2013 the high efficiency video coding (HEVC) also known and H.265, was released, with a structure similar to H.264/AVC but using coding units with more flexible partitions than the traditional macroblocks. HEVC has greater flexibility in prediction modes and transform block sizes, also it has a more sophisticated interpolation and de blocking filters. In 2006 the VC-1 was released. VC-1 is a video codec implemented by Microsoft and the Microsoft Windows Media Video (VMW) 9 and standardized by the Society of Motion Picture and Television Engineers (SMPTE). In 2017 the Joint Video Experts Team (JVET) released a call for proposals for a new video coding standard initially called Beyond the HEVC, Future Video Coding (FVC) or known as Versatile Video Coding (VVC). VVC is being built on top of HEVC for application on Standard Dynamic Range (SDR), High Dynamic Range (HDR) and 360° Video. The VVC is planned to be finalized by 2020. This book presents the new VVC, and updates on the HEVC. The book discusses the advances in lossless coding and covers the topic of screen content coding. Technical topics discussed include: Beyond the High Efficiency Video CodingHigh Efficiency Video Coding encoderScreen contentLossless and visually lossless coding algorithmsFast coding algorithmsVisual quality assessmentOther screen content coding algorithmsOverview of JPEG Series
Computational Modeling of Objects Presented in Images
Author: Paolo Di Giamberardino
Publisher: Springer Science & Business Media
ISBN: 3319040391
Category : Technology & Engineering
Languages : en
Pages : 315
Book Description
This book contains extended versions of selected papers from the 3rd edition of the International Symposium CompIMAGE. These contributions include cover methods of signal and image processing and analysis to tackle problems found in medicine, material science, surveillance, biometric, robotics, defence, satellite data, traffic analysis and architecture, image segmentation, 2D and 3D reconstruction, data acquisition, interpolation and registration, data visualization, motion and deformation analysis and 3D vision.
Publisher: Springer Science & Business Media
ISBN: 3319040391
Category : Technology & Engineering
Languages : en
Pages : 315
Book Description
This book contains extended versions of selected papers from the 3rd edition of the International Symposium CompIMAGE. These contributions include cover methods of signal and image processing and analysis to tackle problems found in medicine, material science, surveillance, biometric, robotics, defence, satellite data, traffic analysis and architecture, image segmentation, 2D and 3D reconstruction, data acquisition, interpolation and registration, data visualization, motion and deformation analysis and 3D vision.
Advances in Visual Data Compression and Communication
Author: Feng Wu
Publisher: CRC Press
ISBN: 1482234130
Category : Technology & Engineering
Languages : en
Pages : 517
Book Description
Visual information is one of the richest and most bandwidth-consuming modes of communication. To meet the requirements of emerging applications, powerful data compression and transmission techniques are required to achieve highly efficient communication, even in the presence of growing communication channels that offer increased bandwidth. Presenting the results of the author’s years of research on visual data compression and transmission, Advances in Visual Data Compression and Communication: Meeting the Requirements of New Applications provides a theoretical and technical basis for advanced research on visual data compression and communication. The book studies the drifting problem in scalable video coding, analyzes the reasons causing the problem, and proposes various solutions to the problem. It explores the author’s Barbell-based lifting coding scheme that has been adopted as common software by MPEG. It also proposes a unified framework for deriving a directional transform from the nondirectional counterpart. The structure of the framework and the statistic distribution of coefficients are similar to those of the nondirectional transforms, which facilitates subsequent entropy coding. Exploring the visual correlation that exists in media, the text extends the current coding framework from different aspects, including advanced image synthesis—from description and reconstruction to organizing correlated images as a pseudo sequence. It explains how to apply compressive sensing to solve the data compression problem during transmission and covers novel research on compressive sensor data gathering, random projection codes, and compressive modulation. For analog and digital transmission technologies, the book develops the pseudo-analog transmission for media and explores cutting-edge research on distributed pseudo-analog transmission, denoising in pseudo-analog transmission, and supporting MIMO. It concludes by considering emerging developments of information theory for future applications.
Publisher: CRC Press
ISBN: 1482234130
Category : Technology & Engineering
Languages : en
Pages : 517
Book Description
Visual information is one of the richest and most bandwidth-consuming modes of communication. To meet the requirements of emerging applications, powerful data compression and transmission techniques are required to achieve highly efficient communication, even in the presence of growing communication channels that offer increased bandwidth. Presenting the results of the author’s years of research on visual data compression and transmission, Advances in Visual Data Compression and Communication: Meeting the Requirements of New Applications provides a theoretical and technical basis for advanced research on visual data compression and communication. The book studies the drifting problem in scalable video coding, analyzes the reasons causing the problem, and proposes various solutions to the problem. It explores the author’s Barbell-based lifting coding scheme that has been adopted as common software by MPEG. It also proposes a unified framework for deriving a directional transform from the nondirectional counterpart. The structure of the framework and the statistic distribution of coefficients are similar to those of the nondirectional transforms, which facilitates subsequent entropy coding. Exploring the visual correlation that exists in media, the text extends the current coding framework from different aspects, including advanced image synthesis—from description and reconstruction to organizing correlated images as a pseudo sequence. It explains how to apply compressive sensing to solve the data compression problem during transmission and covers novel research on compressive sensor data gathering, random projection codes, and compressive modulation. For analog and digital transmission technologies, the book develops the pseudo-analog transmission for media and explores cutting-edge research on distributed pseudo-analog transmission, denoising in pseudo-analog transmission, and supporting MIMO. It concludes by considering emerging developments of information theory for future applications.
RGB-D Image Analysis and Processing
Author: Paul L. Rosin
Publisher: Springer Nature
ISBN: 3030286037
Category : Computers
Languages : en
Pages : 522
Book Description
This book focuses on the fundamentals and recent advances in RGB-D imaging as well as covering a range of RGB-D applications. The topics covered include: data acquisition, data quality assessment, filling holes, 3D reconstruction, SLAM, multiple depth camera systems, segmentation, object detection, salience detection, pose estimation, geometric modelling, fall detection, autonomous driving, motor rehabilitation therapy, people counting and cognitive service robots. The availability of cheap RGB-D sensors has led to an explosion over the last five years in the capture and application of colour plus depth data. The addition of depth data to regular RGB images vastly increases the range of applications, and has resulted in a demand for robust and real-time processing of RGB-D data. There remain many technical challenges, and RGB-D image processing is an ongoing research area. This book covers the full state of the art, and consists of a series of chapters by internationally renowned experts in the field. Each chapter is written so as to provide a detailed overview of that topic. RGB-D Image Analysis and Processing will enable both students and professional developers alike to quickly get up to speed with contemporary techniques, and apply RGB-D imaging in their own projects.
Publisher: Springer Nature
ISBN: 3030286037
Category : Computers
Languages : en
Pages : 522
Book Description
This book focuses on the fundamentals and recent advances in RGB-D imaging as well as covering a range of RGB-D applications. The topics covered include: data acquisition, data quality assessment, filling holes, 3D reconstruction, SLAM, multiple depth camera systems, segmentation, object detection, salience detection, pose estimation, geometric modelling, fall detection, autonomous driving, motor rehabilitation therapy, people counting and cognitive service robots. The availability of cheap RGB-D sensors has led to an explosion over the last five years in the capture and application of colour plus depth data. The addition of depth data to regular RGB images vastly increases the range of applications, and has resulted in a demand for robust and real-time processing of RGB-D data. There remain many technical challenges, and RGB-D image processing is an ongoing research area. This book covers the full state of the art, and consists of a series of chapters by internationally renowned experts in the field. Each chapter is written so as to provide a detailed overview of that topic. RGB-D Image Analysis and Processing will enable both students and professional developers alike to quickly get up to speed with contemporary techniques, and apply RGB-D imaging in their own projects.
Group and Crowd Behavior for Computer Vision
Author: Vittorio Murino
Publisher: Academic Press
ISBN: 0128092807
Category : Computers
Languages : en
Pages : 440
Book Description
Group and Crowd Behavior for Computer Vision provides a multidisciplinary perspective on how to solve the problem of group and crowd analysis and modeling, combining insights from the social sciences with technological ideas in computer vision and pattern recognition. The book answers many unresolved issues in group and crowd behavior, with Part One providing an introduction to the problems of analyzing groups and crowds that stresses that they should not be considered as completely diverse entities, but as an aggregation of people. Part Two focuses on features and representations with the aim of recognizing the presence of groups and crowds in image and video data. It discusses low level processing methods to individuate when and where a group or crowd is placed in the scene, spanning from the use of people detectors toward more ad-hoc strategies to individuate group and crowd formations. Part Three discusses methods for analyzing the behavior of groups and the crowd once they have been detected, showing how to extract semantic information, predicting/tracking the movement of a group, the formation or disaggregation of a group/crowd and the identification of different kinds of groups/crowds depending on their behavior. The final section focuses on identifying and promoting datasets for group/crowd analysis and modeling, presenting and discussing metrics for evaluating the pros and cons of the various models and methods. This book gives computer vision researcher techniques for segmentation and grouping, tracking and reasoning for solving group and crowd modeling and analysis, as well as more general problems in computer vision and machine learning. - Presents the first book to cover the topic of modeling and analysis of groups in computer vision - Discusses the topics of group and crowd modeling from a cross-disciplinary perspective, using social science anthropological theories translated into computer vision algorithms - Focuses on group and crowd analysis metrics - Discusses real industrial systems dealing with the problem of analyzing groups and crowds
Publisher: Academic Press
ISBN: 0128092807
Category : Computers
Languages : en
Pages : 440
Book Description
Group and Crowd Behavior for Computer Vision provides a multidisciplinary perspective on how to solve the problem of group and crowd analysis and modeling, combining insights from the social sciences with technological ideas in computer vision and pattern recognition. The book answers many unresolved issues in group and crowd behavior, with Part One providing an introduction to the problems of analyzing groups and crowds that stresses that they should not be considered as completely diverse entities, but as an aggregation of people. Part Two focuses on features and representations with the aim of recognizing the presence of groups and crowds in image and video data. It discusses low level processing methods to individuate when and where a group or crowd is placed in the scene, spanning from the use of people detectors toward more ad-hoc strategies to individuate group and crowd formations. Part Three discusses methods for analyzing the behavior of groups and the crowd once they have been detected, showing how to extract semantic information, predicting/tracking the movement of a group, the formation or disaggregation of a group/crowd and the identification of different kinds of groups/crowds depending on their behavior. The final section focuses on identifying and promoting datasets for group/crowd analysis and modeling, presenting and discussing metrics for evaluating the pros and cons of the various models and methods. This book gives computer vision researcher techniques for segmentation and grouping, tracking and reasoning for solving group and crowd modeling and analysis, as well as more general problems in computer vision and machine learning. - Presents the first book to cover the topic of modeling and analysis of groups in computer vision - Discusses the topics of group and crowd modeling from a cross-disciplinary perspective, using social science anthropological theories translated into computer vision algorithms - Focuses on group and crowd analysis metrics - Discusses real industrial systems dealing with the problem of analyzing groups and crowds
Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Author: César Beltrán-Castañón
Publisher: Springer
ISBN: 3319522779
Category : Computers
Languages : en
Pages : 560
Book Description
This book constitutes the refereed post-conference proceedings of the 21st Iberoamerican Congress on Pattern Recognition, CIARP 2016, held in Lima, Peru, in November 2016. The 69 papers presented were carefully reviewed and selected from 131 submissions. The papers feature research results in the areas of pattern recognition, biometrics, image processing, computer vision, speech recognition, and remote sensing. They constitute theoretical as well as applied contributions in many fields related to the main topics of the conference.
Publisher: Springer
ISBN: 3319522779
Category : Computers
Languages : en
Pages : 560
Book Description
This book constitutes the refereed post-conference proceedings of the 21st Iberoamerican Congress on Pattern Recognition, CIARP 2016, held in Lima, Peru, in November 2016. The 69 papers presented were carefully reviewed and selected from 131 submissions. The papers feature research results in the areas of pattern recognition, biometrics, image processing, computer vision, speech recognition, and remote sensing. They constitute theoretical as well as applied contributions in many fields related to the main topics of the conference.
Proceedings of 2nd International Conference on Computer Vision & Image Processing
Author: Bidyut B. Chaudhuri
Publisher: Springer
ISBN: 981107898X
Category : Technology & Engineering
Languages : en
Pages : 405
Book Description
The book provides insights into the Second International Conference on Computer Vision & Image Processing (CVIP-2017) organized by Department of Computer Science and Engineering of Indian Institute of Technology Roorkee. The book presents technological progress and research outcomes in the area of image processing and computer vision. The topics covered in this book are image/video processing and analysis; image/video formation and display; image/video filtering, restoration, enhancement and super-resolution; image/video coding and transmission; image/video storage, retrieval and authentication; image/video quality; transform-based and multi-resolution image/video analysis; biological and perceptual models for image/video processing; machine learning in image/video analysis; probability and uncertainty handling for image/video processing; motion and tracking; segmentation and recognition; shape, structure and stereo.
Publisher: Springer
ISBN: 981107898X
Category : Technology & Engineering
Languages : en
Pages : 405
Book Description
The book provides insights into the Second International Conference on Computer Vision & Image Processing (CVIP-2017) organized by Department of Computer Science and Engineering of Indian Institute of Technology Roorkee. The book presents technological progress and research outcomes in the area of image processing and computer vision. The topics covered in this book are image/video processing and analysis; image/video formation and display; image/video filtering, restoration, enhancement and super-resolution; image/video coding and transmission; image/video storage, retrieval and authentication; image/video quality; transform-based and multi-resolution image/video analysis; biological and perceptual models for image/video processing; machine learning in image/video analysis; probability and uncertainty handling for image/video processing; motion and tracking; segmentation and recognition; shape, structure and stereo.
Image and Graphics
Author: Yao Zhao
Publisher: Springer Nature
ISBN: 3030341135
Category : Computers
Languages : en
Pages : 688
Book Description
This three-volume set LNCS 11901, 11902, and 11903 constitutes the refereed conference proceedings of the 10thth International Conference on Image and Graphics, ICIG 2019, held in Beijing, China, in August 2019. The 183 full papers presented were selected from 384 submissions and focus on advances of theory, techniques and algorithms as well as innovative technologies of image, video and graphics processing and fostering innovation, entrepreneurship, and networking.
Publisher: Springer Nature
ISBN: 3030341135
Category : Computers
Languages : en
Pages : 688
Book Description
This three-volume set LNCS 11901, 11902, and 11903 constitutes the refereed conference proceedings of the 10thth International Conference on Image and Graphics, ICIG 2019, held in Beijing, China, in August 2019. The 183 full papers presented were selected from 384 submissions and focus on advances of theory, techniques and algorithms as well as innovative technologies of image, video and graphics processing and fostering innovation, entrepreneurship, and networking.