On Exploiting Location Flexibility in Data-intensive Distributed Systems

On Exploiting Location Flexibility in Data-intensive Distributed Systems PDF Author: Boyang Yu
Publisher:
ISBN:
Category :
Languages : en
Pages :

Book Description
With the fast growth of data-intensive distributed systems today, more novel and principled approaches are needed to improve the system efficiency, ensure the service quality to satisfy the user requirements, and lower the system running cost. This dissertation studies the design issues in the data-intensive distributed systems, which are differentiated from other systems by the heavy workload of data movement and are characterized by the fact that the destination of each data flow is limited to a subset of available locations, such as those servers holding the requested data. Besides, even among the feasible subset, different locations may result in different performance.The studies in this dissertation improve the data-intensive systems by exploiting the data storage location flexibility. It addresses how to reasonably determine the data placement based on the measured request patterns, to improve a series of performance metrics, such as the data access latency, system throughput and various costs, by the proposed hypergraph models for data placement. To implement the proposal with a lower overhead, a sketch-based data placement scheme is presented, which constructs the sparsified hypergraph under a distributed and streaming-based system model, achieving a good approximation on the performance improvement. As the network can potentially become the bottleneck of distributed data-intensive systems due to the frequent data movement among storage nodes, the online data placement by reinforcement learning is proposed which intelligently determines the storage locations of each data item at the moment that the item is going to be written or updated, with the joint-awareness of network conditions and request patterns. Meanwhile, noticing that distributed memory caches are effective measures in lowering the workload to the backend storage systems, the auto-scaling of memory cache clusters is studied, which tries to balance the energy cost of the service and the performance ensured.As the outcome of this dissertation, the designed schemes and methods essentially help to improve the running efficiency of data-intensive distributed systems. Therefore, they can either help to improve the user-perceived service quality under the same level of system resource investment, or help to lower the monetary expense and energy consumption in maintaining the system under the same performance standard. From the two perspectives, both the end users and the system providers could obtain benefits from the results of the studies.

Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management

Data Intensive Distributed Computing: Challenges and Solutions for Large-scale Information Management PDF Author: Kosar, Tevfik
Publisher: IGI Global
ISBN: 1615209727
Category : Computers
Languages : en
Pages : 353

Book Description
"This book focuses on the challenges of distributed systems imposed by the data intensive applications, and on the different state-of-the-art solutions proposed to overcome these challenges"--Provided by publisher.

Data Intensive Computing Applications for Big Data

Data Intensive Computing Applications for Big Data PDF Author: M. Mittal
Publisher: IOS Press
ISBN: 1614998140
Category : Computers
Languages : en
Pages : 618

Book Description
The book ‘Data Intensive Computing Applications for Big Data’ discusses the technical concepts of big data, data intensive computing through machine learning, soft computing and parallel computing paradigms. It brings together researchers to report their latest results or progress in the development of the above mentioned areas. Since there are few books on this specific subject, the editors aim to provide a common platform for researchers working in this area to exhibit their novel findings. The book is intended as a reference work for advanced undergraduates and graduate students, as well as multidisciplinary, interdisciplinary and transdisciplinary research workers and scientists on the subjects of big data and cloud/parallel and distributed computing, and explains didactically many of the core concepts of these approaches for practical applications. It is organized into 24 chapters providing a comprehensive overview of big data analysis using parallel computing and addresses the complete data science workflow in the cloud, as well as dealing with privacy issues and the challenges faced in a data-intensive cloud computing environment. The book explores both fundamental and high-level concepts, and will serve as a manual for those in the industry, while also helping beginners to understand the basic and advanced aspects of big data and cloud computing.

Intelligent Agents in Data-intensive Computing

Intelligent Agents in Data-intensive Computing PDF Author: Joanna Kołodziej
Publisher: Springer
ISBN: 331923742X
Category : Technology & Engineering
Languages : en
Pages : 231

Book Description
This book presents new approaches that advance research in all aspects of agent-based models, technologies, simulations and implementations for data intensive applications. The nine chapters contain a review of recent cross-disciplinary approaches in cloud environments and multi-agent systems, and important formulations of data intensive problems in distributed computational environments together with the presentation of new agent-based tools to handle those problems and Big Data in general. This volume can serve as a reference for students, researchers and industry practitioners working in or interested in joining interdisciplinary work in the areas of data intensive computing and Big Data systems using emergent large-scale distributed computing paradigms. It will also allow newcomers to grasp key concepts and potential solutions on advanced topics of theory, models, technologies, system architectures and implementation of applications in Multi-Agent systems and data intensive computing.

Principles of Distributed Database Systems

Principles of Distributed Database Systems PDF Author: M. Tamer Özsu
Publisher: Springer Nature
ISBN: 3030262537
Category : Computers
Languages : en
Pages : 674

Book Description
The fourth edition of this classic textbook provides major updates. This edition has completely new chapters on Big Data Platforms (distributed storage systems, MapReduce, Spark, data stream processing, graph analytics) and on NoSQL, NewSQL and polystore systems. It also includes an updated web data management chapter that includes RDF and semantic web discussion, an integrated database integration chapter focusing both on schema integration and querying over these systems. The peer-to-peer computing chapter has been updated with a discussion of blockchains. The chapters that describe classical distributed and parallel database technology have all been updated. The new edition covers the breadth and depth of the field from a modern viewpoint. Graduate students, as well as senior undergraduate students studying computer science and other related fields will use this book as a primary textbook. Researchers working in computer science will also find this textbook useful. This textbook has a companion web site that includes background information on relational database fundamentals, query processing, transaction management, and computer networks for those who might need this background. The web site also includes all the figures and presentation slides as well as solutions to exercises (restricted to instructors).

Scientific and Technical Aerospace Reports

Scientific and Technical Aerospace Reports PDF Author:
Publisher:
ISBN:
Category : Aeronautics
Languages : en
Pages : 976

Book Description
Lists citations with abstracts for aerospace related reports obtained from world wide sources and announces documents that have recently been entered into the NASA Scientific and Technical Information Database.

Data Intensive Storage Services for Cloud Environments

Data Intensive Storage Services for Cloud Environments PDF Author: Kyriazis, Dimosthenis
Publisher: IGI Global
ISBN: 1466639350
Category : Computers
Languages : en
Pages : 342

Book Description
With the evolution of digitized data, our society has become dependent on services to extract valuable information and enhance decision making by individuals, businesses, and government in all aspects of life. Therefore, emerging cloud-based infrastructures for storage have been widely thought of as the next generation solution for the reliance on data increases. Data Intensive Storage Services for Cloud Environments provides an overview of the current and potential approaches towards data storage services and its relationship to cloud environments. This reference source brings together research on storage technologies in cloud environments and various disciplines useful for both professionals and researchers.

Computer-Assisted Management and Control of Manufacturing Systems

Computer-Assisted Management and Control of Manufacturing Systems PDF Author: Spyros G. Tzafestas
Publisher: Springer Science & Business Media
ISBN: 1447109597
Category : Technology & Engineering
Languages : en
Pages : 504

Book Description
Modem manufacturing systems involve many processes and operations that can be monitored and controlled at several levels of intelligence. At the highest level there is a computer that supervises the various manufacturing functions, whereas at the lowest level there are stand alone computer controlled systems of manufacturing processes and robotic cells. Until recenty computer-aided manufacturing systems constituted isolated "islands" of automation, each oriented to a particular application, but present day systems offer integrated approaches to manufacturing and enterprise operations. These modem systems, known as computer-integrated manufacturing (CIM) systems, can easily meet the current performance and manufacturing competitiveness requirements under strong environmental changes. CIM systems are much of a challenge, and imply a systemic approach to the design and operation of a manufacturing enterprise. Actualy, a CIM system must take into account in a unified way the following three views : the user view, the technology view, and the enterprise view. This means that CIM includes both the engineering and enterprise planning and control activities, as well as the information flow activities across all the stages of the system.

Applying Integration Techniques and Methods in Distributed Systems and Technologies

Applying Integration Techniques and Methods in Distributed Systems and Technologies PDF Author: Kecskemeti, Gabor
Publisher: IGI Global
ISBN: 1522582967
Category : Computers
Languages : en
Pages : 351

Book Description
Distributed systems intertwine with our everyday lives. The benefits and current shortcomings of the underpinning technologies are experienced by a wide range of people and their smart devices. With the rise of large-scale IoT and similar distributed systems, cloud bursting technologies, and partial outsourcing solutions, private entities are encouraged to increase their efficiency and offer unparalleled availability and reliability to their users. Applying Integration Techniques and Methods in Distributed Systems is a critical scholarly publication that defines the current state of distributed systems, determines further goals, and presents architectures and service frameworks to achieve highly integrated distributed systems and presents solutions to integration and efficient management challenges faced by current and future distributed systems. Highlighting topics such as multimedia, programming languages, and smart environments, this book is ideal for system administrators, integrators, designers, developers, researchers, and academicians.

Data-Intensive Computing

Data-Intensive Computing PDF Author: Ian Gorton
Publisher: Cambridge University Press
ISBN: 0521191955
Category : Computers
Languages : en
Pages : 299

Book Description
Describes principles of the emerging field of data-intensive computing, along with methods for designing, managing and analyzing the big data sets of today.