Generative Deep Learning
Author: David Foster
Publisher: "O'Reilly Media, Inc."
ISBN: 1492041890
Category : Computers
Languages : en
Pages : 301
Book Description
Generative modeling is one of the hottest topics in AI. It’s now possible to teach a machine to excel at human endeavors such as painting, writing, and composing music. With this practical book, machine-learning engineers and data scientists will discover how to re-create some of the most impressive examples of generative deep learning models, such as variational autoencoders, generative adversarial networks (GANs), encoder-decoder models, and world models. Author David Foster demonstrates the inner workings of each technique, starting with the basics of deep learning before advancing to some of the most cutting-edge algorithms in the field. Through tips and tricks, you’ll understand how to make your models learn more efficiently and become more creative. You will discover how variational autoencoders can change facial expressions in photos; build practical GAN examples from scratch, including CycleGAN for style transfer and MuseGAN for music generation; create recurrent generative models for text generation and learn how to improve them using attention; understand how generative models can help agents accomplish tasks within a reinforcement learning setting; and explore the architecture of the Transformer (BERT, GPT-2) and image generation models such as ProGAN and StyleGAN.
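The claim that a variational autoencoder can "change facial expressions in photos" usually comes down to simple latent-space arithmetic. The sketch below illustrates that idea only; the encoder and decoder objects are hypothetical stand-ins (any trained VAE with Keras-style predict methods would do), not code from the book.

```python
import numpy as np

# Hypothetical trained VAE components (placeholders, not the book's code):
# encoder.predict(images)  -> latent codes of shape (n, latent_dim)
# decoder.predict(latents) -> reconstructed images

def attribute_vector(encoder, images_with, images_without):
    """Mean latent difference between images that have an attribute
    (e.g. smiling faces) and images that do not."""
    z_with = encoder.predict(images_with).mean(axis=0)
    z_without = encoder.predict(images_without).mean(axis=0)
    return z_with - z_without

def edit_image(encoder, decoder, image, attr_vec, strength=1.0):
    """Encode one image, shift its latent code along the attribute
    direction, and decode the edited result."""
    z = encoder.predict(image[np.newaxis, ...])
    return decoder.predict(z + strength * attr_vec)[0]
```

Adding the attribute vector with a positive strength pushes the decoded face toward the attribute (a smile), while a negative strength pushes it away.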
Publisher: "O'Reilly Media, Inc."
ISBN: 1492041890
Category : Computers
Languages : en
Pages : 301
Book Description
Generative modeling is one of the hottest topics in AI. It’s now possible to teach a machine to excel at human endeavors such as painting, writing, and composing music. With this practical book, machine-learning engineers and data scientists will discover how to re-create some of the most impressive examples of generative deep learning models, such as variational autoencoders,generative adversarial networks (GANs), encoder-decoder models and world models. Author David Foster demonstrates the inner workings of each technique, starting with the basics of deep learning before advancing to some of the most cutting-edge algorithms in the field. Through tips and tricks, you’ll understand how to make your models learn more efficiently and become more creative. Discover how variational autoencoders can change facial expressions in photos Build practical GAN examples from scratch, including CycleGAN for style transfer and MuseGAN for music generation Create recurrent generative models for text generation and learn how to improve the models using attention Understand how generative models can help agents to accomplish tasks within a reinforcement learning setting Explore the architecture of the Transformer (BERT, GPT-2) and image generation models such as ProGAN and StyleGAN
Generative AI and LLMs
Author: S. Balasubramaniam
Publisher: Walter de Gruyter GmbH & Co KG
ISBN: 311142507X
Category : Computers
Languages : en
Pages : 290
Book Description
Generative artificial intelligence (GAI) and large language models (LLMs) are machine learning algorithms that operate in an unsupervised or semi-supervised manner. These algorithms leverage pre-existing content, such as text, photos, audio, video, and code, to generate novel content, and there is no inherent limit on how much new material they can produce. New material can be generated through Application Programming Interfaces (APIs) or natural language interfaces, such as ChatGPT, developed by OpenAI, and Bard, developed by Google. The field of generative AI stands out for having developed and matured in a highly transparent manner, with its progress observed by the public at large. The current era of artificial intelligence is shaped by the imperative to use these capabilities effectively to enhance corporate operations. In particular, large language model capabilities, which fall under the category of generative AI, hold the potential to redefine the limits of innovation and productivity. However, as firms strive to adopt new technologies, they risk compromising data privacy, long-term competitiveness, and environmental sustainability. This book explores generative artificial intelligence and LLMs. It examines the historical and evolutionary development of generative AI models, as well as the challenges and issues that have emerged from these models and LLMs. It also discusses the need for generative AI-based systems and the various training methods developed for generative AI models, including LLM pretraining, LLM fine-tuning, and reinforcement learning from human feedback. Additionally, it explores the potential use cases, applications, and ethical considerations associated with these models. The book concludes by discussing future directions in generative AI and presenting case studies that highlight applications of generative AI and LLMs.
Human + Machine
Author: Paul R. Daugherty
Publisher: Harvard Business Press
ISBN: 1633693872
Category : Computers
Languages : en
Pages : 268
Book Description
AI is radically transforming business. Are you ready? Look around you. Artificial intelligence is no longer just a futuristic notion. It's here right now--in software that senses what we need, supply chains that "think" in real time, and robots that respond to changes in their environment. Twenty-first-century pioneer companies are already using AI to innovate and grow fast. The bottom line is this: Businesses that understand how to harness AI can surge ahead. Those that neglect it will fall behind. Which side are you on? In Human + Machine, Accenture leaders Paul R. Daugherty and H. James (Jim) Wilson show that the essence of the AI paradigm shift is the transformation of all business processes within an organization--whether related to breakthrough innovation, everyday customer service, or personal productivity habits. As humans and smart machines collaborate ever more closely, work processes become more fluid and adaptive, enabling companies to change them on the fly--or to completely reimagine them. AI is changing all the rules of how companies operate. Based on the authors' experience and research with 1,500 organizations, the book reveals how companies are using the new rules of AI to leap ahead on innovation and profitability, as well as what you can do to achieve similar results. It describes six entirely new types of hybrid human + machine roles that every company must develop, and it includes a "leader’s guide" with the five crucial principles required to become an AI-fueled business. Human + Machine provides the missing and much-needed management playbook for success in our new age of AI. Book proceeds for the AI generation: the authors' goal in publishing Human + Machine is to help executives, workers, students and others navigate the changes that AI is making to business and the economy. They believe AI will bring innovations that truly improve the way the world works and lives. However, AI will cause disruption, and many people will need education, training and support to prepare for the newly created jobs. To support this need, the authors are donating the royalties received from the sale of this book to fund education and retraining programs focused on developing fusion skills for the age of artificial intelligence.
The Executive Guide to Artificial Intelligence
Author: Andrew Burgess
Publisher: Springer
ISBN: 3319638203
Category : Business & Economics
Languages : en
Pages : 187
Book Description
This book takes a pragmatic and hype-free approach to explaining artificial intelligence and how it can be utilised by businesses today. At the core of the book is a framework, developed by the author, which describes in non-technical language the eight core capabilities of Artificial Intelligence (AI). Each of these capabilities, ranging from image recognition, through natural language processing, to prediction, is explained using real-life examples of how they can be applied in a business environment. It includes interviews with executives who have successfully implemented AI as well as CEOs from AI vendors and consultancies. AI is one of the most talked about technologies in business today. It has the ability to deliver step-change benefits to organisations and enables forward-thinking CEOs to rethink their business models or create completely new businesses. But most of the real value of AI is hidden behind marketing hyperbole, confusing terminology, inflated expectations and dire warnings of ‘robot overlords’. Any business executive who wants to know how to exploit AI in their business today is left confused and frustrated. As an advisor in Artificial Intelligence, Andrew Burgess regularly comes face-to-face with business executives who are struggling to cut through the hype that surrounds AI. The knowledge and experience he has gained in advising them, as well as working as a strategic advisor to AI vendors and consultancies, has provided him with the skills to help business executives understand what AI is and how they can exploit its many benefits. Through the distilled knowledge included in this book, business leaders will be able to take full advantage of this most disruptive of technologies and create substantial competitive advantage for their companies.
Machine Learning with PyTorch and Scikit-Learn
Author: Sebastian Raschka
Publisher: Packt Publishing Ltd
ISBN: 1801816387
Category : Computers
Languages : en
Pages : 775
Book Description
This book of the bestselling and widely acclaimed Python Machine Learning series is a comprehensive guide to machine and deep learning using PyTorch's simple-to-code framework. Purchase of the print or Kindle book includes a free eBook in PDF format. Key features: learn applied machine learning with a solid foundation in theory; clear, intuitive explanations take you deep into the theory and practice of Python machine learning; fully updated and expanded to cover PyTorch, transformers, XGBoost, graph neural networks, and best practices. Machine Learning with PyTorch and Scikit-Learn is a comprehensive guide to machine learning and deep learning with PyTorch. It acts as both a step-by-step tutorial and a reference you'll keep coming back to as you build your machine learning systems. Packed with clear explanations, visualizations, and examples, the book covers all the essential machine learning techniques in depth. While some books teach you only to follow instructions, this machine learning book teaches the principles, allowing you to build models and applications for yourself. Why PyTorch? PyTorch is the Pythonic way to learn machine learning, making it easier to learn and simpler to code with. This book explains the essential parts of PyTorch and how to create models using popular libraries such as PyTorch Lightning and PyTorch Geometric. You will also learn about generative adversarial networks (GANs) for generating new data and training intelligent agents with reinforcement learning. Finally, this new edition is expanded to cover the latest trends in deep learning, including graph neural networks and large-scale transformers used for natural language processing (NLP). This PyTorch book is your companion to machine learning with Python, whether you're a Python developer new to machine learning or want to deepen your knowledge of the latest developments. What you will learn: explore frameworks, models, and techniques for machines to learn from data; use scikit-learn for machine learning and PyTorch for deep learning; train machine learning classifiers on images, text, and more; build and train neural networks, transformers, and boosting algorithms; discover best practices for evaluating and tuning models; predict continuous target outcomes using regression analysis; and dig deeper into textual and social media data using sentiment analysis. Who this book is for: if you have a good grasp of Python basics and want to start learning about machine learning and deep learning, then this is the book for you. This is an essential resource written for developers and data scientists who want to create practical machine learning and deep learning applications using scikit-learn and PyTorch. Before you get started with this book, you’ll need a good understanding of calculus, as well as linear algebra.
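The pairing the description highlights, scikit-learn for classical machine learning and PyTorch for deep learning, can be made concrete with a minimal sketch on a toy dataset. This is a generic illustration of that workflow, not code from the book.

```python
import torch
from torch import nn
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Classical ML with scikit-learn: fit a logistic regression classifier.
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print("scikit-learn accuracy:", clf.score(X_test, y_test))

# Deep learning with PyTorch: the same task with a small feed-forward net.
model = nn.Sequential(nn.Linear(4, 16), nn.ReLU(), nn.Linear(16, 3))
optimizer = torch.optim.Adam(model.parameters(), lr=0.05)
loss_fn = nn.CrossEntropyLoss()

X_t = torch.tensor(X_train, dtype=torch.float32)
y_t = torch.tensor(y_train, dtype=torch.long)
for _ in range(200):  # short training loop on the full training set
    optimizer.zero_grad()
    loss = loss_fn(model(X_t), y_t)
    loss.backward()
    optimizer.step()

with torch.no_grad():
    preds = model(torch.tensor(X_test, dtype=torch.float32)).argmax(dim=1)
    print("PyTorch accuracy:", (preds.numpy() == y_test).mean())
```

Both halves solve the same classification problem, which is roughly how the book moves readers from estimator-style APIs to explicit training loops.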
Natural Language Processing with Transformers, Revised Edition
Author: Lewis Tunstall
Publisher: "O'Reilly Media, Inc."
ISBN: 1098136764
Category : Computers
Languages : en
Pages : 409
Book Description
Since their introduction in 2017, transformers have quickly become the dominant architecture for achieving state-of-the-art results on a variety of natural language processing tasks. If you're a data scientist or coder, this practical book, now revised in full color, shows you how to train and scale these large models using Hugging Face Transformers, a Python-based deep learning library. Transformers have been used to write realistic news stories, improve Google Search queries, and even create chatbots that tell corny jokes. In this guide, authors Lewis Tunstall, Leandro von Werra, and Thomas Wolf, among the creators of Hugging Face Transformers, use a hands-on approach to teach you how transformers work and how to integrate them in your applications. You'll quickly learn a variety of tasks they can help you solve: build, debug, and optimize transformer models for core NLP tasks such as text classification, named entity recognition, and question answering; learn how transformers can be used for cross-lingual transfer learning; apply transformers in real-world scenarios where labeled data is scarce; make transformer models efficient for deployment using techniques such as distillation, pruning, and quantization; and train transformers from scratch and learn how to scale to multiple GPUs and distributed environments.
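As a sense of the library the book is built around, a text-classification pipeline in Hugging Face Transformers takes only a few lines. This is a generic illustration (the default sentiment model is downloaded from the Hub on first use), not an excerpt from the book.

```python
from transformers import pipeline

# Load a default text-classification (sentiment analysis) pipeline;
# model weights are fetched from the Hugging Face Hub on first use.
classifier = pipeline("text-classification")

result = classifier("Transformers make state-of-the-art NLP surprisingly accessible.")
print(result)  # e.g. [{'label': 'POSITIVE', 'score': 0.99...}]
```

The same pipeline() entry point accepts other task names (for example "ner" or "question-answering"), which is the pattern the book builds on before digging into the underlying models.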
Publisher: "O'Reilly Media, Inc."
ISBN: 1098136764
Category : Computers
Languages : en
Pages : 409
Book Description
Since their introduction in 2017, transformers have quickly become the dominant architecture for achieving state-of-the-art results on a variety of natural language processing tasks. If you're a data scientist or coder, this practical book -now revised in full color- shows you how to train and scale these large models using Hugging Face Transformers, a Python-based deep learning library. Transformers have been used to write realistic news stories, improve Google Search queries, and even create chatbots that tell corny jokes. In this guide, authors Lewis Tunstall, Leandro von Werra, and Thomas Wolf, among the creators of Hugging Face Transformers, use a hands-on approach to teach you how transformers work and how to integrate them in your applications. You'll quickly learn a variety of tasks they can help you solve. Build, debug, and optimize transformer models for core NLP tasks, such as text classification, named entity recognition, and question answering Learn how transformers can be used for cross-lingual transfer learning Apply transformers in real-world scenarios where labeled data is scarce Make transformer models efficient for deployment using techniques such as distillation, pruning, and quantization Train transformers from scratch and learn how to scale to multiple GPUs and distributed environments
Generative AI with Large Language Models: A Comprehensive Guide
Author: Anand Vemula
Publisher: Anand Vemula
ISBN:
Category : Computers
Languages : en
Pages : 43
Book Description
This book delves into the fascinating world of Generative AI, exploring the two key technologies driving its advancements: Large Language Models (LLMs) and Foundation Models (FMs).
Part 1, Foundations. LLMs demystified: we begin by understanding LLMs, powerful AI models trained on massive amounts of text data. These models can generate human-quality text, translate languages, write in different creative formats, and even answer your questions in an informative way. The rise of FMs: LLMs, however, are just a piece of the puzzle. We explore Foundation Models, a broader category encompassing models trained on various data types such as images, audio, and even scientific data. These models represent a significant leap forward in AI, offering a more versatile approach to information processing.
Part 2, LLMs and Generative AI Applications. Training LLMs: we delve into the intricate process of training LLMs, from data acquisition and pre-processing to different training techniques such as supervised and unsupervised learning. The chapter also explores challenges such as computational resources and data bias, along with best practices for responsible LLM training. Fine-tuning for specific tasks: LLMs can be further specialized for targeted tasks through fine-tuning. We explore how fine-tuning allows LLMs to excel in areas such as creative writing, code generation, drug discovery, and even music composition.
Part 3, Advanced Topics. LLM architectures: we take a deep dive into the technical aspects of LLMs, exploring the workings of Transformer networks, the backbone of modern LLMs. We also examine the role of attention mechanisms in LLM processing and learn about prominent LLM architectures such as GPT-3 and Jurassic-1 Jumbo. Scaling generative AI: scaling up LLMs presents significant computational challenges. The chapter explores techniques such as model parallelism and distributed training to address these hurdles, along with hardware considerations such as GPUs and TPUs that facilitate efficient LLM training. Most importantly, we discuss the crucial role of safety and ethics in generative AI development: mitigating bias, addressing potential risks such as deepfakes, and ensuring transparency are all essential for responsible AI development.
Part 4, The Future. The evolving generative AI landscape: we explore emerging trends in LLM research, such as the development of even larger and more capable models, advancements in explainable AI, and the rise of multimodal LLMs that can handle different data types. We also discuss potential applications of generative AI in unforeseen areas such as personalized education and healthcare. Societal impact and the future of work: the book concludes by examining the societal and economic implications of generative AI, including the potential transformation of industries, the need for workforce reskilling, and the importance of human-AI collaboration. Additionally, the book emphasizes the need for robust regulations to address concerns around bias, data privacy, and transparency in generative AI development.
This book equips you with a comprehensive understanding of generative AI, its core technologies, its applications, and the considerations for its responsible development and deployment.
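The scaling discussion mentions model parallelism and distributed training; the sketch below shows the simpler of the two, distributed data-parallel training with PyTorch's DistributedDataParallel. It is a generic toy example under stated assumptions (launched with torchrun so the rank environment variables are set, CPU/gloo backend, random data), not material from the book.

```python
import torch
import torch.distributed as dist
from torch import nn
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    # torchrun sets RANK, LOCAL_RANK, and WORLD_SIZE for each process.
    dist.init_process_group(backend="gloo")  # use "nccl" on multi-GPU nodes
    rank = dist.get_rank()

    model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 1))
    ddp_model = DDP(model)  # gradients are averaged across all processes

    optimizer = torch.optim.SGD(ddp_model.parameters(), lr=0.01)
    loss_fn = nn.MSELoss()

    for step in range(10):
        x = torch.randn(16, 32)  # in practice each rank reads its own data shard
        y = torch.randn(16, 1)
        optimizer.zero_grad()
        loss = loss_fn(ddp_model(x), y)
        loss.backward()          # DDP synchronizes gradients here
        optimizer.step()
        if rank == 0 and step % 5 == 0:
            print(f"step {step}, loss {loss.item():.4f}")

    dist.destroy_process_group()

if __name__ == "__main__":
    main()  # run with: torchrun --nproc_per_node=2 this_script.py
```

Model parallelism, by contrast, splits a single model's layers or tensors across devices; that requires more machinery (pipeline or tensor-parallel frameworks) than this short sketch covers.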
Demystifying Large Language Models
Author: James Chen
Publisher: James Chen
ISBN: 1738908461
Category : Computers
Languages : en
Pages : 300
Book Description
This book is a comprehensive guide aiming to demystify the world of transformers, the architecture that powers Large Language Models (LLMs) like GPT and BERT. From PyTorch basics and mathematical foundations to implementing a Transformer from scratch, you'll gain a deep understanding of the inner workings of these models. And that's just the beginning: you will pre-train your own Transformer from scratch, unlock the power of transfer learning to fine-tune LLMs for your specific use cases, and explore advanced techniques such as PEFT (Parameter-Efficient Fine-Tuning) and LoRA (Low-Rank Adaptation) for fine-tuning, as well as RLHF (Reinforcement Learning from Human Feedback) for detoxifying LLMs and aligning them with human values and ethical norms. The book then steps into the deployment of LLMs, bringing these state-of-the-art language models into the real world; whether you are integrating them into cloud platforms or optimizing them for edge devices, this section ensures you're equipped with the know-how to bring your AI solutions to life. Whether you're a seasoned AI practitioner, a data scientist, or a curious developer eager to advance your knowledge of powerful LLMs, this book is your guide to mastering these cutting-edge models. By translating convoluted concepts into understandable explanations and offering a practical hands-on approach, it is valuable to both aspiring beginners and seasoned professionals.
Table of Contents:
1. Introduction: What is AI, ML, DL, Generative AI and Large Language Model; Lifecycle of Large Language Models; Whom This Book Is For; How This Book Is Organized; Source Code and Resources.
2. PyTorch Basics and Math Fundamentals: Tensor and Vector; Tensor and Matrix; Dot Product; Softmax; Cross Entropy; GPU Support; Linear Transformation; Embedding; Neural Network; Bigram and N-gram Models; Greedy, Random Sampling and Beam; Rank of Matrices; Singular Value Decomposition (SVD); Conclusion.
3. Transformer: Dataset and Tokenization; Embedding; Positional Encoding; Layer Normalization; Feed Forward; Scaled Dot-Product Attention; Mask; Multi-Head Attention; Encoder Layer and Encoder; Decoder Layer and Decoder; Transformer; Training; Inference; Conclusion.
4. Pre-Training: Machine Translation; Dataset and Tokenization; Load Data in Batch; Pre-Training nn.Transformer Model; Inference; Popular Large Language Models; Computational Resources; Prompt Engineering and In-context Learning (ICL); Prompt Engineering on FLAN-T5; Pipelines; Conclusion.
5. Fine-Tuning: Fine-Tuning; Parameter Efficient Fine-tuning (PEFT); Low-Rank Adaptation (LoRA); Adapter; Prompt Tuning; Evaluation; Reinforcement Learning; Reinforcement Learning from Human Feedback (RLHF); Implementation of RLHF; Conclusion.
6. Deployment of LLMs: Challenges and Considerations; Pre-Deployment Optimization; Security and Privacy; Deployment Architectures; Scalability and Load Balancing; Compliance and Ethics Review; Model Versioning and Updates; LLM-Powered Applications; Vector Database; LangChain; Chatbot, Example of LLM-Powered Application; WebUI, Example of LLM-Powered Application; Future Trends and Challenges; Conclusion.
References. About the Author.
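Since the table of contents centers on building a Transformer from scratch in PyTorch, a minimal version of scaled dot-product attention (chapter 3) shows the kind of component involved. This is a generic implementation of the standard formula softmax(QK^T / sqrt(d_k))V, not the book's own code.

```python
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    """q, k, v: tensors of shape (batch, seq_len, d_k); mask, if given,
    is broadcastable to (batch, seq_len, seq_len) with 0 where attention
    should be blocked (e.g. padding or future positions)."""
    d_k = q.size(-1)
    scores = torch.matmul(q, k.transpose(-2, -1)) / math.sqrt(d_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = torch.softmax(scores, dim=-1)   # attention weights over keys
    return torch.matmul(weights, v), weights

# Example: a batch of 2 random sequences of length 5 with d_k = 64.
q = k = v = torch.randn(2, 5, 64)
out, attn = scaled_dot_product_attention(q, k, v)
print(out.shape, attn.shape)  # torch.Size([2, 5, 64]) torch.Size([2, 5, 5])
```

Multi-head attention then applies this function in parallel over several learned projections of q, k, and v and concatenates the results.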
The Definitive Guide to Conversational AI with Dialogflow and Google Cloud
Author: Lee Boonstra
Publisher: Apress
ISBN: 9781484270134
Category : Computers
Languages : en
Pages : 405
Book Description
Build enterprise chatbots for web, social media, voice assistants, IoT, and telephony contact centers with Google's Dialogflow conversational AI technology. This book explains how to get started with conversational AI using Google and how enterprise users can use Dialogflow as part of Google Cloud. It covers core concepts such as Dialogflow essentials, deploying chatbots on web and social media channels, and building voice agents, including advanced tips and tricks such as intents, entities, and working with context. The Definitive Guide to Conversational AI with Dialogflow and Google Cloud also explains how to build multilingual chatbots, orchestrate sub-chatbots into a bigger conversational platform, use virtual agent analytics with popular tools such as BigQuery or Chatbase, and build voice bots. It concludes with more advanced use cases, such as building fulfillment functionality, building your own integrations, securing your chatbots, and building your own voice platform with the Dialogflow SDK and other Google Cloud machine learning APIs. After reading this book, you will understand how to build cross-channel enterprise bots with popular Google tools such as Dialogflow, Google Cloud AI, Cloud Run, Cloud Functions, and Chatbase. What you will learn: discover Dialogflow, Dialogflow Essentials, Dialogflow CX, and how machine learning is used; create Dialogflow projects for individuals and enterprise usage; work with Dialogflow essential concepts such as intents, entities, custom entities, system entities, composites, and how to track context; build bots quickly using prebuilt agents, small talk modules, and FAQ knowledge bases; use Dialogflow for an out-of-the-box agent review; deploy text conversational UIs for web and social media channels; build voice agents for voice assistants, phone gateways, and contact centers; create multilingual chatbots; orchestrate many sub-chatbots to build a bigger conversational platform; use chatbot analytics and test the quality of your Dialogflow agent; and see the new Dialogflow CX concepts, how Dialogflow CX fits in, and what’s different in Dialogflow CX. Who this book is for: everyone interested in building chatbots for web, social media, voice assistants, or contact centers using Google’s conversational AI/cloud technology.
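For a flavor of the kind of integration work the book covers, the sketch below sends a single text query to a Dialogflow ES agent and prints the matched intent. It is a minimal sketch assuming the google-cloud-dialogflow (v2) Python client with application-default credentials; the project ID, session ID, and query text are placeholders, and it is not code from the book.

```python
from google.cloud import dialogflow

def detect_intent_text(project_id: str, session_id: str, text: str, language: str = "en"):
    """Send one user utterance to a Dialogflow ES agent and print the result."""
    client = dialogflow.SessionsClient()
    session = client.session_path(project_id, session_id)

    query_input = dialogflow.QueryInput(
        text=dialogflow.TextInput(text=text, language_code=language)
    )
    response = client.detect_intent(
        request={"session": session, "query_input": query_input}
    )
    result = response.query_result
    print("Matched intent:", result.intent.display_name)
    print("Fulfillment text:", result.fulfillment_text)

if __name__ == "__main__":
    # Placeholder project and session identifiers for illustration only.
    detect_intent_text("my-gcp-project", "test-session-123", "Hi there!")
```

The session ID groups turns of one conversation so the agent can track context across successive detect_intent calls.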
The Myth of Artificial Intelligence
Author: Erik J. Larson
Publisher: Harvard University Press
ISBN: 0674983513
Category : Computers
Languages : en
Pages : 321
Book Description
“Artificial intelligence has always inspired outlandish visions—that AI is going to destroy us, save us, or at the very least radically transform us. Erik Larson exposes the vast gap between the actual science underlying AI and the dramatic claims being made for it. This is a timely, important, and even essential book.” —John Horgan, author of The End of Science Many futurists insist that AI will soon achieve human levels of intelligence. From there, it will quickly eclipse the most gifted human mind. The Myth of Artificial Intelligence argues that such claims are just that: myths. We are not on the path to developing truly intelligent machines. We don’t even know where that path might be. Erik Larson charts a journey through the landscape of AI, from Alan Turing’s early work to today’s dominant models of machine learning. Since the beginning, AI researchers and enthusiasts have equated the reasoning approaches of AI with those of human intelligence. But this is a profound mistake. Even cutting-edge AI looks nothing like human intelligence. Modern AI is based on inductive reasoning: computers make statistical correlations to determine which answer is likely to be right, allowing software to, say, detect a particular face in an image. But human reasoning is entirely different. Humans do not correlate data sets; we make conjectures sensitive to context—the best guess, given our observations and what we already know about the world. We haven’t a clue how to program this kind of reasoning, known as abduction. Yet it is the heart of common sense. Larson argues that all this AI hype is bad science and bad for science. A culture of invention thrives on exploring unknowns, not overselling existing methods. Inductive AI will continue to improve at narrow tasks, but if we are to make real progress, we must abandon futuristic talk and learn to better appreciate the only true intelligence we know—our own.