Data Engineering with AWS Cookbook

A recipe-based approach to help you tackle data engineering problems with AWS services

Product type Paperback
Published in Nov 2024
Publisher Packt
ISBN-13 9781805127284
Length 528 pages
Edition 1st Edition
Authors (4):
  • Viquar Khan
  • Gonzalo Herreros González
  • Huda Nofal
  • Trâm Ngọc Phạm
Table of Contents

  • Preface
  • Chapter 1: Managing Data Lake Storage
  • Chapter 2: Sharing Your Data Across Environments and Accounts
  • Chapter 3: Ingesting and Transforming Your Data with AWS Glue
  • Chapter 4: A Deep Dive into AWS Orchestration Frameworks
  • Chapter 5: Running Big Data Workloads with Amazon EMR
  • Chapter 6: Governing Your Platform
  • Chapter 7: Data Quality Management
  • Chapter 8: DevOps – Defining IaC and Building CI/CD Pipelines
  • Chapter 9: Monitoring Data Lake Cloud Infrastructure
  • Chapter 10: Building a Serving Layer with AWS Analytics Services
  • Chapter 11: Migrating to AWS – Steps, Strategies, and Best Practices for Modernizing Your Analytics and Big Data Workloads
  • Chapter 12: Harnessing the Power of AWS for Seamless Data Warehouse Migration
  • Chapter 13: Strategizing Hadoop Migrations – Cost, Data, and Workflow Modernization with AWS
  • Index
  • Other Books You May Enjoy

Preface

Hello and welcome! In today’s rapidly evolving data landscape, managing, migrating, and governing large-scale data systems are among the top priorities for data engineers. This book serves as a comprehensive guide to help you navigate these essential tasks, with a focus on three key pillars of modern data engineering:

  • Hadoop and data warehouse migration: Organizations are increasingly moving from traditional Hadoop clusters and on-premises data warehouses to more scalable, cloud-based data platforms. This book walks you through the best practices, methodologies, and tools for migrating large-scale data systems while ensuring data consistency, minimal downtime, and scalable performance.
  • Data lake operations: Building and maintaining a data lake in today’s multi-cloud, big data environment is complex and demands a strong operational strategy. This book covers how to ingest, transform, and manage data at scale using AWS services such as S3, Glue, and Athena. You will learn how to structure and maintain a robust data lake architecture that supports the varied needs of data analysts, data scientists, and business users alike.
  • Data lake governance: Managing and governing your data lake involves more than just operational efficiency; it requires stringent security protocols, data quality controls, and compliance measures. With the explosion of data, it’s more important than ever to have clear governance frameworks in place. This book delves into the best practices for implementing governance strategies using services such as AWS Lake Formation and Glue, together with AWS security frameworks. You’ll also learn about setting up policies that keep your data lake compliant with industry regulations while maintaining scalability and flexibility.

This cookbook is tailored to data engineers who are looking to implement best practices and take their cloud data platforms to the next level. Throughout this book, you’ll find practical examples, detailed recipes, and real-world scenarios from the authors’ experience of working with complex data environments across different industries.

By the end of this journey, you will have a thorough understanding of how to migrate, operate, and govern your data platforms at scale, all while aligning with industry best practices and modern technological advancements.

So, let’s dive in and build the future of data engineering together!
