You're reading from Data Engineering with AWS Cookbook A recipe-based approach to help you tackle data engineering problems with AWS services

Product type Paperback

Published in Nov 2024

Publisher Packt

ISBN-13 9781805127284

Length 528 pages

Edition 1st Edition

Languages

Python

Tools

AWS Glue

Concepts

Data Engineering

Authors (6):

Trâm Ngọc Phạm

Tram Pham

Gonzalo Herreros González

Vaquar Khan

Huda Nofal

Viquar Khan

+2 more

View More author details

Table of Contents (16) Chapters

Preface

1. Chapter 1: Managing Data Lake Storage

2. Chapter 2: Sharing Your Data Across Environments and Accounts FREE CHAPTER

3. Chapter 3: Ingesting and Transforming Your Data with AWS Glue

4. Chapter 4: A Deep Dive into AWS Orchestration Frameworks

5. Chapter 5: Running Big Data Workloads with Amazon EMR

6. Chapter 6: Governing Your Platform

7. Chapter 7: Data Quality Management

8. Chapter 8: DevOps – Defining IaC and Building CI/CD Pipelines

9. Chapter 9: Monitoring Data Lake Cloud Infrastructure

10. Chapter 10: Building a Serving Layer with AWS Analytics Services

11. Chapter 11: Migrating to AWS – Steps, Strategies, and Best Practices for Modernizing Your Analytics and Big Data Workloads

12. Chapter 12: Harnessing the Power of AWS for Seamless Data Warehouse Migration

13. Chapter 13: Strategizing Hadoop Migrations – Cost, Data, and Workflow Modernization with AWS

14. Index

Why subscribe?

15. Other Books You May Enjoy

Converting ETL processes with big data frameworks

As the volume and complexity of data continue to grow, traditional ETL processes are struggling to keep pace. Big data frameworks, such as Apache Hadoop, Apache Spark, and AWS, offer a powerful solution for migrating and managing big data workloads, enabling organizations to process, analyze, and extract valuable insights effectively from their vast data repositories.

Getting ready

Let’s discover how AWS can help you overcome the limitations of traditional ETL processes and unlock new possibilities for data analysis:

Challenges of traditional ETL in big data: Traditional ETL processes face several limitations in handling the massive scale and complexity of big data:
- Scalability: Traditional ETL tools aren’t designed to handle the massive scale of big data, leading to performance bottlenecks and slow processing times
- Flexibility: Traditional ETL processes are often rigid and inflexible, making it difficult to...

The rest of the chapter is locked

You're reading from Data Engineering with AWS Cookbook A recipe-based approach to help you tackle data engineering problems with AWS services

Table of Contents (16) Chapters

Converting ETL processes with big data frameworks

Getting ready

Unlock this book and the full library FREE for 7 days

Authors (6)

Personalised recommendations for you