You're reading from Data Engineering with AWS Cookbook A recipe-based approach to help you tackle data engineering problems with AWS services

Product type Paperback

Published in Nov 2024

Publisher Packt

ISBN-13 9781805127284

Length 528 pages

Edition 1st Edition

Languages

Python

Tools

AWS Glue

Concepts

Cloud Computing Data Engineering

Authors (4):

Viquar Khan

Gonzalo Herreros González

Huda Nofal

Trâm Ngọc Phạm

Table of Contents (16) Chapters

Preface

1. Chapter 1: Managing Data Lake Storage

2. Chapter 2: Sharing Your Data Across Environments and Accounts

3. Chapter 3: Ingesting and Transforming Your Data with AWS Glue

4. Chapter 4: A Deep Dive into AWS Orchestration Frameworks

5. Chapter 5: Running Big Data Workloads with Amazon EMR

6. Chapter 6: Governing Your Platform

7. Chapter 7: Data Quality Management

8. Chapter 8: DevOps – Defining IaC and Building CI/CD Pipelines

9. Chapter 9: Monitoring Data Lake Cloud Infrastructure

10. Chapter 10: Building a Serving Layer with AWS Analytics Services

11. Chapter 11: Migrating to AWS – Steps, Strategies, and Best Practices for Modernizing Your Analytics and Big Data Workloads

12. Chapter 12: Harnessing the Power of AWS for Seamless Data Warehouse Migration

13. Chapter 13: Strategizing Hadoop Migrations – Cost, Data, and Workflow Modernization with AWS

14. Index

15. Other Books You May Enjoy

Setting up a pipeline using AWS Glue to ingest data from a JDBC database into a catalog table

Building a full pipeline with AWS Glue to ingest data from a relational database on a regular schedule requires several components: a Glue job, a Glue crawler, and a retry mechanism to handle transient errors. In this recipe, we will combine an AWS Glue job with EventBridge and a Step Functions workflow. The pipeline reads data from a relational database and stores it in an S3 bucket.
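To make the moving parts concrete, here is a minimal sketch of how a Step Functions workflow could run the Glue job with retries and then start the crawler. It is one plausible wiring, not necessarily the one this recipe goes on to build. The job name, crawler name, retry interval, and attempt count are hypothetical placeholders, and the function only builds the Amazon States Language definition as a Python dictionary; it does not call AWS.

```python
import json


def build_ingestion_state_machine(job_name, crawler_name, max_attempts=3):
    """Return an Amazon States Language definition as a plain dict.

    job_name and crawler_name are hypothetical placeholders for a Glue job
    and a Glue crawler that are assumed to exist already.
    """
    return {
        "Comment": "Run a Glue ingestion job, then crawl its output",
        "StartAt": "RunGlueJob",
        "States": {
            "RunGlueJob": {
                "Type": "Task",
                # The .sync suffix makes the state wait for the job run to finish.
                "Resource": "arn:aws:states:::glue:startJobRun.sync",
                "Parameters": {"JobName": job_name},
                # Retry failed runs with exponential backoff (assumed values).
                "Retry": [
                    {
                        "ErrorEquals": ["States.TaskFailed"],
                        "IntervalSeconds": 60,
                        "MaxAttempts": max_attempts,
                        "BackoffRate": 2.0,
                    }
                ],
                "Next": "StartCrawler",
            },
            "StartCrawler": {
                "Type": "Task",
                # AWS SDK integration: starts the crawler without waiting for it.
                "Resource": "arn:aws:states:::aws-sdk:glue:startCrawler",
                "Parameters": {"Name": crawler_name},
                "End": True,
            },
        },
    }


definition = build_ingestion_state_machine("jdbc-ingest-job", "ingest-crawler")
print(json.dumps(definition, indent=2))
```

The printed JSON could be supplied as the definition when creating a state machine, and an EventBridge schedule rule could then target that state machine to run the pipeline on a regular basis.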

How to do it…

Set up your environment:
1. Use your existing S3 bucket or create a new one. (To create a new S3 bucket, navigate to the S3 service in the AWS Management Console, click on Create bucket, and specify a unique name. Choose the region and configure settings such as versioning or encryption as needed, then click on Create.)
2. Create an RDS MySQL instance (please use the following link and follow the given instructions: https://aws.amazon.com/getting-started/hands-on/create...
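
The bucket creation in step 1 can also be scripted instead of done in the console. The following is a minimal sketch, assuming boto3 is installed and AWS credentials are configured; the bucket name and region passed in are hypothetical, and the helper names are illustrative only. The one detail worth showing is that S3 expects no location constraint for us-east-1 but requires one for other regions.

```python
def create_bucket_kwargs(bucket_name, region):
    """Build the keyword arguments for the S3 create_bucket call.

    us-east-1 is the default location, so the LocationConstraint must be
    omitted there; every other region needs it set explicitly.
    """
    kwargs = {"Bucket": bucket_name}
    if region != "us-east-1":
        kwargs["CreateBucketConfiguration"] = {"LocationConstraint": region}
    return kwargs


def create_bucket(bucket_name, region):
    """Create the bucket (requires boto3 and valid AWS credentials)."""
    import boto3  # imported here so the helper above works without boto3

    s3 = boto3.client("s3", region_name=region)
    return s3.create_bucket(**create_bucket_kwargs(bucket_name, region))
```

Settings such as versioning or encryption, mentioned in step 1, would be configured with separate calls after the bucket exists.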
