You're reading from Geospatial Data Analytics on AWS Discover how to manage and analyze geospatial data in the cloud

Product type Paperback

Published in Jun 2023

Publisher Packt

ISBN-13 9781804613825

Length 276 pages

Edition 1st Edition

Languages

Tools

AWS

Concepts

Geospatial Analysis

Authors (3):

Scott Bateman

Jeff DeMuth

Janahan Gnanachandran

View More author details

Table of Contents (23) Chapters

Preface

1. Part 1: Introduction to the Geospatial Data Ecosystem

2. Chapter 1: Introduction to Geospatial Data in the Cloud FREE CHAPTER

3. Chapter 2: Quality and Temporal Geospatial Data Concepts

4. Part 2: Geospatial Data Lakes using Modern Data Architecture

5. Chapter 3: Geospatial Data Lake Architecture

6. Chapter 4: Using Geospatial Data with Amazon Redshift

7. Chapter 5: Using Geospatial Data with Amazon Aurora PostgreSQL

8. Chapter 6: Serverless Options for Geospatial

9. Chapter 7: Querying Geospatial Data with Amazon Athena

10. Part 3: Analyzing and Visualizing Geospatial Data in AWS

11. Chapter 8: Geospatial Containers on AWS

12. Chapter 9: Using Geospatial Data with Amazon EMR

13. Chapter 10: Geospatial Data Analysis Using R on AWS

14. Chapter 11: Geospatial Machine Learning with SageMaker

15. Chapter 12: Using Amazon QuickSight to Visualize Geospatial Data

16. Part 4: Accessing Open Source and Commercial Platforms and Services

17. Chapter 13: Open Data on AWS

18. Chapter 14: Leveraging OpenStreetMap on AWS

19. Chapter 15: Feature Servers and Map Servers on AWS

20. Chapter 16: Satellite and Aerial Imagery on AWS

21. Index

Why subscribe?

22. Other Books You May Enjoy

Common Hadoop frameworks

I’ve mentioned a few frameworks already such as PySpark, Spark, and Apache Pig, but there are hundreds more that do just about everything under the sun. You have data storage and database frameworks such as Hadoop Distributed File System (HDFS), NoSQL and SQL capabilities such as HBase and Hive, and machine learning with Mahout, to name a few. I have a good example of how these services came to be when I was first learning Hadoop. Over a decade ago, I got frustrated trying to run a Map Reduce job and stumbled across Apache Pig. Apache Pig (which was named after Pig Latin) was built to be a simple analytics language syntax for Hadoop and was an attempt to make it easier for users who didn’t know Java. This is also similar to what happened with PySpark, where the Hadoop users wanted a familiar language to work with Spark, so PySpark was born.

These frameworks are somewhat in the process of being disrupted with the release of Spark. Traditionally...

The rest of the chapter is locked

You're reading from Geospatial Data Analytics on AWS Discover how to manage and analyze geospatial data in the cloud

Table of Contents (23) Chapters

Common Hadoop frameworks

Unlock this book and the full library FREE for 7 days

Authors (3)

Personalised recommendations for you