What this book covers
Chapter 1, Basics of Modern Data Architecture, is a short introduction to data engineering, basic concepts of data engineering, and the role a Java data architect plays in data engineering.
Chapter 2, Data Storage and Databases, is a brief discussion about various data types, storage formats, data formats, and databases. It also discusses when to use them.
Chapter 3, Identifying the Right Data Platform, provides an overview of various platforms to deploy data pipelines and how to choose the correct platform.
Chapter 4, ETL Data Load – A Batch-Based Solution to Ingest Data in a Data Warehouse, discusses how to approach, analyze, and architect an effective solution for a batch-based data ingestion problem using Spring Batch and Java.
Chapter 5, Architecting a Batch Processing Pipeline, discusses how to architect and implement a data analysis pipeline in AWS using S3, Apache Spark (Java), AWS Elastic MapReduce (EMR), and AWS Athena for a big data use case.
Chapter 6, Architecting a Real-Time Processing Pipeline, provides a step-by-step guide to building a real-time streaming solution to predict the risk category of a loan application using Java, Kafka, and related technologies.
Chapter 7, Core Architectural Design Patterns, discusses various common architectural patterns used to solve data engineering problems and when to use them.
Chapter 8, Enabling Data security and Governance, introduces data governance and discusses how to apply it using a practical use case. It also briefly touches upon the topic of data security.
Chapter 9, Exposing MongoDB Data as a Service, provides a step-by-step guide on how to build Data as a Service to expose MongoDB data using a REST API.
Chapter 10, Federated and Scalable DaaS with GraphQL, discusses what GraphQL is, various GraphQL patterns, and how to publish data using GraphQL.
Chapter 11, Measuring Performance and Benchmarking Your Applications, provides an overview of performance engineering, how to measure performance and create benchmarks, and how to optimize performance.
Chapter 12, Evaluating, Recommending, and Presenting Your Solutions, discusses how to evaluate and choose the best-suited alternative among various architectures and how to present the recommended architecture effectively.