Managing Data Lake Storage
Amazon Simple Storage Service (Amazon S3) is a highly scalable and secure cloud storage service. It allows you to store and retrieve any amount of data at any time from anywhere in the world. S3 buckets aim to help enterprises and individuals achieve their data backup and delivery needs and serve a variety of use cases, including but not limited to web and mobile applications, big data analytics, data lakes, and data backup and archiving.
In this chapter, we will learn how to keep data secure in S3 buckets and configure your buckets in a way that best serves your use case from performance and cost perspectives.
The following recipes will be covered in this chapter:
- Controlling access to S3 buckets
- Storage types in S3 for optimized storage costs
- Enforcing encryption of S3 buckets
- Setting up retention policies for your objects
- Versioning your data
- Replicating your data
- Monitoring your S3 buckets