Summary
In this chapter, we learned how to use AWS SDK for pandas (aka awswrangler
). We explored various components available in the awswrangler
library and how it brings pandas DataFrames closer to the AWS ecosystem. We learned how to customize and install the library for different use cases and development environments. We also looked at awswrangler
integration with AWS services, such as Amazon S3, Amazon RDS, Amazon Redshift, and Amazon Athena, and the different features available within awswrangler
.
In the next chapter, we will learn about SageMaker Data Wrangler, which helps us perform data-wrangling activities as a part of ML pipelines.