Reviewing the AWS services for data analysis
AWS provides multiple services geared to help data scientists analyze structured, semi-structured, or unstructured data at scale. A common theme across these services is flexibility of choice: users can match the right aspects of each service to their use case. With so many options, it is not always obvious which service to leverage for a given use case.
Thus, in this section, we will map some of the AWS capabilities to the methodologies we reviewed in the previous section.
Unifying the data into a common store
To address the requirement of storing the relevant global population of data from multiple sources in a common store, AWS provides Amazon Simple Storage Service (S3), an object storage service that allows users to store structured, semi-structured, and unstructured data as objects within buckets.
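As a rough illustration of storing data from multiple sources as objects within a bucket, the following minimal sketch uses the AWS SDK for Python (boto3). The prefix layout, the object_key and upload helper names, and any bucket or file names are hypothetical and chosen only for this example; the sketch also assumes that boto3 is installed and that AWS credentials are already configured.

```python
from pathlib import PurePosixPath

# Hypothetical key prefixes, one per data category named in the text.
PREFIXES = {
    "structured": "structured",
    "semi-structured": "semi-structured",
    "unstructured": "unstructured",
}


def object_key(category: str, source: str, filename: str) -> str:
    """Build an object key of the form '<category>/<source>/<filename>'."""
    if category not in PREFIXES:
        raise ValueError(f"unknown category: {category!r}")
    return str(PurePosixPath(PREFIXES[category], source, filename))


def upload(local_path: str, bucket: str, key: str) -> None:
    """Upload one local file to the given bucket under the given key."""
    # Imported here so that key building works even without the SDK installed.
    import boto3

    s3 = boto3.client("s3")
    s3.upload_file(local_path, bucket, key)
```

For instance, a CSV export from a hypothetical CRM system could be stored by calling upload("customers.csv", "my-common-store", object_key("structured", "crm", "customers.csv")). Grouping objects under a per-category, per-source prefix is only one possible convention; S3 itself does not require any particular key layout.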
Note
If you are unfamiliar with the S3 service, how it works, and...