Chapter 10: AIOps with Amazon DevOps Guru and Systems Manager OpsCenter
This chapter will introduce you to the concepts of artificial intelligence (AI) and machine learning (ML). We will also learn about AIOps, why we need it, and how it is applied to IT operations. We will cover some of the areas where AIOps can be helpful. We will learn about the AWS DevOps Guru AIOps tool and implement two use cases. We will deploy a serverless application and inject some failure, and then analyze the insights and remediation provided by Amazon DevOps Guru. Then, another use case will be covered about identifying anomalies in CPU, memory, and networking within an Elastic Container Service for Kubernetes (EKS) cluster.
This chapter contains the following main sections:
- AIOps and how it helps in IT operations
- AIOps using Amazon DevOps Guru
- Enabling DevOps Guru on EKS cluster resources
- Injecting failure and then reviewing the insights
- Deploying a serverless application and...