Security News > 2020 > December > Amazon DevOps Guru: An ML-powered operations service that improves application availability
With just a few clicks in the Amazon DevOps Guru console, historical application and infrastructure metrics like latency, error rates, and request rates for all resources are automatically ingested and analyzed to establish normal operating bounds, and Amazon DevOps Guru then uses a pre-trained machine learning model to identify deviations from the established baseline.
Together with Amazon CodeGuru - a developer tool powered by machine learning that provides intelligent recommendations for improving code quality and identifying an application's most expensive lines of code - Amazon DevOps Guru provides customers the automated benefits of machine learning for their operational data so that developers can more easily improve application availability and reliability.
Customers can also view correlated operational events and contextual data for operational insights within the Amazon DevOps Guru console and receive alerts via Amazon SNS. Additionally, Amazon DevOps Guru supports API endpoints through the AWS SDK, making it easy for partners and customers to integrate Amazon DevOps Guru into their existing solutions for ticketing, paging, and automatic notification of engineers for high-severity issues.
"With our new Opsgenie and Jira Service Management integration, the right teams can be immediately notified the instant Amazon DevOps Guru predicts a potential issue, or determines an incident has occurred. Amazon DevOps Guru provides a new dimension of insight, and Atlassian ensures the fastest response."
"We're excited to continue this commitment to DevOps with our latest integration with Amazon DevOps Guru. Leveraging Amazon's decades of operational excellence and Amazon DevOps Guru's machine learning capabilities, PagerDuty provides even more real-time signal-to-action capabilities to our joint customers. Through PagerDuty's ingestion of Amazon DevOps Guru's Amazon SNS, AWS customers can take real-time action on operational issues before they become customer-impacting outages."
News URL
http://feedproxy.google.com/~r/HelpNetSecurity/~3/9Wy3I3wOyQk/