Improve Incident Response

Software Catalog enhances incident response by:

  • Improving the on-call experience by verifying and consolidating ownership details, communication channels, and monitoring and troubleshooting resources.
  • Embedding solutions and tools, such as runbooks and documentation, directly into existing observability workflows.
  • Accelerating incident recovery by simplifying the process of identifying owners of upstream and downstream dependencies.
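The last point can be illustrated with a minimal Python sketch. It does not use any Datadog API; the service names, team names, and the `CALLS` and `OWNERS` mappings are hypothetical stand-ins for the ownership and dependency data that Software Catalog consolidates. It assumes "upstream" means services that call the affected service and "downstream" means services the affected service calls.

```python
# Hypothetical data: each service maps to the services it calls directly.
CALLS = {
    "web-store": ["checkout", "search"],
    "checkout": ["payments"],
    "search": [],
    "payments": [],
}

# Hypothetical ownership records, one owning team per service.
OWNERS = {
    "web-store": "team-storefront",
    "checkout": "team-orders",
    "search": "team-discovery",
    "payments": "team-billing",
}


def direct_dependencies(service, calls):
    """Return (upstream, downstream) lists of direct dependencies for a service."""
    # Downstream: services this service calls.
    downstream = list(calls.get(service, []))
    # Upstream: services that list this service among their callees.
    upstream = [caller for caller, callees in calls.items() if service in callees]
    return upstream, downstream


def owners_to_notify(service, calls, owners):
    """Map each direct upstream and downstream dependency to its owning team."""
    upstream, downstream = direct_dependencies(service, calls)
    return {
        "upstream": {s: owners.get(s, "unknown") for s in upstream},
        "downstream": {s: owners.get(s, "unknown") for s in downstream},
    }


if __name__ == "__main__":
    print(owners_to_notify("checkout", CALLS, OWNERS))
```

With ownership recorded next to the dependency map, finding whom to contact during an incident becomes a lookup instead of a search across teams.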

Software Catalog also integrates with Datadog Incident Management and PagerDuty, allowing you to view related incidents in the Reliability tab on the Service Details page.

Note: Datadog incidents are linked to Software Catalog automatically, but you should apply SERVICE tags to incidents so that each service’s incident data is accurate. The PagerDuty integration must be set up manually before its incident information appears in Software Catalog.

The Reliability tab for a service, showing incident and error metrics for the service overall and by version

To view incident statuses for upstream and downstream dependencies, click a service in Software Catalog to open the Service Details page, and then click the Dependencies tab.

The Dependencies tab for a service, showing upstream and downstream dependencies and highlighting those impacted by an incident

Further reading

Additional helpful documentation, links, and articles: