This article describes how a development team used metrics to find bottlenecks and improve the performance of a distributed system. The article is based on actual load testing that we did for a sample application.
This article is part of a series. Read the first part here.
https://learn.microsoft.com/en-us/azure/architecture/performance/event-streaming