The client's enterprise systems comprised multiple upstream and downstream systems that participated in fulfillment of a business function. There were a few issues with effective triaging across cross-functional microservices teams
- Lack of standardization in logs and ineffective usage of log analytics tools like Splunk, led to significant time spent by the Client’s teams in analysing the issue before it was assigned to the right team. Many times, this delay used to impact critical business activities
- No unified way of tracking and visualization of request from source service to the leaf node in the hierarchy (L0 to L5)
- No standard procedure of logging entries and error reporting by various cross functional teams.
- No visibility on significant latency issues in microservices.