ML Logs/Metrics Incident & Anomaly Detection Software for DevOps

Step 2 - Pattern and Anomaly Detection

Within the first hour, the ML learns the normal pattern of each type of log event and metric, and the learning continues to improve as more data arrives.

When the pattern of a log event or metric changes (e.g. a change in periodicity or frequency, or a new or rare message starting to appear), the change is scored by how "anomalous" it is. Taken individually, these anomalies tend to be very noisy. To separate signal from noise, the ML then looks for hotspots of abnormally correlated anomalies across the metrics and logs.
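The idea of scoring individual anomalies and then looking for correlated hotspots can be illustrated with a small sketch. This is not Zebrium's algorithm, which is not described here; it is a simplified stand-in that assumes each log event type and metric has been reduced to a count or value per time bucket, scores each bucket with a plain z-score against earlier buckets, and reports buckets where several streams are anomalous at once. The function names, stream names, threshold and sample data are all hypothetical.

```python
from statistics import mean, pstdev


def anomaly_scores(counts, warmup=5):
    """Score each bucket by its distance from the mean of all earlier buckets.

    The first `warmup` buckets score 0.0 because there is too little history.
    (Assumption: a z-score stands in for whatever scoring a real system uses.)
    """
    scores = []
    for i, value in enumerate(counts):
        history = counts[:i]
        if len(history) < warmup:
            scores.append(0.0)
            continue
        mu = mean(history)
        sigma = pstdev(history) or 1.0  # avoid dividing by zero on flat history
        scores.append(abs(value - mu) / sigma)
    return scores


def find_hotspots(streams, threshold=3.0, min_streams=2):
    """Return (bucket, stream names) where at least `min_streams` streams
    exceed `threshold` in the same time bucket."""
    scored = {name: anomaly_scores(counts) for name, counts in streams.items()}
    n_buckets = min(len(s) for s in scored.values())
    hotspots = []
    for t in range(n_buckets):
        hot = sorted(name for name, s in scored.items() if s[t] >= threshold)
        if len(hot) >= min_streams:
            hotspots.append((t, hot))
    return hotspots


# Hypothetical per-bucket data: two streams spike together in bucket 7,
# while "log:cache_miss" has an isolated blip in bucket 5.
streams = {
    "log:db_timeout": [0, 0, 0, 0, 0, 0, 0, 9, 0, 0],
    "metric:latency_ms": [20, 21, 19, 20, 22, 20, 21, 95, 21, 20],
    "log:cache_miss": [5, 4, 6, 5, 5, 9, 5, 5, 5, 5],
}
hotspots = find_hotspots(streams)
```

In this sample, the isolated blip in "log:cache_miss" scores as anomalous on its own but is not reported, because no other stream is anomalous in the same bucket. Only bucket 7, where a log stream and a metric change together, is returned as a hotspot.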

Step 3 - Augment (optional)

If you use an incident management tool such as PagerDuty, Opsgenie or Slack, or an existing log management or monitoring tool, Zebrium can augment any incident with a characterization of its root cause.

A signal is sent to Zebrium when an incident occurs, or you can trigger one manually from the Zebrium UI. Zebrium then finds any root cause reports or sets of anomalous log/metric patterns that coincide with the signal, and automatically feeds that information back to your incident management tool.
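As a rough illustration of what "coincide with the signal" could mean, the sketch below matches an incident timestamp against previously generated reports that fall within a time window, then builds a note to attach to the incident. It does not use or represent the Zebrium API or any incident management tool's API. The `RootCauseReport` type, the function names, the 30-minute window and the sample data are all assumptions made for the example.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class RootCauseReport:
    """Hypothetical record of a root cause report or anomalous pattern set."""
    created_at: datetime
    summary: str


def reports_for_signal(signal_time, reports, window=timedelta(minutes=30)):
    """Return reports within `window` of the signal, closest in time first."""
    nearby = [r for r in reports if abs(r.created_at - signal_time) <= window]
    return sorted(nearby, key=lambda r: abs(r.created_at - signal_time))


def build_annotation(incident_id, matches):
    """Build a plain dict that could be posted back to an incident tool."""
    if not matches:
        note = "No coinciding root cause reports found."
    else:
        note = "; ".join(r.summary for r in matches)
    return {"incident_id": incident_id, "note": note}


# Hypothetical reports and an incident signal at 12:25.
reports = [
    RootCauseReport(datetime(2021, 3, 1, 12, 0), "Disk full on db-1"),
    RootCauseReport(datetime(2021, 3, 1, 12, 20), "Connection pool exhausted"),
    RootCauseReport(datetime(2021, 3, 1, 14, 0), "Unrelated deploy warning"),
]
signal_time = datetime(2021, 3, 1, 12, 25)
matches = reports_for_signal(signal_time, reports)
annotation = build_annotation("INC-123", matches)
```

Here the 14:00 report is ignored because it falls outside the window, and the two remaining reports are ordered by how close they are to the signal. A real integration would replace the dict with whatever payload the incident tool expects.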
