Creating Alerts with Netdata Alerts Configuration Manager
The Netdata Alerts Configuration Manager enables users with Business subscriptions to create alerts from the Netdata Dashboard with an intuitive user interface.
Using Alerts Configuration Manager
Go to the Metrics tab and navigate to the chart you want to configure an alert for.
Click the Alert icon on the top right corner of the chart.
The Alert Configuration Manager will open up with the default thresholds. You can modify the configuration as required and the alert definition on the right will be updated dynamically.
If you want more fine-grained control or access to more advanced settings, enable Show advanced
Copy the alert definition that is generated in the code box and add it to an existing health configuration file or a new custom file under
<path to netdata install>/etc/netdata/health.d/
on a Parent Agent or a Standalone Child Agent.Reload Netdata Alert Health checks
<path to netdata install>/usr/sbin/netdatacli reload-health
and the new alert is now configured.
Alerts Configuration Manager Sections
- Alert Detection MethodAn alert is triggered whenever a metric crosses a threshold:
- Based on a standard
threshold
- Based on metric
variance
- Based on the
anomaly rate
of the metric
- Based on a standard
Metrics Lookup, Filtering and Formula Section
Metrics Lookup: This is the Evaluate line of fields in the modal and it defines the parameters for the database lookup that is needed to get the value that will be compared against the alert definition. It corresponds to the
lookup
line of the Alert configuration file. The Alerts Configuration Manager provides a default selection for the lookup and can be modified to suit your requirements. The parameters that can be modified are:- METHOD (
avg
,sum
,min
,max
,cv
,stddev
) - COMPUTATION (sum of all dimensions or individually for each dimension)
- DIMENSIONS (All dimensions, or a selection of dimensions)
- DURATION (the period in time to run the lookup)
- OPTIONS (
absolute
,unaligned
,percentage
,min2max
)
- METHOD (
Alert Filtering: This functionality can be accessed through the Show advanced checkbox and it allows for filtering the alert health checks to be run only for specific components of the infrastructure. It helps in achieving a fine-grained configuration for any given alert.
HOSTS
/NODES
- By default all hosts are selected. You can pick nodes from the dropdown list, or enter a wildcard matching a list of hosts that you want the alert health check to run on. This field corresponds to thehosts
line of the Alert configuration file.INSTANCES
- All instances are selected by default. You can pick instances from the dropdown list, or enter a wildcard matching a list of instances that you want the alert health check to run on. This field corresponds to thecharts
line of the Alert configuration file.CHART LABELS
- All chart labels are selected by default. You can pick a chart label from the dropdown list or enter a wildcard matching a list of chart labels that you want the alert health check to run on. This field corresponds to thechart labels
line of the Alert configuration file.OS
- All Operating Systems are selected by default. You can choose which OS(s) an alert health check should run on. This field corresponds to theos
line of the Alert configuration file.
Formula / Calculation: This field is available through the Show advanced checkbox and it is used to define a formula to be run on top of the
lookup
value. The result of the lookup is available in the$this
variable, and after the formula is run, the result is also stored in$this
and can be accessed while setting the alert thresholds. This field corresponds to thecalc
line of the Alert configuration file.
Alerting conditions
Warning and Critical Thresholds: These fields are used to set the thresholds for the
Warning
andCritical
alert states, while also having the option to set the condition for the alert to be raised if it isabove
orbelow
the given threshold. If the advanced settings are selected, a formula option can also be used, to define a custom formula instead of a threshold. These fields correspond to thewarn
andcrit
lines of the Alert configuration file.- Recovery Thresholds: This field is available through the Show advanced checkbox, and it is used to set the threshold that the metric value needs to meet to de-escalate from a given severity status, like
Critical to Warning
and fromWarning to Clear
. The logic is appended to thewarn
andcrit
lines of the Alert configuration file and resembles a one-lineIF-THEN-ELSE
clause.
- Recovery Thresholds: This field is available through the Show advanced checkbox, and it is used to set the threshold that the metric value needs to meet to de-escalate from a given severity status, like
Check Interval: This field is used to define the frequency of the health check for the alert and corresponds to the
every
line of the Alert configuration file.Delay Notifications: This field is available through the Show advanced checkbox and it is used to set delay parameters on notifications for an alert severity
escalation
orde-escalation
. It corresponds to thedelay
line of the Alert configuration file.Agent Specific Options: These options are only available on the
Netdata Agent
and not honored onNetdata Cloud
. They can be accessed through the Show advanced checkbox.- Repeat Notifications: This field defines the repeat frequency for the alert notification when the alert is in either
warning
orcritical
status and corresponds to therepeat
line of the Alert configuration file. - Send to: This field is used to define a user role to which the alert notifications will be sent. If set to
silent
, then the alert won't be sent to any role. It corresponds to theto
line of the Alert configuration file. - Custom Exec Script: This field is used to define a custom script that will be executed when the alert is triggered (but needs to be carefully designed as it needs to call the
health_alarm_notify.sh
module) and corresponds to theexec
line of the Alert configuration file.
- Repeat Notifications: This field defines the repeat frequency for the alert notification when the alert is in either
Alert Name, Description and Summary Section
- Alert Template Name: This field uniquely identifies an alert and corresponds to the
template
field of the Alert configuration file. The Alerts Configuration Manager provides a default name for an Alert template but we recommend you modify this to have a meaningful name for your configured alert. - Alert Template Description: This field provides a brief explanation of the alert and corresponds to the
info
line of the Alert configuration file. - Alert Summary: This field enables the users to create a custom title for the alert notification (via Notification integrations) and corresponds to the
summary
line of the Alert configuration file.
- Alert Template Name: This field uniquely identifies an alert and corresponds to the
Do you have any feedback for this page? If so, you can open a new issue on our netdata/learn repository.