Replica delay


Job details

Name: Replica delay
Platform: Mariadb
Category: Cluster and Replication
Description: This job collects the number of bytes received and sent from from other Galera Cluster nodes.
Long description:
Version: 1.5
Default schedule: 30s
Requires engine install: No
Compatibility tag: .[type=‘instance’ & databasetype=‘mariadb’]/instance[is_mariadb_branch=‘1′]

Parameters

Name Default value Description
warning_threshold 120 Maximum number of seconds between the replication SQL (applier) thread and the replication I/O (receiver) thread before a warning is triggered.
alarm_threshold 600 Maximum number of seconds between the replication SQL (applier) thread and the replication I/O (receiver) thread before an alarm is triggered.
return_status_when_Replica_IO_not_running 2 Return status value (ALARM – 2, WARNING – 1, or OK – 0) when replication I/O (receiver) thread is not started or/and it has not connected successfully to the source.
return_status_when_Replica_SQL_not_running 1 Return status value (ALARM – 2, WARNING – 1, or OK – 0) when replication I/O (applier) thread is not started.

Job Summary

SHOW SLAVE STATUS;

Job Details

Monitoring Details

Status Calculation

Output and Reporting

Field Description
Status Indicates the overall status based on the configured thresholds and the running state of the replication threads (OK, WARNING, ALARM).
Details Provides specifics about the replication delay, including how many seconds the replica is behind the master and the operational state of the replication threads.

Alerting Logic

This monitoring job is crucial for database administrators to keep a close eye on the health and performance of their MariaDB replicas, ensuring data consistency and timely troubleshooting of replication issues.