We use OMD + naemon + mod_gearman + keepalived with dupserver to build an HA Nagios setup. keepalived changes the active_checks settings in naemon so that checks only run on the box that currently holds the keepalived IP.
We keep hitting an issue: when we deploy our settings and restart OMD, keepalived fails over, which is expected. However, if we are working on nodes and have acknowledged alerts or put services in downtime, those services start alerting again after the failover. We’d like a way to submit acknowledgements and downtimes to both naemon instances.
Is there any way of doing this currently? I can’t find anything obvious. It looks like downtime commands are submitted to naemon only and mod_gearman doesn’t pick them up.
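To illustrate what we are after, here is a rough sketch of the kind of workaround we have in mind. It is not something we run today. It assumes both naemon nodes expose livestatus over TCP (the hostnames and port 6557 below are made up) and it sends the same standard external command to each node. The helper names are hypothetical, and the sticky/notify/persistent and fixed/trigger values are just example choices.

```python
import socket
import time

# Hypothetical livestatus TCP endpoints, one per naemon node (assumption).
NODES = [("naemon-a.example.com", 6557), ("naemon-b.example.com", 6557)]


def ack_service(host, service, author, comment, now=None):
    """Build a livestatus COMMAND line for ACKNOWLEDGE_SVC_PROBLEM."""
    now = int(time.time()) if now is None else now
    # Argument order: host;service;sticky;notify;persistent;author;comment
    return (
        f"COMMAND [{now}] ACKNOWLEDGE_SVC_PROBLEM;"
        f"{host};{service};2;1;1;{author};{comment}"
    )


def downtime_service(host, service, start, end, author, comment, now=None):
    """Build a livestatus COMMAND line for a fixed SCHEDULE_SVC_DOWNTIME."""
    now = int(time.time()) if now is None else now
    # Argument order: host;service;start;end;fixed;trigger_id;duration;author;comment
    return (
        f"COMMAND [{now}] SCHEDULE_SVC_DOWNTIME;"
        f"{host};{service};{start};{end};1;0;{end - start};{author};{comment}"
    )


def send_to_all(command, nodes=NODES, timeout=5.0):
    """Send the same command to every node; raises if a node is unreachable."""
    for address in nodes:
        with socket.create_connection(address, timeout=timeout) as sock:
            # A blank line terminates the livestatus request.
            sock.sendall((command + "\n\n").encode("utf-8"))
```

Usage would be something like `send_to_all(ack_service("web01", "HTTP", "alice", "working on it"))`. If there is a built-in way to get the same effect, we would much prefer that over maintaining a wrapper like this.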