Notification - Remote poller "offline"

10RUPTiV · December 15, 2025, 3:31pm

Hello,

Trying to achieve something…

We need to receive a notification when a remote poller is “offline”

Right now we have the service “cluster-zone” and “cluster” and notification are applied correctly but we never got a notification!

We really need that, the “master” sent a notification to someone when a remote poller is disconnected/offline…

bberg · December 15, 2025, 4:14pm

What is a remote poller? Do you mean an Agent?

I mean, you could simply use a Hostcheck like check_icmp? And then send a notification when the host gets the status DOWN?

10RUPTiV · December 15, 2025, 4:16pm

yeah sorry!

remote poller = agent

we can’t ping the remote agent… most of them are configured to connect to the master, not the opposite.

EDIT: We already have the “cluster-zone” as the check_command in the host itself with notification enabled but we never got them!

lorenz · December 16, 2025, 7:48am

This sounds like there might either not be a notification for that Host object (in Icinga Web 2, when visiting that Host is there a section “Notifications” and are the “Contact” fields empty or not?) or sending the notification might have failed.

jeanm · December 16, 2025, 11:53am

So, if i summarise what I understand, a notification must be sent from B (master) as soon as A (agent) can no longer reach B.

Assuming this is correct, I would define on B an object whose status is passively set from A at regular intervals, and a notification being sent if the last status update was sent more than x minutes ago.

My two cents,

Jean

10RUPTiV · December 16, 2025, 12:05pm

exactly, the objective is, IF “A” is not connected to “B” (master” that probably mean we lost Internet at location “A” and we need to notified…

rivad · December 16, 2025, 1:17pm

The cluster-zone as host check should already result in you receiving host notifications about the host going down. My guess, you need to debug you notification settings for the affected hosts.

10RUPTiV · December 17, 2025, 3:22pm

Strange issue, soft changes at 9h36am, 2nd soft changes at 9h37 and hard change at 9h38 but host ran into a problem at 15h40 and at the same time couple seconds later, host recovered.

So the notification for DOWN and UP are at the same time…

rivad · December 17, 2025, 3:40pm

Can you post the history tab?

10RUPTiV · December 17, 2025, 3:54pm

rivad · December 17, 2025, 4:11pm

In your picture, the Host ran into a problem Notification wasn’t directly after Hard state changed and I’m missing the state change before Host recovered.

Here an example how from the cluster-zone check in my setup:

OK, that is a service not a host…

10RUPTiV · December 17, 2025, 4:27pm

Host recovered after the the notification. But still, the notification for down/up are at the same time!

jeanm · December 17, 2025, 5:19pm

The notifications are probably being held at Agent until connectivity is restored, at which time both DOWN and UP are being sent by the Master.

This is precisely why I suggested the passive check approach, in which the Master will be able to detect that the Agent has disconnected.

Jean

rivad · December 18, 2025, 10:40am

Isn’t cluster-zone checked on the master and thus Independent of the agent?

jeanm · December 18, 2025, 10:58am

Good question, I tried to find information on these “internal” checks (icinga, cluster, cluster-zone), and could not find any info on where they are processed, or how.

I wanted to run them as command line, but I couldn’t find the way…

rivad · December 18, 2025, 11:19am

Probably a build in.
In my case, the check runs on a master node:

jeanm · December 18, 2025, 11:54am

So says the web interface (and on my instances I have the same), but is it genuinely run on the master node? Could someone familiar with the source code ascertain how it works?

bberg · December 18, 2025, 1:24pm

You cant run these builtin checks in the command line as far as i know. They can be run by any icinga2 instance, it depends on the zone configuration/the command_endpoint attribute.

jeanm · December 18, 2025, 3:28pm

The question becomes - where are they meant to be run?

I summarised below what I think the checks are doing. If anyone could verify this, I would be very grateful. Sorry the table is an image, I could not find an easy way to make tables here.

Somewhat related because it could help decide where the command should be run, here is a screenshot of the performance data returned by the cluster command:

Same for the cluster-zone command:

jeanm · January 6, 2026, 2:12pm

@apenning, @lorenz, could you or another insider comment on my last post above, please? I would like to ascertain that what I have written is correct, or have it rectified.

Thank you,

Jean