I’ve got a problem with understanding how a host can (apparently) be both, DOWN (hence the hostalive check returning Critical) and reachable at the same time. Which information can I derive from this constellation, and where might possible errors regarding configuration or connectivity between my master and the host lie?
Can you share with us the host object configuration and the service definition for the hostalive ?
Is the setup a distributed one or are you executing all the checks from the master ?
thanks for helping me out here The setup is partly distributed, but all checks for this specific machine are executed on the master node. The host object looks as follows
This might be unrelated to Icinga, but if the device is a router, is it possible it is blocking ICMP, while you can reach it on telnet/SSH or the other management ports (SNMP) used by the vendor ?
That might actually be it. So trivial I never thought of that, will look into it in a moment. In the meantime, do you happen to know why the host is marked as “reachable” anyways? How is Icinga determining if a host is reachable if ICMP is rejected?
The dependencies are used for limiting notifications of child hosts (e.g behind a central router/switch), so you don’t get bombarded with messages, if the central component is DOWN. You will then only get the notification for the parent host.