Hi togehter,
after checking the log i will find out that an event command runs on the master and the satellite.
But i need this event commad only on the master.
The log on the satellite shows me this line:
[2021-01-14 13:56:37 +0100] warning/PluginEventTask: Event command for object ‘xxxx-server.de!GENESYS_SERVER_ALARM’ (PID: 10880, arguments: ‘/usr/lib/nagios/plugins/eventhandler/notification_flagfile.pl’ ‘-h’ ‘xxxx-server.de’ ‘-n’ ‘GENESYS_SERVER_ALARM’ ‘-s’ ‘CRITICAL’) terminated with exit code 128, output: execvpe(/usr/lib/nagios/plugins/eventhandler/notification_flagfile.pl) failed: No such file or directory
Hi Dirk
Thanks for you reply and the links.
The documentation is clear, but i do not understand why the master node run the same eventhandler.
I‘m using the eventhandler since last year and the master works like i want. Today i‘d checked the logs on the satelite and found the error messages.
Regards
Sascha
Unfortunately it is a simple run on the agent endpoint if the check_endpoint is set and run on parents if not. It should be at least run in the zone the service is part of for consistent behaviour which would exactly not what you want, but it is not implemented like this. It also runs on all of them perhaps resulting in multiple restarts of a service or even more bad things, so we can only hope the developers will find a proper solution in the future. So it is totally on Icinga and not you or your configuration.
Hi Dirk,
ok - then I hope that the developers will soon process the open ticket. I believe that an ICINGA setup with multiple zones is not uncommon. Thanks again for the clarification
I also have this problem right now with an event handler, which is configured in the global-templates zone. The check I am running is executed by an satellite. It is a check by_ssh, so there is no check_endpoint defined. The satellite connects to the client via SSH and executes a plugin there.
My thought based on the docs was, the event_command is only running on the satellite, because the satellite executes the check. But unfortunately both the master and satellite execute the event_command, what leads to a double launch of a recovery job in AWX (triggered by the AWX API). My dirty workaround is only having the event_command script on the master. The satellite throws an error in the logs with “file not found”.
Are there any plans to give the event commands a little bit love?