Hi Team,
I am trying to use check_nwc_health plugin for monitoring the network components performances.
I downloaded the plugin from the console labs :
Attaching the commands.conf for references.Conf.txt (10.1 KB)
In UI, we are getting timeout error, where as in cli am getting the right output.
I am attaching both screenshots for references.
Much appreciated if i could get some help in resolving this issue.
what happens if you run the command with the icinga user and not with root? Also what happens if you set the check timeout higher in your service definition?
We have in our company sometimes similar issues with some kind of cisco hardware. Because the hardware needs longer to answer than the defined check-timeout is. Especially the hardware is fully working (cpu load higher than 80/90%).
It could also be dependent on it how many checks with nwc you are running parallel on your server and/or how often you run nwc with diffrent paramaters/modes (e.g. interface, hardware health, etc.)
If you can run the check with the root user the compiling worked fine. Usually you should test every check with the icinga/nagios user and not with the root user. That means if the access permissions are set correct incl. systems like SELinux or AppAmor (if used and configured)
if this is set correct than check the next things: In your case the network device is fast enough to answer. If not try to set a higher check-timeout in your service check.
And what happens if many checks are running parrallel? Has your icinga server enough ressources available to run many checks? Bash commands like (h)top are your friend in this situation.