Dude False Positives
Posted: Thu Jan 14, 2016 12:28 pm
We have moved from the old Dude on a Windows server to the new Dude 6.34rc34 running on a CHR on ESXi, the config has been recreated from scratch as the import did not work.
Yesterday it was running fine then last night it generated a lot of alerts saying devices were down, even this morning there are still a few that it is adamant are down but if I log on to the underlying CHR and ping the devices I get consistent replies with 0 packet loss meanwhile the probe down count is still going up in the dude, the probes are just simple pings.
I have tried increasing the probe interval, timeout and down count but it does not make a difference.
Yesterday it was running fine then last night it generated a lot of alerts saying devices were down, even this morning there are still a few that it is adamant are down but if I log on to the underlying CHR and ping the devices I get consistent replies with 0 packet loss meanwhile the probe down count is still going up in the dude, the probes are just simple pings.
I have tried increasing the probe interval, timeout and down count but it does not make a difference.