Re: [snips-users] false/null alert
On Thu, Jun 20, 2002 at 09:27:01AM -0500, peiffer at engineer8 nts.umn.edu wrote:
> Has anyone completed additional debug on hostmon null alerts?
>
> I too have found problems with hostmon similar to what Russell reported back
> in March. I have monitors based upon the hostmon script that I am recycling.
> The scripts give null information on reporting monitor, device and variable.

I /think/ mine may be related to the root directory getting close to full
(i.e. DFSpace for "/"). Looking into the logfiles, I noticed that it's
actually getting /logged/ that way and not just alerted that way, so the
problem is probably in whatever is recording the issue (i.e. hostmon itself
in this case?).

> Tue Jun 18 11:16:49 2002 []: DEVICE VAR 1127 900 LEVEL Critical LOGLEVEL Critical STATE down old
> Tue Jun 18 11:21:49 2002 []: DEVICE VAR 298 900 LEVEL Info LOGLEVEL Critical STATE up
>
> Discussions locally suggested that there may be some interaction near
> the polling time. Dumping various datafiles reveals no exact match,
> ( ../bin/display_snips_datafile hostmon-output) but does indicate some
> problems with 'unknown' state variables on startup. The only place that
> I see null device/agent are hostmon, snmpmon, and dhcpmon (recycled hostmon).
> The times are the same for all 3 monitors. Could there be a race condition,
> or deadlock on resources common to all of the above?
>
> Tim Peiffer peiffer at umn edu

Personally, I hope to revisit this soon... though with the (in)famous
caveat... "as soon as work calms down enough that I can look at it."

I have noticed, though, that now that I keep my root drives pretty wide
open and away from the thresholds, it seems a lot quieter.

Russell

--
Russell M. Van Tassell
russell at loosenut com

Seleznick's Theory of Holistic Medicine: Ice Cream cures all ills.