1. Should performance data for deleted hosts / services be purged automatically? '/usr/local/nagios/share/perfdata/' still contains data from hosts we've deleted. Should we delete this manually, or is it something that should be done automatically (if so, how frequently?)?
2. I've created the following custom 'rra.cfg' config:
Code: Select all
RRA:AVERAGE:0.5:1:43200
RRA:AVERAGE:0.5:5:25920
RRA:AVERAGE:0.5:30:8640
RRA:AVERAGE:0.5:360:1460
RRA:AVERAGE:0.5:1440:1095
RRA:MAX:0.5:1:43200
RRA:MAX:0.5:5:25920
RRA:MAX:0.5:30:8640
RRA:MAX:0.5:360:1460
RRA:MAX:0.5:1440:1095
RRA:MIN:0.5:1:43200
RRA:MIN:0.5:5:25920
RRA:MIN:0.5:30:8640
RRA:MIN:0.5:360:1460
RRA:MIN:0.5:1440:1095
1 Minute - 30 Days
5 Minutes - 90 Days
30 Minutes - 180 Days
6 Hours - 1 Year
1 Day - 3 Years
These RRD files are going to be a fair bit bigger than the 400k default. I was just wondering whether you've any data on how well Nagios will cope with larger RRD files? Is there an upper limit? We have rrdcached configured, so hopefully the I/O should be manageable.
3. When reading the documentation for creating plugins, I've noticed there's an option to specify 'critical'/'threshold' targets in the performance output: 'label'=value[UOM];[warn];[crit];. When I've written plugins previously, I've passed the thresholds to the plugin, and had the plugin calculate whether the thresholds were breached, and pass a return code back to Nagios. Is this not the preferred way of doing it? Does one method override the other?
Sorry for the bombardment of questions, we just want to make sure our Nagios installation is perfect!

Many thanks,
Gavin