Questions about Performance Data

Gavin · Post by **Gavin** » Mon Dec 31, 2012 5:15 am

I'm trying to get my head around the performance data, and I've got a few questions...

1. Should performance data for deleted hosts / services be purged automatically? '/usr/local/nagios/share/perfdata/' still contains data from hosts we've deleted. Should we delete this manually, or is it something that should be done automatically (if so, how frequently?)?

2. I've created the following custom 'rra.cfg' config:

Code: Select all

RRA:AVERAGE:0.5:1:43200
RRA:AVERAGE:0.5:5:25920
RRA:AVERAGE:0.5:30:8640
RRA:AVERAGE:0.5:360:1460
RRA:AVERAGE:0.5:1440:1095

RRA:MAX:0.5:1:43200
RRA:MAX:0.5:5:25920
RRA:MAX:0.5:30:8640
RRA:MAX:0.5:360:1460
RRA:MAX:0.5:1440:1095

RRA:MIN:0.5:1:43200
RRA:MIN:0.5:5:25920
RRA:MIN:0.5:30:8640
RRA:MIN:0.5:360:1460
RRA:MIN:0.5:1440:1095

This should record data as follows:

1 Minute - 30 Days
5 Minutes - 90 Days
30 Minutes - 180 Days
6 Hours - 1 Year
1 Day - 3 Years

These RRD files are going to be a fair bit bigger than the 400k default. I was just wondering whether you've any data on how well Nagios will cope with larger RRD files? Is there an upper limit? We have rrdcached configured, so hopefully the I/O should be manageable.

3. When reading the documentation for creating plugins, I've noticed there's an option to specify 'critical'/'threshold' targets in the performance output: 'label'=value[UOM];[warn];[crit];. When I've written plugins previously, I've passed the thresholds to the plugin, and had the plugin calculate whether the thresholds were breached, and pass a return code back to Nagios. Is this not the preferred way of doing it? Does one method override the other?

Sorry for the bombardment of questions, we just want to make sure our Nagios installation is perfect!

Many thanks,

Gavin

scottwilkerson · Post by **scottwilkerson** » Wed Jan 02, 2013 11:03 am

Gavin wrote:1. Should performance data for deleted hosts / services be purged automatically? '/usr/local/nagios/share/perfdata/' still contains data from hosts we've deleted. Should we delete this manually, or is it something that should be done automatically (if so, how frequently?)?

Nagios XI doesn't delete RRD because sometimes Items get deleted and then readded, or a host gets renamed and the new files need to be migrated to the new hostname.

Gavin wrote:These RRD files are going to be a fair bit bigger than the 400k default. I was just wondering whether you've any data on how well Nagios will cope with larger RRD files? Is there an upper limit? We have rrdcached configured, so hopefully the I/O should be manageable.

These RRD files are going to be a lot bigger, XI shouldn't have trouble with them, but make sure you have enough disk space to manage this....

Gavin wrote: 3. When reading the documentation for creating plugins, I've noticed there's an option to specify 'critical'/'threshold' targets in the performance output: 'label'=value[UOM];[warn];[crit];. When I've written plugins previously, I've passed the thresholds to the plugin, and had the plugin calculate whether the thresholds were breached, and pass a return code back to Nagios. Is this not the preferred way of doing it? Does one method override the other?

This is correct, but if you include the threshold in the perfdata, Nagios can draw a line on the graph at the critical value...

Nagios Support Forum

Questions about Performance Data

Questions about Performance Data

Re: Questions about Performance Data