Hi,
We are having trouble with some of our CentOS servers not showing performance graphs. All Windows and most other Linux systems are fine.
Performance data is being shown in Nagios, just no graphs. This affects all services on those servers.
See attached screenshots and profile.
Thank you.
Performance graphs not Populating (not all servers)
-
- Posts: 190
- Joined: Wed Sep 26, 2012 11:55 am
Performance graphs not Populating (not all servers)
-
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: Performance graphs not Populating (not all servers)
Hello, @blariv. The problem with passive checks and RRD is that if passive results don't arrive often enough, rrdtool will ignore them. The default heartbeat is 8460 seconds, meaning the data isn't inserted if the gap between points is greater than about 2.35 hours. You can verify the heartbeat settings on the databases by running:
rrdtool dump /usr/local/nagios/share/perfdata/<hostname>/<servicedesc>.rrd | grep -i heart
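To make the heartbeat easier to read, the dump output can be converted from seconds to hours. This is a minimal sketch: the helper name heartbeat_report is hypothetical, and it assumes the dump prints one <minimal_heartbeat> element per line.

```shell
# Hypothetical helper: read `rrdtool dump` XML on stdin and print each
# data source's minimal_heartbeat in seconds and in hours.
heartbeat_report() {
    sed -n 's:.*<minimal_heartbeat>\([0-9][0-9]*\)</minimal_heartbeat>.*:\1:p' |
    while read -r seconds; do
        # awk handles the fractional division that shell arithmetic cannot
        awk -v s="$seconds" 'BEGIN { printf "%d s = %.2f h\n", s, s / 3600 }'
    done
}

# Example usage (same path placeholders as the command above):
# rrdtool dump /usr/local/nagios/share/perfdata/<hostname>/<servicedesc>.rrd | heartbeat_report
```

If passive results for a service arrive less often than the reported number of hours, the gap is the likely reason points are being dropped.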
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.
-
- Posts: 190
- Joined: Wed Sep 26, 2012 11:55 am
Re: Performance graphs not Populating (not all servers)
<minimal_heartbeat>8460</minimal_heartbeat>
-
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: Performance graphs not Populating (not all servers)
@blariv, how often does this service check get updated with the passive result? Is the interval larger or smaller than 2 hours?
-
- Posts: 190
- Joined: Wed Sep 26, 2012 11:55 am
Re: Performance graphs not Populating (not all servers)
Just wanted to provide an update. It seems that when sending the passive check directly from the server we are seeing this error, so nothing is actually getting back to the XI server.
ERROR: The NRDP Server said
Use of uninitialized value $status in string eq at /usr/local/nrdp/clients/nrds/nrds_updater.pl line 161, <CONFIG
ERROR: Could not connect to http://nagiosxi.xxx.com/nrdp/. Check your cfg file. at /usr/local/nrdp/clients/nrdp
-
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: Performance graphs not Populating (not all servers)
@blariv, Please post all files from the following folder:
/usr/local/nrdp/clients/nrds
Also, from the client-server, please run the nmap command against the Nagios XI server and post the result.
-
- Posts: 190
- Joined: Wed Sep 26, 2012 11:55 am
Re: Performance graphs not Populating (not all servers)
I seem to have fixed most of it.
Looks like my NRDS config on the hosts was pointing to http instead of https.
However, of the 4 checks we use, only 3 are writing perf data.
command[Check Load]=/usr/local/nagios/libexec/check_load -w 30,25,20 -c 50,40,35
command[Check Disk]=/usr/local/nagios/libexec/check_disk -w 10% -c 5% -X nfs -X cifs
command[Check Swap]=/usr/local/nagios/libexec/check_swap -w 20% -c 10%
command[Check Zombie Procs]=/usr/local/nagios/libexec/check_procs -w 5 -c 10 -s Z
Check Disk is not.
I grabbed the info from this command.
rrdtool dump /usr/local/nagios/share/perfdata/hostname.xxx.com/Check_Disk.rrd
This shows nothing written since 2018-03-15 14:00:00 EDT; the rest are showing 2019-04-10 08:00:00.
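The http versus https mismatch mentioned in this post can be spotted on other hosts by searching the client config for plain http:// URLs. This is a minimal sketch: the helper name plain_http_lines is hypothetical, and the nrds.cfg file name in the usage line is an assumption rather than something stated in this thread.

```shell
# Hypothetical helper: list lines in a config file that reference a
# plain http:// URL, skipping lines that are commented out with '#'.
plain_http_lines() {
    # grep -n prefixes each hit with "<line number>:", so the second grep
    # drops hits whose content starts with optional whitespace and '#'.
    grep -n 'http://' "$1" | grep -v '^[0-9]*:[[:space:]]*#'
}

# Example usage (config file name is an assumption):
# plain_http_lines /usr/local/nrdp/clients/nrds/nrds.cfg
```

Any line it prints is a candidate for switching to https, provided the XI server is actually serving NRDP over https.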
-
- Posts: 190
- Joined: Wed Sep 26, 2012 11:55 am
Re: Performance graphs not Populating (not all servers)
I noticed this in the /usr/local/nagios/share/perfdata/<hostname>/<servicedesc>.xml file for this host:
<RRD>
<RC>1</RC>
<TXT>/usr/local/nagios/share/perfdata/hostname.xxx.com/Check_Disk.rrd: found extra data on update argument: 0:125</TXT>
</RRD>
-
- Support Tech
- Posts: 3457
- Joined: Mon May 15, 2017 5:00 pm
Re: Performance graphs not Populating (not all servers)
@blariv, can you move the Check_Disk.rrd and Check_Disk.xml files to a backup location and then remove them from this folder:
/usr/local/nagios/share/perfdata/
After that, wait 15 minutes and check again to see if the graph started working.
-
- Posts: 190
- Joined: Wed Sep 26, 2012 11:55 am
Re: Performance graphs not Populating (not all servers)
That did it.
Is there an easy way to find that string in any .xml file so I can just recreate those?
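One possible way to do that search is with find and grep. This is a minimal sketch: the helper name find_broken_perfdata_xml is hypothetical, and it assumes the error text appears in the .xml files exactly as quoted earlier in this thread.

```shell
# Hypothetical helper: print every .xml file under a perfdata directory
# that contains the rrdtool update error quoted earlier in the thread.
find_broken_perfdata_xml() {
    # -F treats the pattern as a fixed string; -l prints only file names.
    find "$1" -type f -name '*.xml' \
        -exec grep -lF 'found extra data on update argument' {} +
}

# Example usage:
# find_broken_perfdata_xml /usr/local/nagios/share/perfdata
```

Each path it prints has a matching .rrd file next to it. Backing up both files before removing them, as was done for Check_Disk, keeps the old data recoverable.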