-
- Posts: 54
- Joined: Mon Aug 03, 2020 10:37 am
No performance graph data for any hosts or services
The issue is happening on 3 of 25 production servers.
I have performed troubleshooting per
https://support.nagios.com/kb/article/n ... ems-9.html
but with no success.
There is no data in the perfdata.log, and no RRD files are being created.
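For anyone who wants to confirm the same two symptoms from a shell, here is a minimal sketch. The function names are hypothetical, and the RRD directory in the commented example (/usr/local/nagios/share/perfdata) is an assumption about a typical Nagios XI install, so verify it on your own server. The perfdata.log path is the one mentioned in this thread.

```shell
#!/usr/bin/env bash
# Sketch only: helper names are hypothetical, not part of Nagios XI.

# Print "missing", "empty", or "has-data" for a log file.
perfdata_log_state() {
    local log_file="$1"
    if [ ! -e "$log_file" ]; then
        echo "missing"
    elif [ ! -s "$log_file" ]; then
        echo "empty"
    else
        echo "has-data"
    fi
}

# Print the number of .rrd files found anywhere under a directory.
# A missing directory is reported as 0 files.
count_rrd_files() {
    local rrd_dir="$1"
    find "$rrd_dir" -type f -name '*.rrd' 2>/dev/null | wc -l | tr -d ' '
}

# Example calls (paths are assumptions, adjust to your install):
#   perfdata_log_state /usr/local/nagios/var/perfdata.log
#   count_rrd_files /usr/local/nagios/share/perfdata
```

Running both on a healthy server and on an affected one should show quickly whether the problem is in writing perfdata or in turning it into RRD files.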
-
- Posts: 1288
- Joined: Tue Jun 01, 2021 1:27 pm
Re: No performance graph data for any hosts or services
Hello @WillH
Thanks for reaching out. We want to see what is getting written to '/usr/local/nagios/var/perfdata.log'.
Please send us the System Profile so we can see what is going on.
To send us your System Profile:
- Log in to the Nagios XI GUI using a web browser.
- Click "Admin" > "System Profile" in the menu.
- Click the "Download Profile" button.
- Save the profile.zip file and send it using private message.
Perry
-
- Posts: 54
- Joined: Mon Aug 03, 2020 10:37 am
Re: No performance graph data for any hosts or services
Thanks Perry, I sent you a direct message with profile.zip.
-Will
-
- Posts: 54
- Joined: Mon Aug 03, 2020 10:37 am
Re: No performance graph data for any hosts or services
Update:
I manually checked everything that the Ramdisk script from Nagios should do, and found some errors.
So I now have host performance data, but not service performance data.
Progress, but not completion.
-
- Posts: 1288
- Joined: Tue Jun 01, 2021 1:27 pm
Re: No performance graph data for any hosts or services
Hello @WillH
Thanks for following up and sending the System Profile. After review, we see that the npcd service is chugging along, which is good. We do see that perfdata from 'client 10.107.158.127' is failing due to undefined offsets.
We want to have you go through and create a Ramdisk for additional swap, found here:
https://assets.nagios.com/downloads/nagiosxi/docs/Utilizing_A_RAM_Disk_In_NagiosXI.pdf
Also, let's check to make sure that the Nagios user account is active:
Code: Select all
chage -l nagios
If not, then the command to enable it is:
Code: Select all
chage -I -1 -m 0 -M 99999 -E -1 nagios
Let's also have you verify the permissions and ownership to make sure that something did not change.
Check the npcd.log, found at /usr/local/nagios/var/npcd.log, and increase the load_threshold if needed.
After you make the updates and bounce the services, please send along the npcd.log and the perfdata.log so we can see what is going on.
Code: Select all
tar -czf /tmp/perf_log.tar.gz /usr/local/nagios/var/npcd.log /usr/local/nagios/var/perfdata.log
Thanks,
Perry
-
- Posts: 54
- Joined: Mon Aug 03, 2020 10:37 am
Re: No performance graph data for any hosts or services
Perry,
I was able to fully resolve this. We re-ran the RRD cache script and manually made all of the ramdisk changes on one server. For some reason, the ramdisk script failed to make all of the required file changes properly on these three servers.
We then took the known-good config files from the ramdisk procedure, re-ran the RRD cache script on the other 2 servers, and copied the files to them. After a restart of all Nagios-related services, all is well. We're going to watch them overnight to be sure we're all good.
Next step: Set up a service check or two to monitor the RRD behavior in case this pops up again.
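One possible shape for such a check is sketched below. It assumes that a stall shows up as no .rrd file being modified for a while. The function name, the 15-minute window, and the directory in the commented example are all assumptions for illustration, not anything shipped with Nagios XI. The return codes follow the usual plugin convention (0 OK, 2 CRITICAL, 3 UNKNOWN).

```shell
#!/usr/bin/env bash
# Sketch only: check_rrd_freshness is a hypothetical helper.

# CRITICAL when no .rrd file under rrd_dir was modified within the
# last max_age_min minutes; OK otherwise; UNKNOWN if the directory is missing.
check_rrd_freshness() {
    local rrd_dir="$1" max_age_min="$2" fresh
    if [ ! -d "$rrd_dir" ]; then
        echo "UNKNOWN - $rrd_dir is not a directory"
        return 3
    fi
    # find -mmin -N matches files modified less than N minutes ago.
    fresh=$(find "$rrd_dir" -type f -name '*.rrd' -mmin "-$max_age_min" | wc -l | tr -d ' ')
    if [ "$fresh" -gt 0 ]; then
        echo "OK - $fresh RRD file(s) updated in the last $max_age_min min | fresh_rrds=$fresh"
        return 0
    fi
    echo "CRITICAL - no RRD file updated in the last $max_age_min min | fresh_rrds=0"
    return 2
}

# Example call (directory and window are assumptions):
#   check_rrd_freshness /usr/local/nagios/share/perfdata 15
```

Saved as a plugin script, it would end with a call to the function followed by `exit $?`, so that the scheduler sees the status code.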
-
- Posts: 1288
- Joined: Tue Jun 01, 2021 1:27 pm
Re: No performance graph data for any hosts or services
Hello @WillH
Thanks for following up. It sounds like you have figured things out. Please let us know if you want us to go ahead and close this.
To follow up on:
"Next step: Set up a service check or two to monitor the RRD behavior in case this pops up again."
There are two options. First, you could monitor the Nagios XI Server itself using its wizard (found in the Configuration Wizard). Second, you could watch the number of files in the 'rrd' spool directory and alert if the number increases above a certain threshold (around 100 or so). There are a number of plugins for this on the Nagios Exchange.
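As a rough illustration of the second option, the sketch below counts the files sitting directly in a spool directory and compares the count with warning and critical thresholds. The function name is hypothetical, the critical value of 200 is an arbitrary example (only the "around 100" figure comes from this post), and the spool path in the commented example is an assumption that should be checked against your install.

```shell
#!/usr/bin/env bash
# Sketch only: check_spool_count is a hypothetical helper, not a shipped plugin.

# Compare the number of files directly inside spool_dir with two thresholds.
# Returns 0 (OK), 1 (WARNING), 2 (CRITICAL), or 3 (UNKNOWN).
check_spool_count() {
    local spool_dir="$1" warn="$2" crit="$3" count
    if [ ! -d "$spool_dir" ]; then
        echo "UNKNOWN - $spool_dir is not a directory"
        return 3
    fi
    count=$(find "$spool_dir" -maxdepth 1 -type f | wc -l | tr -d ' ')
    if [ "$count" -ge "$crit" ]; then
        echo "CRITICAL - $count files in $spool_dir | spool_files=$count;$warn;$crit"
        return 2
    elif [ "$count" -ge "$warn" ]; then
        echo "WARNING - $count files in $spool_dir | spool_files=$count;$warn;$crit"
        return 1
    fi
    echo "OK - $count files in $spool_dir | spool_files=$count;$warn;$crit"
    return 0
}

# Example call (path and critical threshold are assumptions):
#   check_spool_count /usr/local/nagios/var/spool/perfdata 100 200
```

A count that keeps growing suggests perfdata files are being written faster than they are processed, which is the condition this option is meant to catch.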
Thanks,
Perry
-
- Posts: 54
- Joined: Mon Aug 03, 2020 10:37 am
Re: No performance graph data for any hosts or services
Perry,
Thanks for the reply. This is definitely close-worthy.
