I have a service with performance data which is run on multiple hosts.
Performance graph of the service on some host is updated fine with the correct performance data value.
But, for some other hosts, Performance graph is not getting updated at all.
For the host of which Performance graph is not getting updated, the value for the performance data at the bottom of the Performance graph is shown as 'non'. However, Critical and Warning thresholds included in the performance data are properly displayed at the bottom of Performance graph as shown below.
The performance data shown in Advanced tab of Service Status Detail web page is:
Performance Data: MPM_Combined=174.417721518987 MPM_Rx=162.721518987342;280;300;0 MPM_Tx=11.6962025316456;280;300;0
How can I have the performance data value updated on the Performance graph of the host?
Performance graph not getting updated
-
- Posts: 68
- Joined: Sat Mar 05, 2011 4:13 pm
Performance graph not getting updated
You do not have the required permissions to view the files attached to this post.
-
- Posts: 1040
- Joined: Wed May 11, 2011 8:54 am
Re: Performance graph not getting updated
cskang,
Can you manually run the plugin against the hosts that are not returning performance data? The NaNs on the graph lead me to believe that the plugin is bombing out when it is run. Remember to use the EXACT command line arguments that Nagios will use when it runs the plugin.
Can you manually run the plugin against the hosts that are not returning performance data? The NaNs on the graph lead me to believe that the plugin is bombing out when it is run. Remember to use the EXACT command line arguments that Nagios will use when it runs the plugin.
Nicholas Scott
Former Nagios employee
Former Nagios employee
-
- Posts: 68
- Joined: Sat Mar 05, 2011 4:13 pm
Re: Performance graph not getting updated
I manually ran the run the service by clicking "Schedule an immediate check" icon in Service Status Detail web page, and new performance data collected is reflected in Advanced Status Detail under Advanced tab. So, I can conclude that the plugin invoked by the service is running ok and collecting new data.
New Performance data look fine also.
In fact, the same service runs fine on other host as shown in the images below:
( I erased some private information such as ip address and other unique name in the images) However, the same service seems to be running ok when displaying the Advanced tab.
It shows the valid Peformance Data collected and reported: i.e.
MPM_Combined=185.066666666667
MPM_Rx=178.7 with Warning threshold of 280 and Critical threshold of 300
MPM_Tx=6.36666666666667 with Warning Threshold of 280 and Critical Threshold of 300
But, the Performance Chart does not display actual values for MPM_Combined, MPM_Rx, MPM_Tx.
What I found ironical is that Warning Threshold and Critical Threshold in MPM_Rx and MPM_Tx chart are shown correctly in the legend part though.
How could Performance Chart display MPM_Rx value of 178.7 as nan, and the Warning threshold of 280 and Critical threshold of 300 correctly?
(i.e.
MPM_Rx=178.7;280;3000
MPM_Tx=6.36666666666667;280;380;0
)
New Performance data look fine also.
In fact, the same service runs fine on other host as shown in the images below:
( I erased some private information such as ip address and other unique name in the images) However, the same service seems to be running ok when displaying the Advanced tab.
It shows the valid Peformance Data collected and reported: i.e.
MPM_Combined=185.066666666667
MPM_Rx=178.7 with Warning threshold of 280 and Critical threshold of 300
MPM_Tx=6.36666666666667 with Warning Threshold of 280 and Critical Threshold of 300
But, the Performance Chart does not display actual values for MPM_Combined, MPM_Rx, MPM_Tx.
What I found ironical is that Warning Threshold and Critical Threshold in MPM_Rx and MPM_Tx chart are shown correctly in the legend part though.
How could Performance Chart display MPM_Rx value of 178.7 as nan, and the Warning threshold of 280 and Critical threshold of 300 correctly?
(i.e.
MPM_Rx=178.7;280;3000
MPM_Tx=6.36666666666667;280;380;0
)
You do not have the required permissions to view the files attached to this post.
-
- Posts: 68
- Joined: Sat Mar 05, 2011 4:13 pm
Re: Performance graph not getting updated
This is resolved. Thanks for the support provided from Mike.
It was due to the change that I made to the custom plugin.
The performance data collection worked fine for new hosts added after the change to the plugin, but the hosts which had been running started to not display the performance chart because there was a mismatch in the number of performance data points.
It was due to the change that I made to the custom plugin.
The performance data collection worked fine for new hosts added after the change to the plugin, but the hosts which had been running started to not display the performance chart because there was a mismatch in the number of performance data points.
-
- Posts: 4380
- Joined: Mon Jun 14, 2010 10:21 am
Re: Performance graph not getting updated
For any users' future reference, here was the issue:
It looks like there could be an issue with this particular plugin returning performance data correctly. It looks like at one point it may have been returning 4 sets of performance data, and now it's only returning 3. To my knowledge, plugins must always return the same number of performance data sets, otherwise the rrd's won't update correctly. Is this a custom check that you're using?
Message in the XML file:
It looks like there could be an issue with this particular plugin returning performance data correctly. It looks like at one point it may have been returning 4 sets of performance data, and now it's only returning 3. To my knowledge, plugins must always return the same number of performance data sets, otherwise the rrd's won't update correctly. Is this a custom check that you're using?
Message in the XML file:
Perfdata Captured in the XML file./usr/local/nagios/share/perfdata/B11-BGW/mpm_for_915wib.rrd: expected 4 data source readings (got 3) from 1321994756
Also the same in the other file you sent.MPM_Combined=132.716666666667 MPM_Rx=129.683333333333;280;300;0 MPM_Tx=3.03333333333333;280;300;0
Right now you could probably fix this by just deleting the rrd's and the xml files, and then allowing PNP to rebuild them, but I wouldn't do that until you have the issue sorted out with the data sets./usr/local/nagios/share/perfdata/B11-BGW/mpm_for_915wib.rrd: expected 4 data source readings (got 3) from 1321994756
MPM_Combined=132.716666666667 MPM_Rx=129.683333333333;280;300;0 MPM_Tx=3.03333333333333;280;300;0