Nagios XI Server Configuration Details Required

This support forum board is for support questions relating to Nagios XI, our flagship commercial network monitoring solution.
RIDS_I2MP
Posts: 751
Joined: Thu Mar 13, 2014 9:25 am

Nagios XI Server Configuration Details Required

Post by RIDS_I2MP »

Hi Team,

We have the following server-level configuration for our Nagios XI servers, each with the latest version installed.

Nagios Server Name            IP Address       CPU   Memory
Nagios RIDS2 Production       10.144.223.155   8     16 GB
Nagios RIDS Production        10.144.223.156   8     16 GB
Nagios RIDS2 Non Production   10.147.108.45    8     16 GB
Nagios RIDS Non Production    10.147.108.46    8     16 GB

Could you please tell us how many services and servers can be monitored on this hardware without affecting Nagios application functionality or performance?

Thanks
Nitin Parate
Thanks & Regards,
I2MP Team.
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Nagios XI Server Configuration Details Required

Post by benjaminsmith »

Hi,

This is a difficult question to answer since there are so many variables that can impact monitoring performance. In my experience the key variables that will impact server performance are:

1. Overall Check Load (# of hosts + service checks)
2. Check interval (how often is the server scheduling checks)
3. System Resources (Memory, CPU, Disk Speed)
4. Passive vs Active Checks
5. Performance Modifications
6. Plugins (some checks like SNMP require more resources than others)
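
As a rough illustration of how the first two factors combine (a back-of-the-envelope sketch, not an official Nagios sizing formula), the average scheduling rate is the number of active checks divided by the check interval. The function name and the sample figures below are assumptions chosen only for illustration.

```python
def checks_per_minute(num_checks, interval_minutes):
    """Average number of active checks the scheduler must run each minute."""
    if interval_minutes <= 0:
        raise ValueError("interval_minutes must be positive")
    return num_checks / interval_minutes


# Illustrative figures only: 5,000 checks at two different intervals.
for interval in (5, 1):
    rate = checks_per_minute(5000, interval)
    print(f"{interval}-minute interval: {rate:.0f} checks per minute")
```

With the same number of checks, dropping the interval from 5 minutes to 1 minute multiplies the work per minute by five, which is why the interval matters as much as the raw check count.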

A few questions:
1. Are you having performance trouble right now? If so, can you describe the problem?
2. Have you made any performance modifications (e.g. offloaded DB, distributed setup, etc.)?
3. What is the anticipated growth of your network or how many more additional checks do you plan to add?

I'd like to get a system profile from each production server to take a closer look at the stats and any errors in the logs and then I can provide some recommendations.

Thanks,
Benjamin

### Download System Profile

To send us your system profile:
1. Log in to the Nagios XI GUI using a web browser.
2. Click the "Admin" > "System Profile" menu.
3. Click the "Download Profile" button.
As of May 25th, 2018, all communications with Nagios Enterprises and its employees are covered under our new Privacy Policy.

Be sure to check out our Knowledgebase for helpful articles and solutions!
RIDS_I2MP
Posts: 751
Joined: Thu Mar 13, 2014 9:25 am

Re: Nagios XI Server Configuration Details Required

Post by RIDS_I2MP »

Hi
Thanks for your updates.

We are not facing any issues at the moment. We are planning to add around 700-800 servers with 1000+ services, in addition to the existing 900 servers and 8400 services that are monitored in our Nagios XI environment.

We use the standard check interval for all CPU, memory, and other service checks, server monitoring, SNMP devices, etc.
I have attached the system profile for your reference.
We just want to cross-check whether loading the environment with another 700-800 servers and 1000+ services will have an impact.

There must be some standard thresholds defined by Nagios, so if you could share those details as well, that would be great.

Thanks
Nitin Parate
Thanks & Regards,
I2MP Team.
RIDS_I2MP
Posts: 751
Joined: Thu Mar 13, 2014 9:25 am

Re: Nagios XI Server Configuration Details Required

Post by RIDS_I2MP »

Attaching Profile

Moderator's Note: The profile has been shared with the support team but has been removed from the public forum.
Thanks & Regards,
I2MP Team.
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Nagios XI Server Configuration Details Required

Post by benjaminsmith »

Hi RIDS_I2MP,

Thanks for sending over the profile. The logs look good, but the top command is showing a higher CPU load than is normal for a system of this size with 8 CPUs.
top - 16:48:05 up 101 days, 8:35, 0 users, load average: 22.14, 24.25, 24.55
The main reason for this is almost all the MRTG service checks (which require more CPU than other plugins) have a check interval of 1 minute.

```
check_command	check_xi_service_mrtgtraf!<redacted_436404224.rrd!20.00,20.00!32.00,32.00!G
check_interval	1.000000
```

Please use the Bulk Mods Tool (Configure > Tools > Bulk Mods Tool) to adjust the check interval for these services back to the default of 5 minutes. Then let the system run for a while and run the top command once more.
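
To put the load average reported by top into perspective, here is a small sketch that divides each value by the CPU count. The helper name is hypothetical, and the idea that a sustained ratio well above 1 per CPU indicates an overloaded system is a common rule of thumb, not a Nagios-defined threshold.

```python
import re


def load_per_cpu(top_line, cpu_count):
    """Parse the load averages from a `top` summary line and divide by CPU count."""
    match = re.search(r"load average:\s*([\d.]+),\s*([\d.]+),\s*([\d.]+)", top_line)
    if match is None:
        raise ValueError("no load average found in line")
    return [float(value) / cpu_count for value in match.groups()]


# Summary line from the system profile; the server has 8 CPUs.
top_line = "top - 16:48:05 up 101 days, 8:35, 0 users, load average: 22.14, 24.25, 24.55"
ratios = load_per_cpu(top_line, cpu_count=8)
print(ratios)
```

All three averages (1, 5, and 15 minutes) come out at roughly 3 per CPU, so the load is sustained rather than a short spike. Running the same calculation after changing the intervals gives a simple before/after comparison.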

Best Regards,
Benjamin
RIDS_I2MP
Posts: 751
Joined: Thu Mar 13, 2014 9:25 am

Re: Nagios XI Server Configuration Details Required

Post by RIDS_I2MP »

Hi Team,

Could you please give more details about the MRTG service checks? Which services are being referred to here?

Also, I would like to return to my original question: how much load (number of services/hosts to be monitored) should we ideally allow so that Nagios XI works smoothly?

Thanks,
Nitin Parate
Thanks & Regards,
I2MP Team.
benjaminsmith
Posts: 5324
Joined: Wed Aug 22, 2018 4:39 pm
Location: saint paul

Re: Nagios XI Server Configuration Details Required

Post by benjaminsmith »

Hi Nitin,

Since it's kind of hard to look this up in the CCM, here's a quick script that will print out all the services with a check interval of 1. Please update it with your API key and the IP address of your Nagios XI system.

```python
import requests

# Update the base URL and API key with your server information
api = {
    'baseurl': 'http://192.168.23.113/nagiosxi',
    'key': '<your_api_key>',
    'config_hosts': '/api/v1/config/host',
    'config_services': '/api/v1/config/service',
}


def get_api_data(url, endpoint, key):
    """Request an endpoint from the Nagios XI API and return the response."""
    # verify=False skips TLS certificate checks (only relevant for https URLs)
    data = requests.get(url + endpoint, params={'apikey': key}, verify=False)
    data.raise_for_status()  # stop early on HTTP errors
    return data


r = get_api_data(api['baseurl'], api['config_services'], api['key'])
services = r.json()

# Print out list of services with check interval set to 1 minute
print("Services with Check Interval Set to 1 Minute")
for service in services:
    # .get() avoids a KeyError if a service defines no check_interval of its own
    if service.get('check_interval') == '1':
        print('Host:', service.get('host_name'), 'Service:', service.get('service_description'))
```

Alternatively, you can grep the following file to find this data.

```
/usr/local/nagios/var/objects.cache
```
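
If a plain grep is awkward because the host name and interval sit on different lines, the file above can also be scanned with a short script. This is only a sketch: it assumes the file consists of `define service { ... }` blocks with one whitespace-separated directive and value per line (consistent with the snippet quoted earlier in the thread), and the function name and sample host names are invented for illustration.

```python
import re

SERVICE_START = re.compile(r"define\s+service\s*\{")


def services_with_interval(cache_text, interval=1.0):
    """Return (host_name, service_description) pairs whose check_interval matches."""
    matches = []
    block = None  # directives of the service block being read, or None
    for raw_line in cache_text.splitlines():
        line = raw_line.strip()
        if SERVICE_START.fullmatch(line):
            block = {}
        elif line == "}":
            if block is not None:
                try:
                    value = float(block.get("check_interval", "nan"))
                except ValueError:
                    value = float("nan")  # non-numeric value never matches
                if value == interval:
                    matches.append((block.get("host_name"), block.get("service_description")))
            block = None
        elif block is not None and line:
            parts = line.split(None, 1)  # directive name, then the rest of the line
            block[parts[0]] = parts[1] if len(parts) > 1 else ""
    return matches


# Invented sample data in the assumed layout.
sample = """define service {
\thost_name\trouter01
\tservice_description\tPort 1 Bandwidth
\tcheck_interval\t1.000000
\t}

define service {
\thost_name\tweb01
\tservice_description\tCPU Usage
\tcheck_interval\t5.000000
\t}
"""
print(services_with_interval(sample))  # [('router01', 'Port 1 Bandwidth')]
```

To run it against the real file, read `/usr/local/nagios/var/objects.cache` into a string on the Nagios XI server and pass it to `services_with_interval`.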

Going back to the original question: given the higher load, I would not recommend adding many more services without first adjusting the configuration/settings or adding more CPUs.

The following guides are very helpful for tuning Nagios XI to handle more host and service checks.
https://assets.nagios.com/downloads/nag ... ios-xi.pdf
https://assets.nagios.com/downloads/nag ... zation.pdf

Regards,
Benjamin