Nagios xi problems

This support forum board is for support questions relating to Nagios xi, our flagship commercial network monitoring solution.
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Nagios xi problems

Post by mguthrie »

That older install is a CentOS 5.6 install without anything special. The Central is a CentOS 6.0 install and the other test install i made earlier on in this post is a CentOS 5.7 install. I hope this information helps.
We're going to run some additional tests with CentOS 6 and 5.7 and see if we can recreate this.

Just to be safe, try reverting the default shell back to /bin/bash for both the root and nagios users. There are commands that exist in bash that don't necessarily exist in other shells.

Just to make sure we're clear on what issues you're experiencing where, can you summarize what problems (if any) you're experiencing on:
-CentOS 5.6 install
-CentOS5.7 install
-CentOS6 install
SDohmen
Posts: 240
Joined: Thu Jun 30, 2011 4:14 am

Re: Nagios xi problems

Post by SDohmen »

-CentOS 5.6 install -> Our pre-buy testing installation. No errors at all here.

-CentOS5.7 install -> Our bought testing installation. I installed this to see what the fullinstall would do. Here there are also errors with files missing. Nagios itself cant restart because the command file isn't found or it has no permissions to run. For the rest no errors come into mind.

-CentOS6 install -> Our bought main central installation. This is the most important one and also the one with the errors. Including errors are nagios not running when trying to restart it. According to you there is also an yum install of nagios present (which is kinda strange since i have not installed it). Also the hosts i added dont show up after trying to restart or sync. Other problems include the not being able to mail even though from the command line it does work. This is a minor issue at the moment though and not really important enough. Lastly there are missing files on this install.

The login shell has been reverted to bash aswell.
agriffin
Posts: 876
Joined: Mon May 09, 2011 9:36 am

Re: Nagios xi problems

Post by agriffin »

Unless there is some reason this isn't an option, I'm going to recommend that you reinstall a minimal CentOS 6 installation and then install Nagios xi again after that. At this point I think it's just going to be easier and faster to troubleshoot if you do.

When you're asked to select a group of packages to install, please select "Minimal" and not "Desktop Environment" or GNOME. After xi is up and running you can install whatever else you like.
SDohmen
Posts: 240
Joined: Thu Jun 30, 2011 4:14 am

Re: Nagios xi problems

Post by SDohmen »

I will try that then tomorrow.

[EDIT]
Since i was busy working anyway i decided to make a new VM to use as Central with another clean install. The specs i used are CentOS 6 on a clean xen vm with 25gb hdd (lvm manual config),4gb mem and 4 cpu's.

During the install i installed the minimal version as subcribed everywhere. After the install i only installed wget and openssh-server. Then i downloaded the package and ran the installer. Right away i got a error that the epel-release couldn't be found. After rerunning the installer it just continued which is a bit strange. I should have the install ready in about 15min and then i will check if the command files etc are present.

[EDIT2]
ok the install seemed to have gone without a hitch. login etc works fine (with a password without special chars). The reset went also fine but the restart part in the tools didn't work at all. I am getting the error "Restart failed - Nagios command file not found or no execute permissions".

When i look for the 4 files posted earlier in this thread i am missing :
command_file=/usr/local/nagios/var/rw/nagios.cmd
debug_file=/usr/local/nagios/var/nagios.debug

Since this is a complete new and clean install there must be something going wrong during the install itself. I really cant think of anything else here.

[EDIT3]
I attached the install.log to this post. If you see strange chars its prolly because i checked it in notepad++ to remove all dns entries.
You do not have the required permissions to view the files attached to this post.
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Nagios xi problems

Post by mguthrie »

Ok, this is good info. We'll see if we can get this recreated. Can you show us the permissions on the following directory:

Code: Select all

ll /usr/local/nagios/var
And see what happens when you run the following procedure. I know, it's a fresh install, but we'd like to see if that's causing an issue somewhere/
http://assets.nagios.com/downloads/nagi ... ssions.pdf
SDohmen
Posts: 240
Joined: Thu Jun 30, 2011 4:14 am

Re: Nagios xi problems

Post by SDohmen »

The permissions are as follows:

Code: Select all

[root@central nagiosxi]# ll /usr/local/nagios/var
total 124
drwxrwxr-x. 2 nagios nagios  4096 Sep 27 08:42 archives
-rw-rw-r--. 1 nagios nagios     0 Sep 27 08:53 host-perfdata
-rw-r--r--. 1 nagios nagios     6 Sep 27 08:45 nagios.lock
-rw-r--r--. 1 nagios root    1375 Sep 27 08:45 nagios.log
-rw-r--r--. 1 nagios nagios     6 Sep 27 08:45 ndo2db.lock
-rw-r--r--. 1 nagios nagios     0 Sep 27 08:45 ndomod.tmp
srwxr-xr-x. 1 nagios nagios     0 Sep 27 08:45 ndo.sock
-rw-r--r--. 1 nagios nagios 49712 Sep 27 08:52 npcd.log
-rw-r--r--. 1 nagios nagios 21892 Sep 27 08:45 objects.cache
-rw-r--r--. 1 nagios root       0 Sep 27 08:45 retention.dat
drwxrwsr-x. 2 nagios nagcmd  4096 Sep 27 08:45 rw
-rw-rw-r--. 1 nagios nagios     0 Sep 27 08:53 service-perfdata
drwxr-xr-x. 5 nagios nagios  4096 Sep 27 08:42 spool
drwxr-xr-x. 2 nagios nagios  4096 Sep 27 08:52 stats
-rw-rw-r--. 1 nagios nagios 14347 Sep 27 08:53 status.dat
[root@central nagiosxi]#
The global check permissions was strange. Part of that is displayed below:

Code: Select all

chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/css': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/css': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/css/hypermap.css': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/css/hypermap.css': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/css/base.css': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/css/base.css': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/Extras': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/Extras': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/Extras/excanvas.js': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/Extras/excanvas.js': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/jit.js': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/jit.js': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/example1.js': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/example1.js': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/hypermap.js': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/hypermap.js': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/jit-yc.js': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/js/jit-yc.js': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/dashlet.inc.php': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/dashlet.inc.php': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/index.php': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/index.php': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/hypermap_preview.png': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/hypermap_preview.png': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/ajax.inc.php': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/hypermap_replay/ajax.inc.php': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/tmp': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/tmp': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/tmp/prop-base': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/tmp/prop-base': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/tmp/text-base': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/tmp/text-base': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/tmp/props': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/tmp/props': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/entries': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/entries': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/all-wcprops': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/all-wcprops': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/prop-base': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/prop-base': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/text-base': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/text-base': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/text-base/bpa.inc.php.svn-base': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/text-base/bpa.inc.php.svn-base': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/props': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/.svn/props': No such file or directory
chmod: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/bpa.inc.php': No such file or directory
chown: cannot access `/usr/local/nagiosxi/html/includes/components/bpa/bpa.inc.php': No such file or directory
chmod +x "/usr/local/nagios/share/perfdata";
chmod +x "/usr/local/nagios/share/perfdata/.pnp-internal";
chmod +x "/usr/local/nagios/share/perfdata/localhost";
[root@central tmp]#

I would almost say that its missing something, although that is possible due too it being a total clean install with no extra components like the ones missing above. When i try to restart from the tools menu the same error is still present btw.

What i did notice before but havent really thought about was during the installation. At the start of the install it has to download all the packages needed to install nagiosxi. However it seems to me that this was always a different number then any other install. For example, i did 3 CentOS 6.0 installs so far of which 1 had 204 packages and 1 had 196, the other i cant remember exactly since i didn't really pay attention to that at the time. I hope this helps.
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Nagios xi problems

Post by mguthrie »

This missing files are attempting to fix a large list of supported components and wizards that users might have on their system. Having them missing isn't a problem, we just wanted to make sure we got everything with the fix.

We've been running install tests and so far haven't been able to recreate what you're experiencing. However, there have been some sporadic issues with some users using either CentOS 6 and 5.7, but we haven't figured out what is causing the problem yet.

Just go give a forewarning, we have our Nagios World Conference starting tomorrow, so our response time may be slowed a bit, but we'll do our best to continue working through your issue.
SDohmen
Posts: 240
Joined: Thu Jun 30, 2011 4:14 am

Re: Nagios xi problems

Post by SDohmen »

Yeah, i noticed that. For now we have a older Nagios Core running our distributed monitoring so we can survive a couple more days :). I wish you good luck there and hopefull a solution somewhere later this week :)

[EDIT]
If it helps, with any remote control software (teamviewer or some kind) you are able to remote control my laptop to see the errors and/or check for possible solutions.
mguthrie
Posts: 4380
Joined: Mon Jun 14, 2010 10:21 am

Re: Nagios xi problems

Post by mguthrie »

Yeah if you can hold out until next week, I think we may end up just doing that. We'll see if any other ideas come to mind but it we haven't got it figured out by next week we'll take a look with a remote session.
SDohmen
Posts: 240
Joined: Thu Jun 30, 2011 4:14 am

Re: Nagios xi problems

Post by SDohmen »

Do you have any progress yet?