Folks,

I have a two-part mystery. The first part is that my new server is restarting on its own. As far as I can tell, it has done so twice in the month it has been running. I found the point where it restarted in /var/log/messages:

Apr 16 21:45:03 fcs-server dhcpd: DHCPINFORM from 192.168.0.186 via eth0
Apr 16 21:45:03 fcs-server dhcpd: DHCPACK to 192.168.0.186
Apr 16 21:45:08 fcs-server syslog-ng[3632]: STATS: dropped 0
Apr 16 21:51:03 fcs-server syslog-ng[3663]: syslog-ng version 1.6.8 starting
Apr 16 21:51:03 fcs-server auditd[3711]: Init complete, auditd 1.2.9 listening for events
Apr 16 21:51:04 fcs-server syslog-ng[3663]: Changing permissions on special file /dev/xconsole
Apr 16 21:51:04 fcs-server syslog-ng[3663]: Changing permissions on special file /dev/tty10

It seems the server was going along, handling DHCP requests, then poof, it disappeared for about 6 minutes (21:45:08 to 21:51:03), then came back up and went through a normal boot, including the NTP time sync. That's not good!
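A silent stretch like this is easy to miss by eye in a long log. Here is a minimal sketch of how one might scan for such gaps, assuming Python 3 and the classic 15-character syslog timestamp. The function name `find_gaps`, the 2-minute threshold, and the year 2009 (syslog lines carry no year) are my own assumptions, not anything the server provides.

```python
from datetime import datetime, timedelta

def find_gaps(lines, threshold=timedelta(minutes=2), year=2009):
    """Return (before, after, gap) for each silence longer than threshold."""
    gaps = []
    previous = None
    for line in lines:
        try:
            # Classic syslog lines begin with "Mon DD HH:MM:SS" (15 characters).
            stamp = datetime.strptime(f"{year} {line[:15]}", "%Y %b %d %H:%M:%S")
        except ValueError:
            continue  # skip lines that carry no timestamp
        if previous is not None and stamp - previous > threshold:
            gaps.append((previous, stamp, stamp - previous))
        previous = stamp
    return gaps

# Three lines taken from the excerpt above.
sample = [
    "Apr 16 21:45:03 fcs-server dhcpd: DHCPACK to 192.168.0.186",
    "Apr 16 21:45:08 fcs-server syslog-ng[3632]: STATS: dropped 0",
    "Apr 16 21:51:03 fcs-server syslog-ng[3663]: syslog-ng version 1.6.8 starting",
]

for before, after, gap in find_gaps(sample):
    # prints: silent from 21:45:08 to 21:51:03 (0:05:55)
    print(f"silent from {before:%H:%M:%S} to {after:%H:%M:%S} ({gap})")
```

To run it against the real file, one could pass `open("/var/log/messages")` instead of `sample`. A gap followed by a "syslog-ng ... starting" line would point to a restart rather than just a quiet period.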

Because there is nothing in the log leading up to the restart, I suspect hardware. It is a new Sun X4140, so I am going to find out whether the motherboard keeps its own log of events and errors. (I have to wait until everyone is gone.)

The second part is that DNS did not start after this reboot. I am using dynamic DNS/DHCP, and DHCP did start. I have gone over the log and found the following right in the middle of NSS loading:

error: novell-named failed to start, check if novell-xregd is running!

A smoking gun, but what does it mean? The next morning there was still no DNS, so I rebooted, and now everything is fine.

Here is the log from the start of that boot up to the point where named failed. Can anyone with more experience than I have see what wasn't working? It might give me a hint as to what was going on last night that isn't going on now!

Apr 16 21:51:03 fcs-server syslog-ng[3663]: syslog-ng version 1.6.8 starting
Apr 16 21:51:03 fcs-server auditd[3711]: Init complete, auditd 1.2.9 listening for events
Apr 16 21:51:04 fcs-server syslog-ng[3663]: Changing permissions on special file /dev/xconsole
Apr 16 21:51:04 fcs-server syslog-ng[3663]: Changing permissions on special file /dev/tty10
Apr 16 21:51:04 fcs-server ntpdate[3860]: can't find host pool.ntp.org
Apr 16 21:51:04 fcs-server ntpdate[3860]: no servers can be used, exiting
Apr 16 21:51:04 fcs-server ntpd[3882]: ntpd 4.2.4p3@1.1502-o Fri Mar 13 10:55:33 UTC 2009 (1)
Apr 16 21:51:04 fcs-server ntpd[3895]: precision = 1.000 usec
Apr 16 21:51:04 fcs-server ntpd[3895]: ntp_io: estimated max descriptors: 1024, initial socket boundary: 16
Apr 16 21:51:04 fcs-server ntpd[3895]: Listening on interface #0 wildcard, 0.0.0.0#123 Disabled
Apr 16 21:51:04 fcs-server ntpd[3895]: Listening on interface #1 wildcard, ::#123 Disabled
Apr 16 21:51:04 fcs-server ntpd[3895]: Listening on interface #2 lo, ::1#123 Enabled
Apr 16 21:51:04 fcs-server ntpd[3895]: Listening on interface #3 eth0, fe80::214:4fff:feee:160#123 Enabled
Apr 16 21:51:04 fcs-server ntpd[3895]: Listening on interface #4 lo, 127.0.0.1#123 Enabled
Apr 16 21:51:04 fcs-server ntpd[3895]: Listening on interface #5 eth0, 192.168.0.3#123 Enabled
Apr 16 21:51:04 fcs-server ntpd[3895]: kernel time sync status 0040
Apr 16 21:51:04 fcs-server ntpd[3895]: frequency initialized 11.756 PPM from /var/lib/ntp/drift/ntp.drift
Apr 16 21:51:05 fcs-server id: nds_nss_GetGrpEnt: failed to init socket, status = -1
Apr 16 21:51:05 fcs-server id: nds_nss_GetGrpEnt: failed to init socket, status = -1
Apr 16 21:51:05 fcs-server rcpowersaved: enter 'powernow_k8' into CPUFREQD_MODULE in /etc/powersave/cpufreq.
Apr 16 21:51:05 fcs-server rcpowersaved: this will speed up starting powersaved and avoid unnecessary warnings in syslog.
Apr 16 21:51:05 fcs-server rcpowersaved: s2ram does not know your machine. See 's2ram -i' for details. (127)
Apr 16 21:51:05 fcs-server rcpowersaved: Use SUSPEND2RAM_FORCE=yes to override this detection.
Apr 16 21:51:05 fcs-server sshd[4059]: Server listening on :: port 22.
Apr 16 21:51:06 fcs-server [powersave]: WARNING (setOndemandConfig:211) Set sampling_rate (333000) for ondemand governor is lower than minimum (440000) which will be used now.
Apr 16 21:51:06 fcs-server [powersave]: WARNING (setOndemandConfig:211) Set sampling_rate (333000) for ondemand governor is lower than minimum (440000) which will be used now.
Apr 16 21:51:07 fcs-server /usr/sbin/cron[4209]: (CRON) STARTUP (V5.0)
Apr 16 21:51:08 fcs-server kernel: klogd 1.4.1, log source = /proc/kmsg started.
Apr 16 21:51:08 fcs-server kernel: Vendor: TSSTcorp Model: CD/DVDW TS-T632A Rev: SR03
Apr 16 21:51:08 fcs-server kernel: Type: CD-ROM ANSI SCSI revision: 00
Apr 16 21:51:08 fcs-server kernel: 13:0:0:0: Attached scsi generic sg10 type 5
Apr 16 21:51:08 fcs-server kernel: usb-storage: device scan complete
Apr 16 21:51:08 fcs-server kernel: sr0: scsi3-mmc drive: 24x/24x writer dvd-ram cd/rw xa/form2 cdda tray
Apr 16 21:51:08 fcs-server kernel: Uniform CD-ROM driver Revision: 3.20
Apr 16 21:51:08 fcs-server kernel: sr 13:0:0:0: Attached scsi CD-ROM sr0
Apr 16 21:51:08 fcs-server kernel: fuse init (API version 7.8)
Apr 16 21:51:08 fcs-server kernel: AppArmor: AppArmor initialized
Apr 16 21:51:08 fcs-server kernel: audit(1239933058.432:2): info="AppArmor initialized" pid=2611
Apr 16 21:51:08 fcs-server kernel: ACPI: Power Button (FF) [PWRF]
Apr 16 21:51:08 fcs-server kernel: ACPI: Power Button (CM) [PWRB]
Apr 16 21:51:08 fcs-server kernel: No dock devices found.
Apr 16 21:51:08 fcs-server kernel: NET: Registered protocol family 10
Apr 16 21:51:08 fcs-server kernel: lo: Disabled Privacy Extensions
Apr 16 21:51:08 fcs-server kernel: IPv6 over IPv4 tunneling driver
Apr 16 21:51:08 fcs-server kernel: audit(1239933063.408:3): audit_pid=3711 old=0 by auid=4294967295
Apr 16 21:51:08 fcs-server kernel: powernow-k8: Found 2 Dual-Core AMD Opteron(tm) Processor 2222 processors (4 cpu cores) (version 2.20.00)
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 0 : fid 0x16 (3000 MHz), vid 0xa
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 1 : fid 0x14 (2800 MHz), vid 0xc
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 2 : fid 0x12 (2600 MHz), vid 0xe
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 3 : fid 0x10 (2400 MHz), vid 0x10
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 4 : fid 0xe (2200 MHz), vid 0x10
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 5 : fid 0xc (2000 MHz), vid 0x10
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 6 : fid 0xa (1800 MHz), vid 0x10
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 7 : fid 0x2 (1000 MHz), vid 0x12
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 0 : fid 0x16 (3000 MHz), vid 0xa
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 1 : fid 0x14 (2800 MHz), vid 0xc
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 2 : fid 0x12 (2600 MHz), vid 0xe
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 3 : fid 0x10 (2400 MHz), vid 0x10
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 4 : fid 0xe (2200 MHz), vid 0x10
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 5 : fid 0xc (2000 MHz), vid 0x10
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 6 : fid 0xa (1800 MHz), vid 0x10
Apr 16 21:51:08 fcs-server kernel: powernow-k8: 7 : fid 0x2 (1000 MHz), vid 0x12
Apr 16 21:51:11 fcs-server zmd: NetworkManagerModule (WARN): Failed to connect to NetworkManager
Apr 16 21:51:11 fcs-server kernel: eth0: no IPv6 routers present
Apr 16 21:51:11 fcs-server id: nds_nss_GetGrpEnt: failed to init socket, status = -1
Apr 16 21:51:11 fcs-server id: nds_nss_GetGrpEnt: failed to init socket, status = -1
Apr 16 21:51:14 fcs-server dhcpd: Internet Systems Consortium DHCP Server V3.0.3
Apr 16 21:51:14 fcs-server dhcpd: Copyright 2004-2005 Internet Systems Consortium.
Apr 16 21:51:14 fcs-server dhcpd: All rights reserved.
Apr 16 21:51:14 fcs-server dhcpd: For info, please visit http://www.isc.org/sw/dhcp/
Apr 16 21:51:14 fcs-server namcd: SIGTTOU caught
Apr 16 21:51:14 fcs-server namcd: SIGTTIN caught
Apr 16 21:51:14 fcs-server namcd: SIGTSTP caught
Apr 16 21:51:14 fcs-server /usr/sbin/namcd[4789]: Starting namcd..
Apr 16 21:51:14 fcs-server /usr/sbin/namcd[4789]: namcd populating the user hash tables
Apr 16 21:51:14 fcs-server /usr/sbin/namcd[4789]: insertGidListIntoUserHash invoked for novlxsrvd
Apr 16 21:51:14 fcs-server /usr/sbin/namcd[4789]: insertGidListIntoUserHash invoked for novlxregd
Apr 16 21:51:14 fcs-server /usr/sbin/namcd[4789]: insertGidListIntoUserHash invoked for admin
Apr 16 21:51:14 fcs-server /usr/sbin/namcd[4789]: insertGidListIntoUserHash invoked for wwwrun
Apr 16 21:51:14 fcs-server /usr/sbin/namcd[4789]: namcd populating group hash tables
Apr 16 21:51:14 fcs-server /usr/sbin/namcd[4789]: namcd Populated hash tables
Apr 16 21:51:14 fcs-server /usr/sbin/namcd[4789]: Created all the threads
Apr 16 21:51:15 fcs-server dhcpd: Internet Systems Consortium DHCP Server V3.0.3
Apr 16 21:51:15 fcs-server dhcpd: Copyright 2004-2005 Internet Systems Consortium.
Apr 16 21:51:15 fcs-server dhcpd: All rights reserved.
Apr 16 21:51:15 fcs-server dhcpd: For info, please visit http://www.isc.org/sw/dhcp/
Apr 16 21:51:15 fcs-server dhcpd: Wrote 187 leases to leases file.
Apr 16 21:51:15 fcs-server kernel: NET: Registered protocol family 17
Apr 16 21:51:15 fcs-server dhcpd: Listening on LPF/eth0/00:14:4f:ee:01:60/192.168.0/24
Apr 16 21:51:15 fcs-server dhcpd: Sending on LPF/eth0/00:14:4f:ee:01:60/192.168.0/24
Apr 16 21:51:15 fcs-server dhcpd: Sending on Socket/fallback/fallback-net
Apr 16 21:51:15 fcs-server nss: Starting Novell Storage Services (NSS)
error: novell-named failed to start, check if novell-xregd is running!
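To narrow down which lines in the boot log above deserve a closer look, here is a rough sketch, assuming Python 3. It flags lines that contain a failure keyword and lines that lack the usual syslog timestamp (like the novell-named error). The keyword list and the name `suspicious_lines` are my own guesses, so it will miss anything worded differently.

```python
import re

# "Mon DD HH:MM:SS " at the start of a classic syslog line.
SYSLOG_PREFIX = re.compile(r"^[A-Z][a-z]{2} [ \d]\d \d{2}:\d{2}:\d{2} ")

# Hypothetical keyword list; adjust to taste.
KEYWORDS = ("fail", "error", "can't find", "no servers")

def suspicious_lines(lines):
    """Return (line_number, text) for lines that look like trouble."""
    hits = []
    for number, line in enumerate(lines, start=1):
        text = line.strip()
        if not text:
            continue  # ignore blank lines
        lowered = text.lower()
        untimestamped = SYSLOG_PREFIX.match(text) is None
        if untimestamped or any(word in lowered for word in KEYWORDS):
            hits.append((number, text))
    return hits

# Four lines taken from the log above.
sample = [
    "Apr 16 21:51:04 fcs-server ntpdate[3860]: can't find host pool.ntp.org",
    "Apr 16 21:51:05 fcs-server id: nds_nss_GetGrpEnt: failed to init socket, status = -1",
    "Apr 16 21:51:15 fcs-server nss: Starting Novell Storage Services (NSS)",
    "error: novell-named failed to start, check if novell-xregd is running!",
]

for number, text in suspicious_lines(sample):
    print(f"{number}: {text}")
```

On the four sample lines this flags the ntpdate lookup failure, the nds_nss socket failure, and the novell-named error, and skips the ordinary NSS startup line.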

Thanks in advance,
Craig Lyndes
Fairfield Center School