Hello,

NW6.0.5
ZDM6.5.2
AS9.01.sp1
PCI Card: ADPT160m.HAM -> TANDBERG MLR3
onboard : adpu320.HAM (not loaded)
SRCRX.HAM: 2x ICP Vortex RAID (1x RAID1=sys, 1x RAID5=data)

NSS: sys volume 18 Gig, data volume 0,9TB

How to figure out *what* process causes this work to do problem?
- When entering debugger in that 'hang' state, it's always in real
mode. Does this help to better isolate the cause?
- What processes make the server to switch to real mode? It does *NOT*
make any difference whether dosfat.nss is loaded or not (CDBE)
- a scandisk of C: is clean. (dos side, MS-DOS 6.22, no drivers dos
side)

KB just shows very few hits, about NW4.x, 5.x, and ADPT160m.HAM, but
nothing within the last 2 years. And the problem started suddenly after
longer months of no trouble at all, so I suspect some HW trouble might
be the cause.


The server was running smoothly for about 2 years. (nw6.0.3, ZFD4.0.1,
then IR6, AS9.01.sp1)

Then after some reboot it started to hang in the middle of the boot
process, *before* the ArcServe stuff was loaded. (see thread from Aug
2, 9:59 am "After reboot hanging in real mode")


Marcel Cox advised me to start the server "server -kf8". When doing so,
the server did *NOT* hang at boot time when stepping through this way.


About 2 months ago I applied NW60SP5, no problems after that reboot.

Last week I upgraded ZfD4.0.1_ir6 to ZDM6.5.2, I had trouble after that
boot, but another boot was OK.


Yesterday for the very first time the server was unresponsive during
normal operation, the last reboot was several days ago. (I started C1
from sys:public\mgmt\...)

So I did another reboot ( -kf8) and it did not hang with this slow step
by step boot process. But later on again hanged.

PING from another server shows, that first mo PINGs returned, but they
also were *NOT* dropped finally: within one refresh intervall of the
PING screen some ~80(?) PING replies arrived. Average ping time 1m38s.

If this situation comes up, keyboard -> debugger always shows:

DEBUGGER:
================================================== ==========
Break at 0010E543 because of Keyboard request
NOTICE: Executing in a real mode interrupt context.
Enter command "gp" tp <g>o until again in a <p>rotected mode context.
Current Focus Processor : 00
EAX = 00000000 EBX = 00000EB8 ECX = 0013CD58 EDX = 00000000
ESI = 004F7248 EDI = 00000003 EBP = 00000001 ESP = C81EA2B0
EIP = 0010e543 FLAGS = 00000046 (PF ZF)
0010E543 8AC7 MOV AL,BH
# ?
Address in LOADER.EXE at code start +000078A3h
Previous: -000000D7 0010E46C LOADER.EXE|RMHWIntPMEntryPoint
Current: 00000000 0010E543
Next: +0000004D 0010E590 LOADER.EXE|MapAbsoluteAddressToCodeOffset
#gp
-------------
server still hangs, pings return bulkwise.
When back "g" to OS, a CAD *DOES* cause the real mode typical reboot.
After 2-3 minutes the SERVER jumps into debugger on it's own:
-------------
Break because Debug Procedure wall called
Current Focus Processor : 00
EAX = 000000FF EBX = 00000000 ECX = 0013cd58 EDX = C8005000
ESI = 00000000 EDI = C81EA350 EBP = 00000000 ESP = C81EA2B8
EIP = 0010e442 FLAGS = 00000046 (PF ZF)
0010e443 5F POP EDI
#g
================================================== ==========

the keyboard buffer is filled (bell sounds) and like the PINGs return
bulkwise, the keystrokes suddenly show up at the console screen.

How to isolate the cause? Yesterday and today the server became
unresponsive for 5 times. 4 times recoverd on it's own. One time hanged
that 'hard' that even the debugger couldn't be entered any more.



Thanks for any suggestions,

Regards, Rudi.

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~


current startup.ncf:
--------------------------
load acpidrv
Load Keyb Germany
Load Charset CP850
LOAD SCSIHD.CDM
LOAD SCSICD.CDM
load srcrx.ham slot=3
load srcrx.ham slot=6
load adpt160m slot=4
rem load adpu320 slot=901
rem load adpu320 slot=902
--------------------------


CONSOLE.LOG:
============

Auto-Loading Module TLI.NLM [
OK ]
Auto-Loading Module AFTER311.NLM [ PUB
EXISTS ]
Auto-Loading Module A3112.NLM [ PUB
EXISTS ]
Loading Module IPXS.NLM [
OK ]
Loading Module SERVINST.NLM [
OK ]
Loading Module CLIBAUX.NLM [
OK ]

CLibAux.NLM is a library that normally exports symbols needed to shim
more sup-
port atop CLib.NLM. None of these symbols is needed at present.
CLibAux.NLM
has been unloaded as unnecessary.

Loading Module PSVCS.NLM [
OK ]
Loading Module NWAIF103.NLM [
OK ]
Loading Module NWENC103.NLM [
OK ]
Loading Module DSAPI.NLM [NOT
MULTIPLE]
Loading Module CLXNLM32.NLM [NOT
MULTIPLE]
Loading Module NWMKDE.NLM [
OK ]
Auto-Loading Module NWUCMGR.NLM [
OK ]
Loading Module BTRIEVE.NLM [
OK ]
Loading Module NWBSRVCM.NLM [
OK ]
Loading Module CONLOG.NLM [
OK ]
Auto-Loading Module AFTER311.NLM [ PUB
EXISTS ]
Auto-Loading Module A3112.NLM [ PUB
EXISTS ]
Loading Module SNMP.NLM [NOT
MULTIPLE]
Loading Module BCALLSRV.NLM [
OK ]
Auto-Loading Module CSL.NLM [
OK ]
Auto-Loading Module AFTER311.NLM [ PUB
EXISTS ]
Auto-Loading Module A3112.NLM [ PUB
EXISTS ]
Loading Module CE1000.LAN [
OK ]
Auto-Loading Module ETHERTSM.NLM [
OK ]
Auto-Loading Module MSM.NLM [
OK ]
Interrupt assignment: 112 (Device driver is HIN aware.)
Loading Module CE1000.LAN [
OK ]
Loading Module TCPIP.NLM [
OK ]
Auto-Loading Module CSLIND.NLM [
OK ]
Auto-Loading Module NETLIB.NLM [
OK ]
Auto-Loading Module TCP.NLM [
OK ]
Loading Module BSDSOCK.NLM [
OK ]
Loading module BSDSOCK.NLM
SYSINIT: Binding IP to CE1000_1_EII.

TCPIP-6.10-112: Fri Dec 2 07:29:30 2005
Bound to board 2 with IP address 10.27.1.8 and mask FF.FF.00.00.
SYSINIT: Binding IP to CE1000_1_EII.

TCPIP-6.10-112: Fri Dec 2 07:29:30 2005
Bound to board 2 with IP address 192.168.181.8 and mask FF.FF.FF.00.
Reply To Get Nearest Server is ALREADY set to OFF
Loading Module IPXRTR.NLM [
OK ]
Loading Module BSPXCOM.NLM [
OK ]

2.12.2005 7.29.30 : DS-10551.29-264
Bindery open requested by the SERVER

Loading Module DXEVENT.NLM [
OK ]

2.12.2005 7.29.31 : SLP-2.9-0
SLPTCP bound to 10.27.1.8


2.12.2005 7.29.31 : SLP-2.9-0
SLPTCP bound to 192.168.181.8

Auto-Loading Module A3112.NLM [ PUB
EXISTS ]
Loading Module IPXRTRNM.NLM [
OK ]
Auto-Loading Module AFTER311.NLM [ PUB
EXISTS ]
Auto-Loading Module A3112.NLM [ PUB
EXISTS ]
Loading Module GAMS.NLM [
OK ]
Loading Module SPXCONFG.NLM [
OK ]
IPX NetBIOS Replication Option is ALREADY set to 1
Load Balance Local LAN is ALREADY set to OFF
SYSINIT: Binding IPX to CE1000_1_E83.
Loading Module NMAS.NLM [
OK ]
Loading Module ODINEB.NLM [
OK ]
Auto-Loading Module SPMNWCC.NLM [
OK ]
Loading Module ODINEB.NLM [
OK ]
Loading Module SPMDCLNT.NLM [
OK ]
Loading Module REMOTE.NLM [
OK ]
Loading Module RSPX.NLM [
OK ]

2.12.2005 7.29.31 : CE1000-7.34-0
CE1000-NW-000-Adapter 1-Board 1:
Link is up. 100 Mbs Full Duplex

Loading Module BTCPCOM.NLM [
OK ]

2.12.2005 7.29.32 : SLP-2.9-0
SLP registered DA: 10.27.1.9


2.12.2005 7.29.32 : SLP-2.9-0
SLP registered DA: 10.27.1.5

Loading Module NILE.NLM [
OK ]
Auto-Loading Module NWUTIL.NLM [
OK ]
Auto-Loading Module LDAPSDK.NLM [
OK ]
Auto-Loading Module PKIAPI.NLM [
OK ]
Auto-Loading Module PKI.NLM [
OK ]
Loading Module HTTPSTK.NLM [
OK ]
Loading Module PORTAL.NLM [
OK ]
Auto-Loading Module NWIDK.NLM [
OK ]

DNS resolving name for 192.168.181.8 --> nw03goe.goepfert.intern
Loading Module NDSIMON.NLM [
OK ]
Auto-Loading Module LANGMANI.NLM [
OK ]
Auto-Loading Module XI18N.NLM [
OK ]
Loading Module NICISDI.XLM [
OK ]
Loading module NICISDI.XLM [
OK ]
Security Domain Infrastructure
Version 26410.05 10 November 2003
Copyright 1998-2003, Novell, Inc. All rights reserved.
All Digitally Signed Objects successfully loaded.
Loading Module SASDFM.XLM [
OK ]
Loading module SASDFM.XLM [
OK ]
SAS Data Flow Manager
Version 26410.05 10 November 2003
Copyright 1999-2003, Novell, Inc. All rights reserved.
All Digitally Signed Objects successfully loaded.
Loading Module SAS.NLM [
OK ]
Loading Module PKI.NLM [NOT
MULTIPLE]
Loading Module NLDAP.NLM [
OK ]
Loading Module SMDR.NLM [
OK ]
Loading Module TSAFS.NLM [
OK ]
Search 6: [Server Path] SYS:\TOMCAT\33\BIN\
Loading Module SPXS.NLM [
OK ]
Auto-Loading Module A3112.NLM [ PUB
EXISTS ]
Loading Module DSBACKER.NLM [
OK ]
Loading Module JAVA.NLM [
OK ]
Auto-Loading Module JSOCK.NLM [
OK ]
Loading Module JSOCK6X.NLM [
OK ]
Search 7: [Server Path] SYS:\APACHE\
Loading Module NWFTPD.NLM [NOT
MULTIPLE]
Auto-Loading Module NETDB.NLM [
OK ]
Loading Module JVM.NLM [
OK ]
Auto-Loading Module FTPIF.NLM [
OK ]
Loading Module GAMS.NLM [
OK ]
Loading Module AFPTCP.NLM [
OK ]
Auto-Loading Module NMASGPXY.NLM [
OK ]
Loading Module VERIFY.NLM [
OK ]
Auto-Loading Module WSPDSI.NLM [
OK ]
Loading Module JVMLIB.NLM [
OK ]
Loading Module NFAP4NRM.NLM [
OK ]
Auto-Loading Module SETMD4.NLM [
OK ]
Loading Module ZIP.NLM [
OK ]
Loading Module GAMS.NLM [NOT
MULTIPLE]
Loading Module SETMD4.NLM [NOT
MULTIPLE]
Loading Module CIFS.NLM [
OK ]
CIFSNLM: Compile date and time is Apr 01 2004, 15:36:30
Loading Module JNET.NLM [
OK ]
Loading Module NLSTRAP.NLM [
OK ]

2.12.2005 7.29.50 : DS-10551.29-262
Directory Services: Local database is open

Loading Module LBURP.NLM [
OK ]
CIFSNLM Operating Parameters:
Server - "NW03GOE_W"
Comment - "CIFS Zugriff auf ZEN Server"
Authentication - "Local"
Workgroup - "GOEPFERT"
Oplocks - Disabled
Async Read - Disabled
Unicode - Disabled
Share point - "Export All mounted Volumes"
Loading Module CIFSPROX.NLM [
OK ]
Loading Module NFAP4NRM.NLM [NOT
MULTIPLE]
Loading Module LDAPXS.NLM [
OK ]
Loading Module NMASLDAP.NLM [
OK ]
Loading Module NFSADMIN.NLM [
OK ]
Auto-Loading Module PKERNEL.NLM [
OK ]
Auto-Loading Module RPCBSTUB.NLM [
OK ]
Loading Module NTLS.NLM [
OK ]
Loading Module NISSERV.NLM [
OK ]
Auto-Loading Module UNICRYPT.NLM [
OK ]
Auto-Loading Module NDSILIB.NLM [
OK ]
Auto-Loading Module NISBIND.NLM [
OK ]
Auto-Loading Module NISSWDD.NLM [
OK ]
Loading Module NFSSERV.NLM [
OK ]
Loading Module SASL.NLM [
OK ]
Auto-Loading Module NFS.NAM [
OK ]
Auto-Loading Module UNIXLIB.NLM [
OK ]
Auto-Loading Module UNIDLL.NLM [
OK ]
Auto-Loading Module TADJST.NLM [
OK ]
Loading Module RCONAG6.NLM [
OK ]
Loading Module JVM.NLM [NOT
MULTIPLE]
Search 8: [Server Path] SYS:JAVA\NJCLV2\BIN\
Loading Module CDROM.NLM [
OK ]
Auto-Loading Module JCLNTR.NLM [
OK ]
Loading Module JAVA.NLM [
OK ]
Loading Module JAVA.NLM [NOT
MULTIPLE]
Loading Module TFTP.NLM [
OK ]
Loading Module PDHCP.NLM [
OK ]
Loading Module ZENPXE.NLM [
OK ]
Loading Module DTS.NLM [
OK ]
Auto-Loading Module PMAP.NLM [
OK ]
Loading Module JNDPS.NLM [
OK ]
Auto-Loading Module DPRPCNLM.NLM [
OK ]
Loading Module IMGSERV.NLM [
OK ]
Auto-Loading Module DPLSV386.NLM [
OK ]
Auto-Loading Module NIPPED.NLM [
OK ]
Loading Module ZENWS.NLM [
OK ]
Auto-Loading Module ZENIMGDS.NLM [
OK ]
Search 13: [Server Path] SYS:\XTIER\
Loading Module NCPL.NLM [
OK ]
Loading Module APACHE.NLM [
OK ]
Auto-Loading Module APACHEC.NLM [
OK ]
Loading Module JNCPV2.NLM [
OK ]
Loading Module EMBOX.NLM [
OK ]
Loading Module MOD_LCGI.NLM [
OK ]
Auto-Loading Module XIS11.NLM [
OK ]
Auto-Loading Module NSLCGI.NLM [
OK ]
Auto-Loading Module CSSYSMSG.NLM [
OK ]
Loading Module LANGMAN.NLM [
OK ]
Loading Module MOD_NDS.NLM [
OK ]
Loading Module HT2SOAP.NLM [
OK ]
Loading Module MOD_TLS.NLM [
OK ]
Loading Module MOD_JK.NLM [NOT
MULTIPLE]
Loading Module EMBOXMGR.NLM [
OK ]
Auto-Loading Module EMBOXMSG.NLM [
OK ]
Loading Module DBSRV8.NLM [
OK ]
Loading Module IFOLDER.NLM [
OK ]
Loading Module RSS.NLM [
OK ]
Auto-Loading Module MATHLIB.NLM [
OK ]
This path is ALREADY in use as Search 3
Novell NetWare 6
Support Pack Revision 05
(C) Copyright 1983-2003 Novell Inc. All Rights Reserved. Patent Pending.
Server Version 5.60.05 May 27, 2004

Friday, 2 December 2005 7.30.24,809 CET
NW03GOE:
NW03GOE:

Loading Module PFC.NLM [
OK ]
Auto-Loading Module AFTER311.NLM [ PUB
EXISTS ]
Auto-Loading Module A3112.NLM [ PUB
EXISTS ]

2.12.2005 7.30.42 : SLP-2.9-0
SLP activated v2 DA 10.27.1.5


2.12.2005 7.30.42 : SLP-2.9-0
SLP activated v2 DA 10.27.1.9

Loading Module TLI.NLM [NOT
MULTIPLE]
Loading Module BTRIEVE.NLM [NOT
MULTIPLE]
Loading Module CSDBAPIB.NLM [
OK ]
Loading Module DSAPI.NLM [NOT
MULTIPLE]
Loading Module CANWPABD.NLM [
OK ]
Auto-Loading Module CANWPA.CDM [
OK ]
Auto-Loading Module BOARDSVR.NLM [
OK ]
Loading Module MLIB.NLM [
OK ]
Loading Module AWT.NLM [
OK ]
Auto-Loading Module XLIB.NLM [
OK ]
Loading Module PFC.NLM [
OK ]
Auto-Loading Module AFTER311.NLM [ PUB
EXISTS ]
Auto-Loading Module A3112.NLM [ PUB
EXISTS ]
Loading Module CATIRPC.NLM [
OK ]
Loading Module ASDB.NLM [
OK ]
Loading Module ARCSERVE.NLM [
OK ]
Loading Module UNIQSVR.NLM [
OK ]
Loading Module POOLUTIL.NLM [
OK ]
Loading Module UNIDB.NLM [
OK ]
Loading Module DISCOVER.NLM [
OK ]
Loading Module TAPESVR.NLM [
OK ]
Loading Module VALIDATE.NLM [
OK ]
Loading Module UNIDMSVR.NLM [
OK ]
Loading Module STANDARD.NLM [
OK ]
Loading Module STANDARD.NLM [
OK ]
Loading Module STANDARD.NLM [
OK ]
Loading Module STANDARD.NLM [
OK ]
Loading Module TAPEALRT.NLM [
OK ]
-----------------------------------
inbetween here are several hours and a short backup session of
AS9.01.sp1
-----------------------------------


2.12.2005 7.37.02 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 7.40.20 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 7.41.59 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 7.43.38 : SERVER-5.60-131
A scheduled "Delayed WorkToDo" took over one minute to be run.

2.12.2005 8.05.09 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.08.27 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.26.42 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.28.21 : SERVER-5.60-131
A scheduled "Delayed WorkToDo" took over one minute to be run.


2.12.2005 8.28.21 : SERVER-5.60-131
A scheduled "Delayed WorkToDo" took over one minute to be run.


2.12.2005 8.31.38 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.33.17 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.34.56 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.38.14 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.41.33 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.43.12 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.46.30 : SERVER-5.60-131
A scheduled "Delayed WorkToDo" took over one minute to be run.


2.12.2005 8.48.09 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.51.27 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.51.27 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.53.06 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.56.24 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 8.58.03 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.01.21 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.03.00 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.06.18 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.07.56 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.11.14 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.12.53 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.21.11 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.22.50 : SERVER-5.60-131
A scheduled "Delayed WorkToDo" took over one minute to be run.


2.12.2005 9.26.08 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.27.46 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.29.25 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.31.04 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.32.43 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.34.22 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.39.19 : SERVER-5.60-131
A scheduled "Delayed WorkToDo" took over one minute to be run.


2.12.2005 9.40.58 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.42.37 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.

Loading Module APROCESS.NLM [
OK ]

2.12.2005 9.50.55 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.52.33 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.55.52 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.


2.12.2005 9.57.31 : SERVER-5.60-131
A scheduled "Delayed WorkToDo" took over one minute to be run.


2.12.2005 10.00.49 : SERVER-5.60-276 [nmID=4001A]
A scheduled "Work To Do" took over one minute to be run.

-----------------------------------------------------------

Then the server was acting fine til ~ 11:30 AM.