Hi guys,
I've got a new IBM blade centre running 6 node cluster (OES-Netware
SP2). The cluster is still in testing.
I came in this morning to find all 6 nodes had poison pill abended all
within a few seconds of each other.

There's no errors on the SAN switch fabric log, and I don't think
there's any network issues.
Is there any way of finding out what caused a split brain to occur?

cheers
Dave
---------------------------------

Server CHISFS2 halted Sunday, 7 May 2006 12:00:05.677 am
Abend 1 on P00: Server-5.70.05-0: Ate Poison Pill in SbdWriteNodeTick
given by some other node.


Registers:
CS = 0060 DS = 007B ES = 007B FS = 007B GS = 007B SS = 0068
EAX = A6A52666 EBX = AA8094A0 ECX = FE005CA0 EDX = A85ABEAC
ESI = 00000000 EDI = AA8094A0 EBP = A85ABEB0 ESP = A85ABEA4
EIP = AA7E20E3 FLAGS = 00000286
AA7E20E3 83C404 ADD ESP, 00000004
EIP in CLSTRLIB.NLM at code start +000060E3h

The violation occurred while processing the following instruction:
AA7E20E3 83C404 ADD ESP, 00000004
AA7E20E6 EB38 JMP AA7E2120
AA7E20E8 837DFC01 CMP [EBP-04], 00000001
AA7E20EC 7406 JZ AA7E20F4
AA7E20EE 837DFC02 CMP [EBP-04], 00000002
AA7E20F2 7520 JNZ AA7E2114
AA7E20F4 C745FC00000000 MOV [EBP-04], 00000000
AA7E20FB 8D45FC LEA EAX, [EBP-04]
AA7E20FE 50 PUSH EAX
AA7E20FF B8B4477FAA MOV EAX, AA7F47B4



Running process: SBD Write Node Tick Thread Process
Thread Owned by NLM: SBD.NLM
Stack pointer: A85ABD64
OS Stack limit: A85A8000
CPU 0 (Thread AA8094A0) is in a NO SLEEP state
Scheduling priority: 67371008
Wait state: 3030070 Yielded CPU