Hi,

We have a two-node OES2 SP3 Linux cluster that has been running fine. SMDR,
SLP, eDirectory, and all other network services were working and nothing has
changed.

On Tuesday I noticed that my clustered resources weren't being backed up on
one node.
This node holds the Master replica and is the SLP DA.

I opened Sep Sesam and could no longer browse to the clustered resources
when they were hosted on that node.

Running slptool from any of our servers no longer shows the smdr.novell
services for the resources when they are hosted on that node, but it does
show bindery.novell.
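
In case it helps anyone reproduce the check, here is a rough Python sketch of
how the comparison could be scripted. It assumes `slptool findsrvs
service:smdr.novell` prints one `url,lifetime` pair per line; the resource
name CLUSTER_POOL1_SERVER and both helper functions are made up for
illustration and are not taken from our setup.

```python
import subprocess


def smdr_targets(slptool_output):
    """Return the target names found in `slptool findsrvs` style output.

    Assumes each line looks like `service:smdr.novell:///NAME,lifetime`.
    """
    targets = set()
    for line in slptool_output.splitlines():
        line = line.strip()
        if not line.startswith("service:smdr.novell:"):
            continue
        url = line.split(",", 1)[0]  # drop the trailing ",lifetime"
        targets.add(url.rsplit("/", 1)[-1].upper())
    return targets


def query_smdr_targets():
    """Run slptool on this host and parse what it reports."""
    result = subprocess.run(
        ["slptool", "findsrvs", "service:smdr.novell"],
        capture_output=True, text=True, check=True,
    )
    return smdr_targets(result.stdout)


def missing_targets(expected, slptool_output):
    """Names we expect to be advertised that are absent from the output."""
    return {name.upper() for name in expected} - smdr_targets(slptool_output)


# Hypothetical output: the node itself registers, the cluster resource does not.
sample = "service:smdr.novell:///METRO1,65535\n"
print(missing_targets(["METRO1", "CLUSTER_POOL1_SERVER"], sample))
```

Running `query_smdr_targets()` with the resources on each node in turn and
comparing the two results would show exactly which names drop out.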

If I migrate the resources over to the second node, everything is okay: the
smdr.novell services for those resources show up again.

It's not an issue with Sep Sesam, as the resources aren't showing up in the
SLP information.
TSAFS is set to run in dual mode on both boxes and clustering is enabled.

Since nothing was changed and all other services on this box register with
SLP just fine, I'm at a loss on this one. I checked LUM and eDirectory, which
reported no errors.

A copy of the smdrd debug log is below. Does anyone have an idea on this?

Thanks
Greg

SMDR Debug Log Start Time : Wed Mar 14 23:37:20 2012
############################################################
Os Details=Linux metro1 2.6.16.60-0.83.2-smp #1 SMP Fri Sep 2 13:49:16 UTC
2011 x86_64
Other Info :
###SMSDEBUG###[2]:fffffffcfffff7fc

###SMSDEBUG###[2]:fffffffcfffff7fc

###SMSDEBUG###[2]:fffffffcfffff7fc

f72ab6b0:SMdmem_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:SMshmlib_b_Init : Start
f72ab6b0:SMshmem_b_New : Start
f72ab6b0:SMmapreg_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:SMmapreg_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:SMmapreg_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:SMmapreg_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:SMmapreg_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:smshmlib_InitVTBL : Start
f72ab6b0:SMtgtloc_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:SMlist_b_New : Start
f72ab6b0:SMdmem_New : Start
F72AB6B0:w32smdr_RegisterProtocols[2672 ]:NWCLocalTargetRegistry has been
created..
f72ab6b0:FillRegistries : Start
f72ab6b0:w32smdr_RegisterProtocols: Start
F72AB6B0:w32smdr_RegisterProtocols[2478 ]:TCP present : TRUE SPX present :
FALSE
F72AB6B0:w32smdr_RegisterProtocols[2486 ]:UDS present : TRUE SPX present :
TRUE
F72AB6B0:w32smdr_RegisterProtocols[2504 ]:Registering the protocol : TCP
f72ab6b0:SMshmlib_b_RegisterProtoc: Start
f72ab6b0:SMproreg_RegisterProtocol: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:NWtcppro_New : Start
f72ab6b0:NWtcppro_v_LocalAddress : Start
F72AB6B0:NWtcppro_v_LocalAddress [630 ]:mAddr : (nil)
f72ab6b0:GetAgentIPAddress : Start
F72AB6B0:GetAgentIPAddress [3534 ]:Hostname : METRO1
F72AB6B0:GetAgentIPAddress [3597 ]:IP address : 10.1.0.1
f72ab6b0:GetAgentIPAddress = 1
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
F72AB6B0:w32smdr_RegisterProtocols[2512 ]:After registering the protocol
TCP, cCode : 0
f72ab6b0:SMshmlib_b_RegisterProtoc: Start
f72ab6b0:SMproreg_RegisterProtocol: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:NWtcppro_v_LocalAddress : Start
F72AB6B0:NWtcppro_v_LocalAddress [630 ]:mAddr : (nil)
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
F72AB6B0:w32smdr_RegisterProtocols[2522 ]:After registering the protocol
UDS, cCode : 0
F72AB6B0:FillRegistries [2856 ]:Number of protocols registered : 2
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:InitSmdrSslServer : Start
f72ab6b0:SigHandler : Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:InitializeDefaultResource: Start
f72ab6b0:GetAgentIPAddress : Start
F72AB6B0:GetAgentIPAddress [3534 ]:Hostname : METRO1
F72AB6B0:GetAgentIPAddress [3597 ]:IP address : 10.1.0.1
f72ab6b0:GetAgentIPAddress = 1
f72ab6b0:ProcessJoinEvent : Start
F72AB6B0:ProcessJoinEvent [6176 ]:svc : res :METRO1 procount :2
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCLocalTargetRegistry_b_= fffeffbd
f72ab6b0:NWCLocalTarget_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:NWCSvcRegistry_New : Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:SMlist_b_New : Start
f72ab6b0:SMdmem_New : Start
F72AB6B0:NWCSvcRegistry_New [1440 ]:TgtList is created..
F72AB6B0:NWCLocalTarget_New [537 ]::svcReg is :0x80934f4 cCode :0x0
F72AB6B0:NWCLocalTarget_New [542 ]:proName :TCP proAddr :16777482
F72AB6B0:NWCLocalTarget_New [542 ]:proName :UDS proAddr :0
f72ab6b0:ListenAndAdvertise : Start
F72AB6B0:ListenAndAdvertise [5711 ]:noOfPRos :2
F72AB6B0:ListenAndAdvertise [5714 ]:proName :UDS
F72AB6B0:ListenAndAdvertise [5714 ]:proName :TCP
f72ab6b0:SMchild_b_New : Start
f72ab6b0:SMshmem_b_New : Start
f72ab6b0:smchild_Init : Start
f72ab6b0:w95oslib_StartChild : Start
f7277ba0:SMdmem_New : Start
f7277ba0:SMshmem_b_New : Start
f7277ba0:_getFileNameFromLibtoolAr: Start
f7277ba0:SMdmem_New : Start
F7277BA0:SMentry_b_GetConstructor [764 ]:fn - NWtcppro_New
f7277ba0:NWtcppro_New : Start
F7277BA0:SMlsnr_Listen [212 ]:Inside while loop..
f7277ba0:NWtcppro_v_Listen : Start
f7277ba0:NWtcppro_v_LocalAddress : Start
F7277BA0:NWtcppro_v_LocalAddress [630 ]:mAddr : 0x8092f0c
F7277BA0:NWtcppro_v_Listen [836 ]:Socket created ..Socket :8
F7277BA0:NWtcppro_v_Listen [909 ]:setsockopt Nagle Disable
succeded..socket :8
F7277BA0:NWtcppro_v_Listen [923 ]:setsockopt kpAliveTimeout
succeded...socket :8
F7277BA0:NWtcppro_v_Listen [932 ]:setsockopt TCP_KEEPIDLE
succeded...socket :8
F7277BA0:NWtcppro_v_Listen [940 ]:setsockopt TCP_KEEPINTVL
succeded...socket :8
F72AB6B0:ListenAndAdvertise [5750 ]:TCP listener started at socket 413
f72ab6b0:advertiseSLPService : Start
F72AB6B0:advertiseSLPService [5508 ]:Service Name : METRO1 IP Addr
:16777482
F72AB6B0:advertiseSLPService [5570 ]:SMDR instance registered
successfully with SLP.
F72AB6B0:ListenAndAdvertise [5764 ]:Target name advertised
successfully.
F72AB6B0:ListenAndAdvertise [5711 ]:noOfPRos :2
F72AB6B0:ListenAndAdvertise [5714 ]:proName :UDS
f72ab6b0:SMchild_b_New : Start
f72ab6b0:smchild_Init : Start
f72ab6b0:w95oslib_StartChild : Start
f68c2ba0:SMdmem_New : Start
f68c2ba0:SMshmem_b_New : Start
f68c2ba0:_getFileNameFromLibtoolAr: Start
f68c2ba0:SMdmem_New : Start
F68C2BA0:SMentry_b_GetConstructor [764 ]:fn - NWUDS_New
F68C2BA0:SMlsnr_Listen [212 ]:Inside while loop..
f68c2ba0:NWtcppro_v_Listen : Start
f68c2ba0:NWtcppro_v_LocalAddress : Start
F68C2BA0:NWtcppro_v_LocalAddress [630 ]:mAddr : 0x8092f54
F68C2BA0:NWtcppro_v_Listen [836 ]:Socket created ..Socket :10
F72AB6B0:ListenAndAdvertise [5750 ]:UDS listener started at socket 413
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCLocalTargetRegistry_b_= fffefffe
F72AB6B0:main [382 ]:Default object created and added
into Registery.
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCSvcRegistry_b_AddSvcSu: Start
f72ab6b0:SMdmem_New : Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCSvcRegistry_b_AddSvcSu: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
F72AB6B0:_getFileNameFromLibtoolAr[3364 ]:Could not find module libtsands.la
f72ab6b0:_getFileNameFromLibtoolAr= fffeffa7
f72ab6b0:SMdmem_New : Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:_getFileNameFromLibtoolAr: Start
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCSvcRegistry_b_AddSvcSu: Start
f72ab6b0:create_queue : Start
f72ab6b0:getSDAndAddr : Start
f72ab6b0:read_message : Start
f72ab6b0:SigHandler : Start
f72ab6b0:SigHandler : Start
f72ab6b0:SigHandler : Start
F72AB6B0:read_message [128 ]:accept: Interrupted system call
f72ab6b0:read_message = ffffffff
f72ab6b0:LNX_SigTermHandler : Start
F72AB6B0:NWSMRetractModuleFromSMDR[1008 ]:nwSvc :0x80b44bc
F72AB6B0:NWSMRetractModuleFromSMDR[1019 ]:nwSvc :0x0x80b44bc found :1
F72AB6B0:NWSMRetractModuleFromSMDR[1037 ]:Start of Cluster Section.
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCLocalTargetRegistry_b_= fffeffbd
F72AB6B0:NWSMRetractModuleFromSMDR[1060 ]:End of Cluster Section..
f72ab6b0:SMshmlib_b_RetractService: Start
F72AB6B0:smdmem_v_Delete [307 ]:Deleting dmem..
F72AB6B0:NWSMRetractModuleFromSMDR[1008 ]:nwSvc :0x0
F72AB6B0:NWSMRetractModuleFromSMDR[1008 ]:nwSvc :0x80b4ac4
F72AB6B0:NWSMRetractModuleFromSMDR[1019 ]:nwSvc :0x0x80b4ac4 found :1
F72AB6B0:NWSMRetractModuleFromSMDR[1037 ]:Start of Cluster Section.
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCLocalTargetRegistry_b_= fffeffbd
F72AB6B0:NWSMRetractModuleFromSMDR[1060 ]:End of Cluster Section..
f72ab6b0:SMshmlib_b_RetractService: Start
F72AB6B0:smdmem_v_Delete [307 ]:Deleting dmem..
F72AB6B0:NWSMRetractModuleFromSMDR[1008 ]:nwSvc :0x0
F72AB6B0:NWSMRetractModuleFromSMDR[1008 ]:nwSvc :0x0
F72AB6B0:NWSMRetractModuleFromSMDR[1008 ]:nwSvc :0x80bb38c
F72AB6B0:NWSMRetractModuleFromSMDR[1019 ]:nwSvc :0x0x80bb38c found :1
F72AB6B0:NWSMRetractModuleFromSMDR[1037 ]:Start of Cluster Section.
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCLocalTargetRegistry_b_: Start
f72ab6b0:NWCLocalTargetRegistry_b_= fffeffbd
F72AB6B0:NWSMRetractModuleFromSMDR[1060 ]:End of Cluster Section..
f72ab6b0:SMshmlib_b_RetractService: Start
F72AB6B0:smdmem_v_Delete [307 ]:Deleting dmem..
f72ab6b0:ProcessLeaveEvent : Start
F72AB6B0:ProcessLeaveEvent [6264 ]:res Name :METRO1
f72ab6b0:StopListenerAndAdvertiser: Start
F72AB6B0:StopListenerAndAdvertiser[5874 ]:noOfPRos :1
F72AB6B0:StopListenerAndAdvertiser[5877 ]:proName :UDS
F72AB6B0:StopListenerAndAdvertiser[5877 ]:proName :TCP
f72ab6b0:stopSLPService : Start
F72AB6B0:stopSLPService [5633 ]:SMDR instance De-registered
successfully with SLP.
F72AB6B0:StopListenerAndAdvertiser[5896 ]:Target name de-advertised
successfully.
F72AB6B0:StopListenerAndAdvertiser[5874 ]:noOfPRos :0
F72AB6B0:StopListenerAndAdvertiser[5877 ]:proName :UDS
F72AB6B0:StopListenerAndAdvertiser[5968 ]:Closing Listener socket :8
F72AB6B0:StopListenerAndAdvertiser[5984 ]:Killing thread with thread
id: -148407392
F72AB6B0:StopListenerAndAdvertiser[5968 ]:Closing Listener socket :10
f7277ba0:SigHandler : Start
f7277ba0:NWtcppro_v_Listen = 80200205
F7277BA0:SMlsnr_Listen [260 ]:Could not start TCP listener on
10.1.0.1
F7277BA0:smdmem_v_Delete [307 ]:Deleting dmem..
F7277BA0:SMchild_Main [529 ]:Child exited with error code
2149581317
f7277ba0:smshmem_v_Delete : Start
F7277BA0:smdmem_v_Delete [307 ]:Deleting dmem..
F72AB6B0:StopListenerAndAdvertiser[5984 ]:Killing thread with thread
id: -158585952
F72AB6B0:StopListenerAndAdvertiser[5991 ]:noOfLsnrs :2 cCode :0
f68c2ba0:SigHandler : Start
f68c2ba0:NWtcppro_v_Listen [00866]: socket failed.
ret: 00000009.
f68c2ba0:NWtcppro_v_Listen = 80200205
F68C2BA0:SMlsnr_Listen [250 ]:Could not start UDS listener
F68C2BA0:smdmem_v_Delete [307 ]:Deleting dmem..
F68C2BA0:SMchild_Main [529 ]:Child exited with error code
2149581317
f68c2ba0:smshmem_v_Delete : Start
F68C2BA0:smdmem_v_Delete [307 ]:Deleting dmem..
F72AB6B0:StopListenerAndAdvertiser[5999 ]:Try: 10
F72AB6B0:StopListenerAndAdvertiser[6007 ]:Setting smdr->mListeners[0] to
NULL
F72AB6B0:StopListenerAndAdvertiser[6007 ]:Setting smdr->mListeners[1] to
NULL
F72AB6B0:StopListenerAndAdvertiser[6017 ]:All Listeners have been deleted.
f72ab6b0:NWCSvcRegistry_b_RemoveFi: Start
f72ab6b0:NWCSvcRegistry_b_RemoveFi= fffeffbd
F72AB6B0:smdmem_v_Delete [307 ]:Deleting dmem..
F72AB6B0:smdmem_v_Delete [307 ]:Deleting dmem..
F72AB6B0:smdmem_v_Delete [307 ]:Deleting dmem..
f72ab6b0:DeinitSmdrSslServer : Start
f72ab6b0:smtgtloc_v_Delete : Start
F72AB6B0:smdmem_v_Delete [307 ]:Deleting dmem..

############################################################
SMDR Debug Log End Time : Wed Mar 14 23:38:28 2012
############################################################
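
A note for anyone reading the log: several of the decimal values are just the
hex values seen elsewhere in it. The small Python sketch below shows the
conversions. The helper names are my own, and reading proAddr as a
little-endian 32-bit IPv4 address is an assumption, although it does come out
as the 10.1.0.1 that GetAgentIPAddress reports.

```python
import socket
import struct


def to_hex32(value):
    """Render a signed or unsigned integer as 32-bit hex, as the log does."""
    return format(value & 0xFFFFFFFF, "08x")


def le_int_to_ip(value):
    """Interpret a 32-bit integer as an IPv4 address stored little-endian."""
    return socket.inet_ntoa(struct.pack("<I", value))


# "Child exited with error code 2149581317" vs. "NWtcppro_v_Listen = 80200205"
print(to_hex32(2149581317))    # 80200205

# "Killing thread with thread id" values vs. the f7277ba0 / f68c2ba0 prefixes
print(to_hex32(-148407392))    # f7277ba0
print(to_hex32(-158585952))    # f68c2ba0

# "proName :TCP proAddr :16777482" vs. "IP address : 10.1.0.1"
print(le_int_to_ip(16777482))  # 10.1.0.1
```

So the child exit codes are the same 80200205 return value from
NWtcppro_v_Listen, and the killed thread ids are the two listener threads
whose lines carry those prefixes.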