View Single Post
  #1 (permalink)  
Old 2007-01-27
Trealex Trealex is offline
Junior Member
 
Join Date: 2007-01-27
Posts: 1
Rep Power: 0
Trealex has an average reputation (10+)
Default an unexpected reboot

I am having a problem with a cluster, R60 + hfa03 + hotfix 603 running on two HP DL380.
The cluster reported member1 was down. After that member2 was unavailable too.

Looking at the /var/log/messages-all members did an unexpected reboot:
Logs on member1:
Dec 16 15:17:56 cpngx1 syslogd 1.4.1: restart.
Dec 16 15:18:08 cpngx1 kernel: eth0: HP NC7782 Gigabit Server Adapter found at mem fddf0000, IRQ 25, node addr 001279cf6255
Dec 16 15:18:08 cpngx1 kernel: eth1: HP NC7782 Gigabit Server Adapter found at mem fdde0000, IRQ 26, node addr 001279cf6254
Dec 16 15:18:20 cpngx1 kernel: CPHA: failed to add multicast MAC address - not enough memory
Dec 16 15:18:26 cpngx1 cpboot:
Dec 16 15:18:26 cpngx1 cpboot: Installing Security Policy Police_for_CA_Firewall_1 on all.all@cpngx1
Dec 16 15:18:26 cpngx1 kernel: FW-1: Warning: Interface eth0 is defined but not currently present
Dec 16 15:18:26 cpngx1 kernel: FW-1: Warning: Interface eth1 is defined but not currently present
Dec 16 15:18:34 cpngx1 cpboot: Fetching Security Policy from localhost succeeded Next reboot solved the problem. Cluster has two members: cpngx1 and cpngx2. eth0 - LAN eth1 - WAN It's onboard Broadcom interfaces eth2 - sync, Intel additional card. SmartCenter on different machine into the LAN No instructions. No additional info.

Error Message(s):
cluster_info: (ClusterXL) member 1 (10.10.10.1) is down.
cluster_info: (ClusterXL) member 2 (10.34.1.236) is down (Interface Active Check on member 2 (10.34.1.236) detected a problem (3 interfaces required, only 2 up).).

cluster_info: (ClusterXL) interface eth2 of member 2 (10.34.1.236) is down (receive up, transmit down)
cluster_info: (ClusterXL) interface eth2 of member 2 (10.34.1.236) is down (receive down, transmit down)

So, both members got an unexpected reboot. But worst of all that both nodes do not comes up after unexpected reboot. I have to manualy reboot them. After manul reboot everything goes up.

Does somebody had the same problem?
Any ideas would be appreciated!
Reply With Quote