Hi,
Hopefully someone can shed some light on this issue.
Scenario: upgrading a VSX VSLS cluster from R77.10 to R77.30 with Jumbo HFA 205, using the Connectivity Upgrade method. All Virtual Systems are running on one member during the upgrade. The VSX gateways are managed by Multi-Domain.
Status on the upgraded member before issuing 'cphacu start':
Code:
[Expert@VSX2:0]# cphaprob state
Cluster Mode: Virtual System Load Sharing
Number Unique Address Assigned Load State
2 (local) 10.255.255.66 0% Ready
(*) 'Ready' state might be caused due to configuration inconsistency between members:
32bit/64bit/usermode, number of CoreXL instances or different SW version.
Cluster name: FWVSX
Virtual Devices Status on each Cluster Member
=============================================
 ID    | Weight| FWVSX2
       |       | [local]
-------+-------+-----------
 1     | 10    | Ready
 2     | 10    | Ready
 3     | 10    | Ready
 4     | 10    | Ready
 5     | 10    | Ready
 6     | 10    | Ready
---------------+-----------
 Active        | 0
 Weight        | 0
 Weight (%)    | 0
Legend: Init - Initializing, Active! - Active Attention
Down! - ClusterXL Inactive or Virtual System is Down
'cphacu start' finishes with the following error: get_old_member_info: Connectivity upgrade failed since CPHAPROB can't see old member on VSID 0.
Code:
[Expert@VSX2:0]# cphacu start
Starting Connectivity Upgrade...
Waiting until the new member is able to communicate with Active old member... 1/10
Waiting until the new member is able to communicate with Active old member... 2/10
Waiting until the new member is able to communicate with Active old member... 3/10
Waiting until the new member is able to communicate with Active old member... 4/10
Waiting until the new member is able to communicate with Active old member... 5/10
Waiting until the new member is able to communicate with Active old member... 6/10
Waiting until the new member is able to communicate with Active old member... 7/10
Waiting until the new member is able to communicate with Active old member... 8/10
Waiting until the new member is able to communicate with Active old member... 9/10
Waiting until the new member is able to communicate with Active old member... 10/10
get_old_member_info: Connectivity upgrade failed since CPHAPROB can't see old member on VSID 0.
Traffic through the Virtual Systems running on the active VSX member is impacted once 'cphacu start' finishes.
Status on the active gateway:
Code:
[Expert@VSX1:0]# cphaprob state
Cluster Mode: Virtual System Load Sharing
Number Unique Address Assigned Load State
1 (local) 10.255.255.65 100% Active
2 10.255.255.66 0% ClusterXL Inactive or Machine is Down
Cluster name: FWVSX
Virtual Devices Status on each Cluster Member
=============================================
 ID    | Weight| FWVSX1    | FWVSX2
       |       | [local]   |
-------+-------+-----------+-----------
 1     | 10    | Active!   | Down!
 2     | 10    | Active!   | Down!
 3     | 10    | Active!   | Down!
 4     | 10    | Active!   | Down!
 5     | 10    | Active!   | Down!
 6     | 10    | Active!   | Down!
---------------+-----------+-----------
 Active        | 6         | 0
 Weight        | 60        | 0
 Weight (%)    | 100       | 0
Legend: Init - Initializing, Active! - Active Attention
Down! - ClusterXL Inactive or Virtual System is Down
Status on the upgraded gateway after 'cphacu start' has finished:
Code:
[Expert@VSX2:0]# cphaprob state
Cluster Mode: Virtual System Load Sharing
Number Unique Address Assigned Load State
1 10.255.255.6 0% ClusterXL Inactive or Machine is Down
2 (local) 10.255.255.66 0% Ready
(*) 'Ready' state might be caused due to configuration inconsistency between members:
32bit/64bit/usermode, number of CoreXL instances or different SW version.
Cluster name: FWVSX
Virtual Devices Status on each Cluster Member
=============================================
 ID    | Weight| FWVSX1    | FWVSX2
       |       |           | [local]
-------+-------+-----------+-----------
 1     | 10    | Down!     | Ready
 2     | 10    | Down!     | Ready
 3     | 10    | Down!     | Ready
 4     | 10    | Down!     | Ready
 5     | 10    | Down!     | Ready
 6     | 10    | Down!     | Ready
---------------+-----------+-----------
 Active        | 0         | 0
 Weight        | 0         | 0
 Weight (%)    | 0         | 0
Legend: Init - Initializing, Active! - Active Attention
Down! - ClusterXL Inactive or Virtual System is Down
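For anyone comparing captures like the ones above, here is a minimal Python sketch (not a Check Point tool) that reads the 'Virtual Devices Status' table from saved 'cphaprob state' output and lists the Virtual Systems that no member reports as Active. The names parse_vs_table and vs_without_active are hypothetical, and the parsing assumes the pipe-separated layout shown in this post; other versions may format the table differently.

```python
from typing import Dict, List


def parse_vs_table(cphaprob_output: str) -> Dict[int, Dict[str, str]]:
    """Map each VS ID to {member name: state} from the pipe-separated table."""
    member_names: List[str] = []
    table: Dict[int, Dict[str, str]] = {}
    for line in cphaprob_output.splitlines():
        cells = [cell.strip() for cell in line.split("|")]
        if len(cells) < 3:
            continue  # separator lines, legend and free text
        if cells[0] == "ID":
            # Columns after "ID" and "Weight" are the member names.
            member_names = cells[2:]
        elif cells[0].isdigit() and member_names:
            # Data row: VS ID, weight, then one state per member.
            table[int(cells[0])] = dict(zip(member_names, cells[2:]))
        # Summary rows ("Active", "Weight", ...) fall through and are ignored.
    return table


def vs_without_active(table: Dict[int, Dict[str, str]]) -> List[int]:
    """Return VS IDs for which no member state starts with 'Active'."""
    return [
        vsid
        for vsid, states in table.items()
        if not any(state.startswith("Active") for state in states.values())
    ]


# Shortened sample taken from the upgraded member's view in this post.
SAMPLE_VS_TABLE = """\
ID | Weight| FWVSX1 | FWVSX2
| | | [local]
-------+-------+-----------+-----------
1 | 10 | Down! | Ready
2 | 10 | Down! | Ready
---------------+-----------+-----------
Active | 0 | 0
"""

if __name__ == "__main__":
    print(vs_without_active(parse_vs_table(SAMPLE_VS_TABLE)))
```

Run against the upgraded member's view, this flags every Virtual System, which matches what the table shows: from that member's side, nothing is Active anywhere.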
I managed to upgrade this cluster (with a small outage). However, more upgrades are planned over the coming weeks, and keeping the outage to a minimum is a must.
Has anyone faced this issue?
The Connectivity Upgrade document describes a similar error as follows:
"When CU starts, the two members begin to communicate, and the new member sees the old member as Active. Check communication on the sync interface, and make sure that the MAC Magic Configuration is correct."
I have checked the following:
- The Magic MAC configuration appears to be the same on both gateways (even though the upgraded VSX member uses the cluster_id parameter)
- The sync interfaces are communicating properly
Thanks for any ideas on this issue.
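The condition quoted from the document (the new member must see the old member as Active) can be checked against saved 'cphaprob state' output. Below is a minimal sketch in Python, assuming the member table layout shown in this post (number, optional "(local)", address, load, state). The names Member, parse_members and peer_seen_active are hypothetical, and treating any state that starts with "Active" as active is my own assumption.

```python
import re
from typing import List, NamedTuple


class Member(NamedTuple):
    number: int
    is_local: bool
    address: str
    load_percent: int
    state: str


# One row of the member table, e.g. "2 (local) 10.255.255.66 0% Ready".
MEMBER_ROW = re.compile(
    r"^\s*(\d+)\s+(\(local\)\s+)?(\d{1,3}(?:\.\d{1,3}){3})\s+(\d+)%\s+(.+?)\s*$"
)


def parse_members(cphaprob_output: str) -> List[Member]:
    """Extract the cluster member rows; all other lines are ignored."""
    members = []
    for line in cphaprob_output.splitlines():
        match = MEMBER_ROW.match(line)
        if match:
            number, local, address, load, state = match.groups()
            members.append(
                Member(int(number), local is not None, address, int(load), state)
            )
    return members


def peer_seen_active(cphaprob_output: str) -> bool:
    """True if any non-local member is reported in an Active state."""
    return any(
        not member.is_local and member.state.startswith("Active")
        for member in parse_members(cphaprob_output)
    )


# Member table as seen on the upgraded gateway after 'cphacu start' in this post.
SAMPLE_MEMBERS = """\
Cluster Mode: Virtual System Load Sharing
Number Unique Address Assigned Load State
1 10.255.255.6 0% ClusterXL Inactive or Machine is Down
2 (local) 10.255.255.66 0% Ready
"""

if __name__ == "__main__":
    print(peer_seen_active(SAMPLE_MEMBERS))
```

For the output captured on the upgraded gateway this returns False, which is consistent with the 'can't see old member' error: the peer is listed, but never as Active.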
Regards
Ed