550   A clustered system cannot be formed because of a lack of clustered system resources.

Explanation

The node canister cannot become active in a clustered system because it is unable to connect to enough clustered system resources. The clustered system resources are the node canisters in the system and the active quorum drive. The node canister needs to be able to connect to a majority of the resources before that group forms an online clustered system. This prevents the clustered system splitting into two or more active parts, with both parts independently performing I/O.

The error data lists the missing resources. This includes a list of node canisters and optionally a drive that is operating as the quorum drive.

If a drive in one of the system enclosures is the missing quorum disk, it is listed as enclosure:slot[part identification] where enclosure:slot is the location of the drive when the node shutdown, enclosure is the seven-digit product serial number of the enclosure, slot is a number between 1 and 24. The part identification is the 22 character string that starts with "11S" found on a label on a drive. The part identification cannot be seen until the drive is removed from the enclosure.

User response

Follow troubleshooting procedures to correct connectivity issues between the system canisters and the quorum devices.
  1. Check the status of other node canisters in the system and resolve any faults.
  2. Check that all enclosures in the system are powered on and that the SAS cabling between the enclosures has not been disturbed. If any wiring changes have been made, check that all cables are securely connected and that the cabling rules have been followed.

    Check that all nodes in the system are shown in the service assistant or by using the service command: sainfo lsservicenodes. Investigate any missing nodes.

  3. Check all nodes and quorum disks shown in the error data and check the communication links from this node to those nodes and quorum disks.
    1. If a quorum drive in a system enclosure is shown as missing, find the drive and check that it is working. The drive may have been moved from the location shown. In that case, find the drive and ensure it is installed and working. If the drive is not located in the control enclosure, try moving it to the control enclosure. A problem in SAS connectivity might be the issue.
      Note: If you are able to reestablish the system's operation, you will be able to use the extra diagnostics the system provides to diagnose problem on SAS cables and expansion enclosures.
    2. If a quorum disk on an external storage system is shown as missing, find the storage controller and confirm that the LUN is available. Check that the Fibre Channel connections between the storage controller and the 2076 are working and that any changes made to the SAN configuration and zoning have not effected the connectivity. Check the status of the Fibre Channel ports on the node and resolve any issues.
  4. If all canisters have either node error 578 or 550, attempt to reestablish a clustered system by following the service procedures for the nodes showing node error 578. If this is not successful, follow the system recovery procedures.