IBM Spectrum Scale APARs Resolved in 5.1.7.x

IJ45803

Spectrum Scale and systemhealth monitor (sysmon) start independently after a node reboot.
During initialization, Spectrum Scale checks if all declared NFS exports are available.
The sysmon configuration has the flag "preventnfsstartuponmissingfs" enabled, so the expected behavior was that NFS is not started if a required filesystem is unmounted. But in fact, NFS was started anyway.

Symptom	Unexpected Results/Behavior
Environment	ALL Linux OS environments running CES with enabled NFS protocol
Trigger	Spectrum Scale and systemhealth monitor (sysmon) start independently after a node reboot. During initialization, Spectrum Scale checks if all declared NFS exports are available. At that point sysmon was still initializing and had not yet done this evaluation, so it returned "no bad configuration found", which then triggered the NFS startup. The sysmon configuration has the flag "preventnfsstartuponmissingfs" enabled, so the expected behavior was that NFS does not come up. Ganesha fails later and triggers an IP address failover, which disturbs the cluster operation.
Workaround	Make sure that the exported filesystems have the automount feature enabled, if possible. If the missing exported filesystem is not in use anyway, then remove it from the declared export list.

5.1.7.1

System Health
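
The race described in this entry comes down to treating "not evaluated yet" as "no bad configuration found". The following Python sketch illustrates that distinction only. The names ExportCheck and may_start_nfs and the three states are hypothetical and are not Spectrum Scale or sysmon interfaces, and the sketch does not claim to show how the fix is implemented.

```python
from enum import Enum


class ExportCheck(Enum):
    """Hypothetical result states for the NFS export evaluation."""
    NOT_EVALUATED = "not evaluated"  # monitor is still initializing
    OK = "ok"                        # all exported file systems are mounted
    MISSING_FS = "missing fs"        # an exported file system is unmounted


def may_start_nfs(check: ExportCheck, prevent_on_missing_fs: bool) -> bool:
    """Decide whether NFS may start.

    With the prevention flag set, only a positive OK result allows startup.
    An evaluation that has not run yet is not read as "nothing wrong".
    """
    if not prevent_on_missing_fs:
        return True
    return check is ExportCheck.OK
```

With the flag enabled, may_start_nfs(ExportCheck.NOT_EVALUATED, True) returns False, so a caller would have to wait for a real evaluation result instead of starting NFS early.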

IJ45804

While online mmfsckx is in progress, I/O on a file or directory whose inode number does not fit in a 32-bit integer can, in some cases, cause the node to assert with LOGASSERT(i64 == INVALID_INODE_NUMBER || (i64 & 0xFFFFFFFF00000000ULL) == 0)

Symptom	Will see the assert LOGASSERT(i64 == INVALID_INODE_NUMBER || (i64 & 0xFFFFFFFF00000000ULL) == 0)
Environment	ALL
Trigger	This issue can only happen when online mmfsckx is run on a file system having more than 2^32 inodes (roughly 4.3 billion) and a user is doing I/O on a file or directory whose inode number does not fit in 32 bits while mmfsckx is running.
Workaround	None

5.1.7.1

FSCKX
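
The assertion condition quoted in this entry holds when an inode number is either the invalid marker or has no bits set in its upper 32 bits. A minimal Python sketch of the same test follows. The value used for INVALID_INODE_NUMBER is a placeholder chosen for this sketch, since the real constant is not given here, and fits_in_32_bits is a hypothetical helper name.

```python
# Placeholder for this sketch only; the real constant value is not stated.
INVALID_INODE_NUMBER = -1

# Mask selecting the upper 32 bits of a 64-bit value.
UPPER_32_MASK = 0xFFFFFFFF00000000


def fits_in_32_bits(inode: int) -> bool:
    """Mirror the asserted condition: invalid marker, or upper 32 bits clear."""
    return inode == INVALID_INODE_NUMBER or (inode & UPPER_32_MASK) == 0


largest_ok = 2**32 - 1   # highest inode number that passes the check
first_bad = 2**32        # first inode number that fails it
```

Any inode number of 2^32 or higher fails the condition, which matches the trigger: a file system with more than 2^32 inodes.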

IJ45590

With File Audit Logging (FAL) enabled, when kxGanesha op 112 (GET_XSTAT) is being handled, the NFS client IP is malloc'ed and inserted into a table by the current Ganesha thread for use by FAL.
Freeing the IP is left to the close file routine.
However, that routine is called by a different thread and not immediately after the kxGanesha op 112 call.
This results in the IP remaining in the table and not being freed, leading to memory leaks and subsequent memory exhaustion.

Symptom	Error output/message
Environment	Linux only
Trigger	With File Audit Logging and CES NFS enabled, run GET_XSTAT workloads (e.g., stat, nfs4_getfacl) against files/directories in an NFS Ganesha mount for some period of time until out-of-memory issues appear.
Workaround	Disable File Audit Logging for the affected file system (for Scale version 5.1.3 to 5.1.6). Restart the mmfs daemon on the CES nodes (for Scale version 5.1.7).

5.1.7.1

NFS and File Audit Logging
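
The leak pattern in this entry can be illustrated with a small Python sketch: one thread stores an entry and a different thread is expected to clean it up. The sketch assumes the table is keyed by the identifier of the storing thread. The entry does not state this, and the names client_ip_table, handle_get_xstat and close_file are hypothetical.

```python
import threading

# Hypothetical table of client IPs kept for audit logging, keyed by the
# identifier of the thread that stored the entry (an assumption of this sketch).
client_ip_table = {}
table_lock = threading.Lock()


def handle_get_xstat(client_ip: str) -> None:
    """Store the client IP under the calling thread's identifier."""
    with table_lock:
        client_ip_table[threading.get_ident()] = client_ip


def close_file() -> None:
    """Remove the entry that belongs to the calling thread, if there is one."""
    with table_lock:
        client_ip_table.pop(threading.get_ident(), None)


# The request is handled on a worker thread ...
worker = threading.Thread(target=handle_get_xstat, args=("192.0.2.10",))
worker.start()
worker.join()

# ... but the close runs later on a different thread (here the calling thread),
# so it finds nothing to remove and the worker's entry stays behind.
close_file()
leaked_entries = len(client_ip_table)
```

Each request handled this way leaves one more entry in the table, which is how a steady GET_XSTAT workload could exhaust memory over time.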

IJ45609

Due to an issue identified in offline fsck (mmfsck), it can report false-positive lost blocks and fail to properly report genuine incorrect blocks and duplicates.

Symptom	Corruptions such as duplicates remain even after an offline fsck repair, and subsequent offline fsck runs show lost blocks and incorrect blocks.
Environment	ALL Operating System environments
Trigger	This issue will happen on a file system where the user created two or more dataOnly pools and then at some point deleted the earlier data pool(s) out of order (i.e., a dataOnly pool (n) is deleted while other data pools (n+x) are present).
Workaround	- Create one or more "dummy" dataOnly pools, each by adding a single NSD belonging to that "dummy" dataOnly pool to the file system. The NSD can be of minimal size because no data needs to be stored in the "dummy" data pool. - Then run offline fsck on the file system; it should now report and repair lost blocks, incorrect blocks and duplicates correctly. - Once the file system is fixed, you can delete the "dummy" data pool by deleting the only NSD in it.

5.1.7.1

FSCK

IJ46129

Daemon asserts when an AFM fileset is unlinked using its junction path.

Symptom	Crash
Environment	All
Trigger	Unlinking AFM fileset with junction path
Workaround	Use fileset name to unlink the fileset instead of using the junction path.

5.1.7.1

AFM

IJ45806

There is a peculiar case where the local bit on the .ptrash directory inside AFM filesets gets reset.
This causes the .ptrash directory to be treated like a normal directory and in Write modes, the temporary files generated for recovery/resync policy start getting replicated to the remote site.
For Read modes this causes the .ptrash directory to show up as a dangling entry, because a normal lookup is sent to home and, since .ptrash doesn't have remote attrs, the lookup fails to complete successfully.
This also causes errors when the user wants to empty .ptrash with rm -rf, since the lookups to the remote site don't succeed.

Symptom	Unexpected Behavior
Environment	All Linux OS environments (AFM Gateway nodes); All OS Platforms (Application nodes in AFM enabled clusters)
Trigger	The .ptrash local bit getting reset unintentionally, followed by operations performed on the fileset, like ls or recovery
Workaround	Manually set the local bit on .ptrash on seeing issues.

5.1.7.1

AFM

IJ45805

The command to start smb traces "mmprotocoltrace start smb -c <ip address>" failed with an error message "/tmp/mmfs: No such file or directory".
The corresponding log file /var/adm/ras/mmprotocoltrace.log shows error messages for this failing command, but gives no detail about the reason.

Symptom	Error output/message
Environment	All Linux OS environments running CES protocols
Trigger	The mmprotocoltrace command tries to access all CES nodes to collect data from them. Each of these nodes must have the given data collection directory available and the correct rights for the mmprotocoltrace user (like root or sudo user) to access it.
Workaround	Manually check if all CES nodes have the data collection path available. This is /tmp/mmfs by default, or the path given with the "-l LogFileDir" option in the mmprotocoltrace command.

5.1.7.1

System Health
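
The manual check from the workaround can be sketched in Python as follows. The sketch only inspects the node it runs on, so it would have to be run on each CES node. The function name check_collection_dir is hypothetical and is not part of mmprotocoltrace. The default path /tmp/mmfs is taken from the workaround text.

```python
import os
import tempfile
from typing import List


def check_collection_dir(path: str = "/tmp/mmfs") -> List[str]:
    """Return a list of problems found with the data collection directory."""
    problems = []
    if not os.path.isdir(path):
        problems.append(f"{path}: No such directory")
    elif not os.access(path, os.R_OK | os.W_OK | os.X_OK):
        problems.append(f"{path}: insufficient access rights for the current user")
    return problems


# Demonstration on a scratch directory instead of a real CES node.
with tempfile.TemporaryDirectory() as scratch:
    present = check_collection_dir(scratch)
    missing = check_collection_dir(os.path.join(scratch, "mmfs"))
```

An empty list means the directory exists and is accessible to the user running the check. A missing directory produces one entry.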

IJ46130

AFM Recovery uses an external program to detect renames/removes that were not replicated.
This external program was seen to leak a few memory blocks; this is now addressed.

Symptom	Unexpected Behavior
Environment	All Linux OS Platforms (AFM Gateway nodes)
Trigger	AFM recovery triggered with renames/removes that need to be recovered.
Workaround	None

5.1.7.1

AFM

IJ46208

Add hardware information to scheduled call home data.

Symptom	N/A
Environment	ESS only
Trigger	N/A
Workaround	None

5.1.7.1

ESS

IJ45880

A GPFS Windows node that has been running for a few hours may enter a state wherein, even under no load, the idle GPFS threads spin, causing 100% CPU utilization.
This is because of a potential error in time management and computation on Windows.

Symptom	Performance Impact/Degradation.
Environment	Windows/x86_64 only.
Trigger	GPFS must be up and running on a Windows node for a few hours.
Workaround	A possible work-around is to bounce GPFS on the Windows node (mmshutdown followed by mmstartup).

5.1.7.1

Windows performance.

IJ45891

All non-POSIX operations like SetAttr, SetXattr, Peer snapshots, etc., are not going through from Cache/Primary to the Home/Secondary, because use of the AFM special control file at Home/Secondary is prevented.

Symptom	Unexpected Behavior
Environment	Linux Only (AFM Gateway nodes)
Trigger	Performing SetAttr, SetXattr or creating Peer snapshot at the Cache/Primary and expecting it to replicate to the Home/Secondary.
Workaround	None

5.1.7.1

AFM

IJ46131

Adding or removing Gateway node roles in the cluster while active I/O is happening to an AFM fileset can cause deadlocks. Owing to how the node join/leave protocol is handled, one application node can consider a certain Gateway node to be the gateway for the fileset while other nodes consider different nodes to be the fileset's gateway.

Symptom	Deadlock
Environment	ALL Operating System environments
Trigger	Running mmchnode --gateway/--nogateway when there is Active I/O happening on AFM filesets.
Workaround	Avoid running mmchnode --gateway/--nogateway when there is Active I/O happening on AFM filesets.

5.1.7.1

AFM

IJ46132

If a node went down while uploading a file to COS, then after coming back up it tried to recover the file from the snapshot path instead of the live file system.

Symptom	Unexpected Results/Behavior
Environment	Linux Only
Trigger	While an upload is running or queued, run mmshutdown on the node or otherwise bring the node down.
Workaround	None

5.1.7.1

AFM

IJ46133

An AFM Gateway node hits an assertion when I/O is run from an application node to a dependent fileset inside an AFM independent fileset, or to a file system with AFM filesystem-level replication enabled.

Symptom	Crash
Environment	All Linux OS environments (AFM Gateway nodes)
Trigger	Running I/O to dependent fileset inside AFM independent fileset or to an AFM enabled Filesystem
Workaround	None

5.1.7.1

AFM

IJ46148

During a filesystem restripe, for example mmrestripefs -R, a file's replication setting may be changed if the file is ill-replicated, and quota is not handled correctly after the file data blocks are replicated or un-replicated as needed to match the new replication settings.
As a result, some quota accounting data becomes unreliable over time.

Symptom	Incorrect quota accounting data.
Environment	ALL Operating System environments
Trigger	Quota is not handled correctly by the logic that replicates or un-replicates data blocks.
Workaround	Run mmcheckquota to correct quota values.

5.1.7.1

Quotas

IJ45690

Unknown NFS errors were hit during recovery, and there was no way to bypass them so that recovery could go through.

Symptom	Unexpected Behavior
Environment	All Linux OS Environments (AFM Gateway nodes)
Trigger	Recovery unable to proceed upon hitting unknown persistent AFM Recovery errors.
Workaround	None

5.1.7.1

AFM

IJ46138

Prefetch support for skip-dirs needs to attempt removal of each skipped directory in a thread by setting a special GID at the binary level.
As a result, any other thread in the same binary that attempts a lookup with the remote site is treated as local, because the GID is treated as local.

Symptom	Unexpected Behavior
Environment	All Linux OS Environments (AFM Gateway nodes)
Trigger	Performing --directory prefetch with the --skip-dir-list-file option while changes need to be fetched in other directories at home.
Workaround	None

5.1.7.1

AFM

IJ46205

The tsapolicy adds each client process (agent) information to agentVctr to keep track of activities. If an agent is retrieved from agentVctr while a helper is being added, it could get a bogus agent address, which could result in a tsapolicy hang. Adding a lock while retrieving agent info avoids this problem.

Symptom	Component Level Outage
Environment	All platforms that support mmapplypolicy
Trigger	This problem could occur when mmapplypolicy is run with a large number of client nodes (-N option)
Workaround	None

5.1.7.1

mmapplypolicy
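
The remedy named in this entry, taking a lock while retrieving agent information, follows a common pattern: reads and appends on a shared container go through the same lock. A minimal Python sketch of that pattern follows. The class AgentRegistry and its methods are hypothetical stand-ins and do not reflect the actual tsapolicy data structures.

```python
import threading
from typing import List, Optional


class AgentRegistry:
    """Hypothetical stand-in for the agent vector described in this entry."""

    def __init__(self) -> None:
        self._agents: List[str] = []
        self._lock = threading.Lock()

    def add(self, agent: str) -> int:
        """Append an agent under the lock and return its index."""
        with self._lock:
            self._agents.append(agent)
            return len(self._agents) - 1

    def get(self, index: int) -> Optional[str]:
        """Look up an agent under the same lock, so a reader never
        observes the container while an append is in progress."""
        with self._lock:
            if 0 <= index < len(self._agents):
                return self._agents[index]
            return None


registry = AgentRegistry()
first_index = registry.add("agent-0")
```

In a language where growing a container can relocate its elements, an unguarded read during an append may see a stale address. Holding the lock on both paths is one way to rule that out.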

IJ46323

Systems running Scale v 5.1.5.x, 5.1.6.x, and 5.1.7.0 may experience an unexpected termination of mmfsd or mmsdrserv. The termination is visible in log entries.
It may be reflected in runtime assert messages on affected quorum nodes involving an invalid socket.

Symptom	Abend of mmsdrserv or mmfsd components, possible failure of CCR operations
Environment	ALL Operating System environments
Trigger	Systems running in clusters at the 5.1.5 or later level, especially (but not limited to) those on error-prone network connections, experience a socket leak that eventually triggers the subject assertion.
Workaround	None

5.1.7.1

CCR

IJ46327

getfacl may not display a POSIX default ACL that has been set on a directory.
This occurs in this situation:
- A default ACL is set on a directory in a Scale filesystem using setfacl, but not an access ACL.
- The filesystem is shared using the NFS server included with the operating system.
- The NFS client mounts the filesystem using NFS version 3.
Functionally things seem to work correctly even though getfacl is missing the default ACL information.

Symptom	Under certain circumstances, the getfacl command will not display information about the default ACLs that have been set on a directory using setfacl.
Environment	ALL Operating System environments
Trigger	getfacl may not display a POSIX default ACL that has been set on a directory. This occurs in this situation: - A default ACL is set on a directory in a Scale filesystem using setfacl, but not an access ACL. - The filesystem is shared using the NFS server included with the operating system. - The NFS client mounts the filesystem using NFS version 3.
Workaround	Also set the access ACL using setfacl on affected directories.

5.1.7.1

NFS and POSIX default ACLs

IJ45446

With QoS throttling configured on a subset of nodes in the cluster, I/O on the remaining client nodes without QoS throttling is unexpectedly and severely throttled.

Symptom	I/O hang
Environment	All Operating Systems
Trigger	Configure QoS throttling for a subset of nodes in the cluster.
Workaround	Create a node class for the non-QoS throttled nodes and set "unlimited" QoS throttling for that node class when configuring QoS for a subset of nodes in the cluster.

5.1.7.1

QoS

IJ45706

In a replicated file system (-r 2), when disks of a failure group are not available, e.g. one of the failure groups in two is suspended, the file writes succeed allocating disk space on the available failure group but only one replica per logical block is allocated - the file is ill-replicated.
In such a scenario, quota does not correctly handle the partially successful block allocation because the GetLocalQuota and FixLocalQuota routines are out of sync.
As a result, some quota shares (in-doubt) cannot be reclaimed, leading to an increase of in-doubt values over time.

Symptom	Quota in_doubt will not decrease after the workload has ceased.
Environment	ALL Operating System environments
Trigger	Unavailability of disks in an entire failure group in a replicated file system with two failure groups.
Workaround	Run mmcheckquota to correct outstanding in-doubt values.

5.1.7.1

Quotas

IJ46329

After enabling file audit logging, immediately listing or accessing the created audit log directory (SpectrumScale_XYZ) inside the .audit_log directory returns the “No such file or directory” message.
In addition, performing ls -l on the .audit_log directory returns ‘?’ in the output for the created audit log directory.

Symptom	Error output/message
Environment	Linux Only
Trigger	- Enable file audit logging - List or access the created audit log directory of the most recent enablement - Generate events for the fs - List the .audit_log directory
Workaround	Do not list the audit log directory right after enabling file audit logging.

5.1.7.1

File Audit Logging

IJ46394

The TCT recall process could fail or report some errors while deleting a non-resident (stub) file that is also in a snapshot.

Symptom	Unexpected behavior and results.
Environment	All Operating Systems
Trigger	Deleting a non-resident stub file that is also in a snapshot.
Workaround	Delete the snapshots that contain the non-resident stub file being deleted.

5.1.7.1

TCT migration/LWE

IJ45626

When any disk is added to a file system, the ill_unbalanced flag is set to indicate that the file system can be further rebalanced.
mmhealth sees this flag and downgrades the file system until mmrestripefs is run with the -b option.

Symptom	mmhealth reports the ill_unbalanced_fs state.
Environment	All Operating Systems
Trigger	Adding descOnly disk to a Scale file system.
Workaround	None

5.1.7.1

All Scale Users