PK11595: WEBSPHERE APPLICATION SERVER DRS DEADLOCK IN THE DISTHUB

 A fix is available

Obtain the fix for this APAR



APAR status
Closed as program error.

Error description
Roll up of Distributed apars relating to DRS :
.

PK08259 - DEAD LOCK, HANG IN DRS -- IN METHOD DRSPOOLS.GETACCJMS

PK07887 - DURING PERF TEST,MEMORY GROWS FROM 600MB TO1.5GB
          OVER 45 MINS POOLED TOPICSUBSCRIBER WRAPPERS
          ARE NOT GETTING UNSUBSCRIBED
Local fix Problem summary
****************************************************************
* USERS AFFECTED: All users of WebSphere Application Server    *
*                 V5.0 for z/OS                                *
****************************************************************
* PROBLEM DESCRIPTION: Hang caused by lost notify in           *
*                      replication service: getAccJMS          *
****************************************************************
* RECOMMENDATION:                                              *
****************************************************************
Application server hang caused by lost notify in wait/notify
operation in DRSPools.getAccJMS of replication service.
Sequential javacores/threaddumps showed many threads blocked,
unchanging and waiting in DRSPools.getAccJMS.
Problem conclusion
The synchronization on the lock for the pooled objects was
reworked to eliminate the deadlock.

The problem for which this APAR was opened corresponds to
distributed APAR 
PK08259.  In order to port the code for 
PK08259
to z/OS, it was necessary to roll up the z/OS Data Replication
Service (DRS) component code to the latest DRS code planned
for distributed V5.1.1.7 systems (level cf70537.02).

The following defects are included in the Rollup:

  Defect       Abstract

  
PK07187      Hang in data replication server (DRS)
               object replication operations.

  
PK07887      Replication causes memory growth when resets occu

  
PK07954      Replication communication times out when another
               instance has been stopped.

  
PK08259      Hang caused by lost notify in replication
               service: getAccJMS

  
PK11117      Data Replication Domain has an option to encrypt
               the data during replication. It was observed that
               ClassCastExceptions were thrown by the server
               receiving the replica. This problem does not
               happen if encryption is enabled.  This exception
               results in the update to the session replica not
               being saved..

APAR PK11595 is associated with SERVICE LEVEL W502035 of
WebSphere Application Server V5.0 for z/OS.
Temporary fix Comments
APAR information
APAR number PK11595
Reported component name WEBSPHERE FOR Z
Reported component ID 5655I3500
Reported release 500
Status CLOSED PER
PE NoPE
HIPER NoHIPER
Special Attention NoSpecatt
Submitted date 2005-09-08
Closed date 2005-10-21
Last modified date 2005-11-03

APAR is sysrouted FROM one or more of the following:

APAR is sysrouted TO one or more of the following:
PK11597

Modules/Macros
BBOUBINF          

Publications Referenced

Fix information
Fixed component name WEBSPHERE FOR Z
Fixed component ID 5655I3500

Applicable component levels
R500 PSY UK08352    UP05/10/27 P F510

  Fix is available
Select the PTF appropriate for your component level. You will be required to sign in. Distribution on physical media is not available in all countries.


Document Information


Current web document: swg1PK11595.html
Product categories: Software > Application Servers > Distributed Application & Web Servers > WebSphere Application Server for z/OS
Operating system(s):
Software version: 500
Software edition:
Reference #: PK11595
IBM Group: Software Group
Modified date: Nov 3, 2005