PQ85848: A recursive call in the WLM code causes an out of memory error or Java segmentation fault in a workload managed environment.
APAR status
Closed as program error.

Error description
PROBLEM: Threads in a workload management environment can make many calls that can cause recursion. The stack for the problem is shown below:

com.ibm.ws.wlm.client.selection.NoUsableTargetException com.ibm.ws.wlm.client.selection.MasterProxy.getNextClone 262
Exception = com.ibm.ws.wlm.client.selection.NoUsableTargetException
Source = com.ibm.ws.wlm.client.selection.MasterProxy.getNextClone
probeid = 262
Stack Dump = com.ibm.ws.wlm.client.selection.NoUsableTargetException
    at com.ibm.ws.wlm.client.selection.WeightedOutstandingRequestTargetSelector.getNextClone(WeightedOutstandingRequestTargetSelector.java(Compiled Code))
    at com.ibm.ws.wlm.client.selection.MasterProxy.getNextClone(MasterProxy.java(Compiled Code))
    at com.ibm.ws.wlm.client.selection.SelectionManager.getTarget(SelectionManager.java(Compiled Code))
    at com.ibm.ws.wlm.client.WLMClient.getNextTarget(WLMClient.java(Compiled Code))
    at com.ibm.ws.wlm.client.WLMClient.handleRemoteException(WLMClient.java(Compiled Code))
    at com.ibm.rmi.corba.ClientDelegate._createRequest_WLM(ClientDelegate.java(Compiled Code))
    at com.ibm.rmi.corba.ClientDelegate.createRequest(ClientDelegate.java(Compiled Code))
    at com.ibm.rmi.corba.ClientDelegate.createRequest(ClientDelegate.java(Inlined Compiled Code))
    at com.ibm.rmi.corba.ClientDelegate.request(ClientDelegate.java(Compiled Code))
    at org.omg.CORBA.portable.ObjectImpl._request(ObjectImpl.java(Inlined Compiled Code))
    at com.ibm.WsnOptimizedNaming._NamingContextStub.get_context_id_string(Unknown Source)(Compiled Code)
    at com.ibm.WsnOptimizedNaming._NamingContextStub.get_context_id_string(Unknown Source)(Compiled Code)
    at com.ibm.WsnOptimizedNaming._NamingContextStub.get_context_id_string(Unknown Source)(Compiled Code)
    [the same get_context_id_string frame repeats many more times]

Local fix
Defect 168004 has been created to limit the number of requests that a thread can make in a WLM environment before it quits, which prevents this recursion.

Problem summary
USERS AFFECTED: WebSphere Application Server users of WLM, or Workload Management.
PROBLEM DESCRIPTION: A recursive call in the WLM code causes an out of memory error or Java segmentation fault in a workload managed environment.
RECOMMENDATION:

A recursive call in the WLM code causes an out of memory error or Java segmentation fault in a workload managed environment. Threads in a workload management environment can make calls that cause deep recursion, resulting in the out of memory error. The stack below is an example of a symptom of the problem that can be found in an FFDC log. In this case, though, the out of memory error did not occur because the NoUsableTargetException was eventually thrown. The problem occurs when the exception is not thrown.

com.ibm.ws.wlm.client.selection.MasterProxy.getNextClone 262
Exception = com.ibm.ws.wlm.client.selection.NoUsableTargetException
Source = com.ibm.ws.wlm.client.selection.MasterProxy.getNextClone
probeid = 262
Stack Dump = com.ibm.ws.wlm.client.selection.NoUsableTargetException
    at com.ibm.ws.wlm.client.selection.WeightedOutstandingRequestTargetSelector.getNextClone(WeightedOutstandingRequestTargetSelector.java(Compiled Code))
    at com.ibm.ws.wlm.client.selection.MasterProxy.getNextClone(MasterProxy.java(Compiled Code))
    at com.ibm.ws.wlm.client.selection.SelectionManager.getTarget(SelectionManager.java(Compiled Code))
    at com.ibm.ws.wlm.client.WLMClient.getNextTarget(WLMClient.java(Compiled Code))
    at com.ibm.ws.wlm.client.WLMClient.handleRemoteException(WLMClient.java(Compiled Code))
    at com.ibm.rmi.corba.ClientDelegate._createRequest_WLM(ClientDelegate.java(Compiled Code))
    at com.ibm.rmi.corba.ClientDelegate.createRequest(ClientDelegate.java(Compiled Code))
    at com.ibm.rmi.corba.ClientDelegate.createRequest(ClientDelegate.java(Inlined Compiled Code))
    at com.ibm.rmi.corba.ClientDelegate.request(ClientDelegate.java(Compiled Code))
    at org.omg.CORBA.portable.ObjectImpl._request(ObjectImpl.java(Inlined Compiled Code))
    at com.ibm.WsnOptimizedNaming._NamingContextStub.get_context_id_string(Unknown Source)(Compiled Code)
    at com.ibm.WsnOptimizedNaming._NamingContextStub.get_context_id_string(Unknown Source)(Compiled Code)
    at com.ibm.WsnOptimizedNaming._NamingContextStub.get_context_id_string(Unknown Source)(Compiled Code)
    at com.ibm.WsnOptimizedNaming._NamingContextStub.get_context_id_string(Unknown Source)(Compiled Code)
    at com.ibm.WsnOptimizedNaming._NamingContextStub.get_context_id_string(Unknown Source)(Compiled Code)
    at com.ibm.WsnOptimizedNaming._NamingContextStub.get_context_id_string(Unknown Source)(Compiled Code)
    ...
    ...

Problem conclusion
A check was needed to limit the depth of the calls.

Temporary fix
When recycling the application servers and node agents, stop the deployment manager last. Then restart the deployment manager first, before restarting any of the node agents or application servers.

Comments
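This record states only that a check was added to limit the depth of the calls; the actual WLM change is not shown. As an illustration of the general idea, the following minimal Java sketch replaces retry-by-recursion with a loop that gives up after a fixed number of attempts. The class `BoundedRetry`, its `maxAttempts` parameter, and the nested `NoUsableTargetException` are hypothetical stand-ins for illustration, not the WebSphere classes named in the stack trace.

```java
import java.util.List;
import java.util.function.Predicate;

/**
 * Hypothetical sketch: bounded target selection.
 * None of these names come from the WebSphere code base.
 */
public class BoundedRetry {

    /** Illustrative stand-in for the exception seen in the trace. */
    public static class NoUsableTargetException extends Exception {
        public NoUsableTargetException(String message) {
            super(message);
        }
    }

    private final int maxAttempts;

    public BoundedRetry(int maxAttempts) {
        if (maxAttempts < 1) {
            throw new IllegalArgumentException("maxAttempts must be at least 1");
        }
        this.maxAttempts = maxAttempts;
    }

    /**
     * Cycles through the targets in order and returns the first usable one.
     * The loop keeps the stack depth constant and always terminates:
     * after maxAttempts failed tries it throws instead of retrying again.
     */
    public <T> T selectTarget(List<T> targets, Predicate<T> isUsable)
            throws NoUsableTargetException {
        if (targets.isEmpty()) {
            throw new NoUsableTargetException("no targets configured");
        }
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            T candidate = targets.get(attempt % targets.size());
            if (isUsable.test(candidate)) {
                return candidate;
            }
        }
        throw new NoUsableTargetException(
                "gave up after " + maxAttempts + " attempts");
    }
}
```

With a limit in place, a caller that never finds a usable target receives an exception after a known number of tries, rather than growing the call stack until the JVM fails.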
APAR is sysrouted FROM one or more of the following:
APAR is sysrouted TO one or more of the following:
Modules/Macros
Publications Referenced
Product categories: Software > Application Servers >
Distributed Application & Web Servers > WebSphere Application
Server > General
Operating system(s):
Software version: 00A
Software edition:
Reference #: PQ85848
IBM Group: Software Group
Modified date: Jul 21, 2004
(C) Copyright IBM Corporation 2000, 2008. All Rights Reserved.