We are running ECC 5.0 with the 640_EX2 kernel at patch level 304.
Our production system runs in a cluster HA scenario.
The DB and CI are in the cluster, and 18 application servers are connected to the CI.
We have faced the following problem repeatedly on this server.
Problem: We are not able to log in to the CI server.
Observation:
1. All DIAG WPs are stuck on the CI server.
The DIAG WPs are in running state, but nothing is actually being executed (no report or anything else is shown). We tried to kill a few DIAG WPs, but they were not released. We monitored this at OS level (AIX 5.3) using DPMON.
2. Lock errors were found in the SM21 log.
No further activity was possible on the server, because no more locks could be set and lock errors kept occurring.
The issue was resolved after restarting the CI server and manually cleaning up the leftover IPC resources.
Kindly let us know where and how we can check the cause of such an incident, and what precautions we can take.
Is this a known issue, and what could be the reason?