We recently experienced a situation in which one of our AIX application servers went into memory distress. The server has been in production for a significant amount of time, and four other servers share the production load. None of them has ever experienced this, and it is a first for this particular server as well.
Via SM04 I observed that a user with only a single session was using 32538 megabytes of memory. Without looking any further, I went to SM50 and cancelled, without core, the work process that was executing this thread. The memory was cleared up almost instantly and the server returned to normal operation.
My question is: with the memory parameters as defined below, how could this have happened? I would have thought that long before it got to this point, it would have short-dumped with TSV_TNEW_PAGE_ALLOC_FAILED or something similar.
Thanks for any insight,
OS = AIX
Physical Memory = 12 GB
Paging Space = 20 GB