We have migrated our SAP Landscape 5 months back from HP-UX/Oracle to Windows 2012 R2 and SQL Server 2012 R2 SP2. The servers are VM (VMware 5.1) on HP ESX host. From past 3 months we have started getting below error which is very random. This is happening in SAP BI system.Jobs get cancelled with the message saying System Shutdown and on checking the SM21 logs it shows “operating system call receive failed (error no. 10054)”
The installation is distributed with DB on one host and CI & App on other hosts within same data center.
At the same time there is a dump in ST22
Things which we have already done:
Scheduled niping with the option long Lan stability test, but there doesn’t seems to any drop. It can be because there was no such incident where the error occurred and niping was running, as the error is very random. Happened twice in April and then in Jul.
This issue is not happening in QA which is having same setting as Prod except for the fact that it is kept in different DC. Also we are not able to replicate the issue as an when desired as it is too random.
Any help or direction to troubleshoot will be a great help.