Skip to Content
0
Former Member
Oct 28, 2013 at 05:51 AM

Need help - statisticsserver does not restart (Rev 62)

55 Views

Hi

We are on AWS HANA. Rev 62. Repeatedly the HANA system goes haywire (i.e., not able to replicate query results) and then the restart fails. We have tried getting a new instance on AWS and seeding it with a backup. After some hours that too crashes the same way. This is the fourth system. We also had similar problems with Rev 52 and Rev 60.

I don't know what is wrong in what we are doing, but we would greatly appreciate some help in getting it back on its feet.

Thanks a lot.

Regards

Dinesh

The statisticsserver trace log says:

[BUILD] build information: (2013-10-28 05:34:33 904 Local)

Version : 1.00.66.382664 (NewDB100_REL)

Build host : ld7270.wdf.sap.corp

Build time : 2013-09-14 16:41:37

Platform : linuxx86_64

Compiler : cc (SAP release 20130125, based on SUSE gcc43-4.3.4_20091019-0.22.17.5096.1.PTF.799300) 4.3.4 [gcc-4_3-branch revision 152973]

Branch : NewDB100_REL

Git hash : not set

Git mergetime : not set

Weekstone : 0000.00.0

--

[CRASH_SHORTINFO] exception short info: (2013-10-28 05:34:33 904 Local)

SIGNAL 6 (SIGABRT) caught, sender PID: 9664, PID: 9664, thread: 3096[thr=9747]: Assign, value int: 2356, ptr: 0x0000000000000934, time: 2013-10-28 05:34:33 000 Local

Instance HDB/00, OS Linux hanaserver 2.6.32.27-0.2-default #1 SMP 2010-12-29 15:03:02 +0100 x86_64

--

[CRASH_CONTEXT] context info: (2013-10-28 05:34:33 905 Local)

----> Crashing context information <----

ContextStack at (0x00007fb070bc8160)

stack: 7fb070aca000-7fb070bc9fff, size 1048576

guard: 7fb070aba000-7fb070ac9fff, size 65536

last known stack pointer cur: 0, max: 0

ctx addr: 0x00007fb1c462d000, ctx link: 0x00007fb1c462d000, ctx owner: 0x00007fb1c462d000

ctx name: Assign, ctx type: thread, ctx id: 3096

ctx command text:

ctx update transactionID : -1

ctx transactionID: -1

ctx connectionID : -1

ctx sqlusername:

ctx appusername:

[OK]

--

[CRASH_STACK] stacktrace of crash: (2013-10-28 05:34:33 905 Local)

----> Pending exceptions (possible root cause) <----

exception 1: no.2100002 (ptime/storage/recovery/RowStoreTransactionCallback.cc:391)

log position inconsistency detected - plz contact to HANA dev support before clearing or throwing away your redo log files

exception throw location:

1: 0x00007fb1dd69f41d in ptime::RowStoreTransactionCallback::replayQueryOpenTransactionState(ltt::vector const&, ltt::vector >&)+0xab9 at RowStoreTransactionCallback.cc:391 ( libhdbrskernel.so)

2: 0x00007fb1eedc5612 in DataRecovery::RecoveryHandlerImpl::finishRecovery(bool)+0x3f0 at RecoveryHandlerImpl.cpp:1600 (libhdbdataaccess.so)

3: 0x00007fb1eebaf1f5 in DataAccess::PersistenceManagerImpl::replayOrClearLog(bool, bool)+0xa1 at PersistenceManagerImpl.cpp:3596 (libhdbdataaccess.so)

4: 0x00007fb1eebaf605 in DataAccess::PersistenceManager::replay()+0xa1 at PersistenceManagerImpl.cpp:1828 (libhdbdataaccess.so)

5: 0x00007fb1fe080ecc in PersistenceLayer::PersistenceSystem::start(NameServer::ServiceStartInfo const&, bool, bool)+0x268 at PersistenceSystem.cpp:445 (libhdbpersistence.so)

6: 0x00007fb1fe071f40 in PersistenceLayer::PersistenceFactory::startPersistence(NameServer::ServiceStartInfo&, bool, bool, DataAccess::TablePreloadWriteCallback*)+0xc0 at PersistenceFactory.cpp:353 (libhdbpersistence.so)

7: 0x00007fb1fddab65d in PersistenceController::startup(PersistenceLayer::PERSISTENCE_MODE, NameServer::ServiceStartInfo*, bool, DataAccess::TablePreloadWriteCallback*, DataAccess::TablePreloadReadCallback*)+0x519 at PersistenceController.cpp:512 (libhdblogger.so)

8: 0x00000000004b0388 in TRexAPI::TREXIndexServer::assign(NameServer::ServiceStartInfo&, bool, TREX_ERROR::TRexError&)+0x484 at TREXIndexServer.cpp:537 (hdbstatisticsserver)

9: 0x000000000052a85d in TRexAPI::AssignThread::run(void*)+0x39 at TREXIndexServer.cpp:338 (hdbstatisticsserver)

10: 0x00007fb1fe9040c7 in TrexThreads::PoolThread::run()+0x9a3 at PoolThread.cpp:276 (libhdbbasement.so)

11: 0x00007fb1fe9062a8 in TrexThreads::PoolThread::run(void*&)+0x14 at PoolThread.cpp:124 (libhdbbasement.so)

12: 0x00007fb1f14af615 in Execution::Thread::staticMainImp(void**)+0x681 at Thread.cpp:476 (libhdbbasis.so)

13: 0x00007fb1f14af7bd in Execution::Thread::staticMain(void*)+0x39 at Thread.cpp:544 (libhdbbasis.so)

There is a lot more in the trace file, I don't know how much is directly relevant here.