Eosxd client regular crashes

franck-jrc · July 24, 2020, 1:01pm

We have another crash to report with a generated backtrace :

Stack trace (most recent call last) in thread 35074:
#10   Object ", at 0xffffffffffffffff, in 
#9    Object "/usr/lib64/libc.so.6, at 0x7ff8c583302c, in clone
#8    Source "pthread_create.c", line 0, in start_thread [0x7ff8c5b09dd4]
#7    Object "/opt/eos/xrootd/lib64/libXrdCl.so.2, at 0x7ff8c98752e8, in XrdCl::JobManager::RunJobs()
#6    Object "/opt/eos/xrootd/lib64/libXrdCl.so.2, at 0x7ff8c987508e, in XrdCl::JobManager::RunJobs()
#5    Object "/opt/eos/xrootd/lib64/libXrdCl.so.2, at 0x7ff8c980c72d, in XrdCl::Stream::HandleIncMsgJob::Run(void*)
#4    Object "/opt/eos/xrootd/lib64/libXrdCl.so.2, at 0x7ff8c982bc47, in XrdCl::XRootDMsgHandler::Process(XrdCl::Message*)
#3    Object "/opt/eos/xrootd/lib64/libXrdCl.so.2, at 0x7ff8c98281c8, in XrdCl::XRootDMsgHandler::HandleResponse()
#2    Object "/opt/eos/xrootd/lib64/libXrdCl.so.2, at 0x7ff8c9846fd5, in XrdCl::FileStateHandler::OnStateError(XrdCl::XRootDStatus*, XrdCl::Message*, XrdCl::ResponseHandler*, XrdCl::MessageSendParams&)
BFD: Dwarf Error: Could not find abbrev number 90.
BFD: Dwarf Error: Could not find abbrev number 84.
BFD: Dwarf Error: Could not find abbrev number 5766.
BFD: Dwarf Error: Could not find abbrev number 5766.
BFD: Dwarf Error: Offset (1994106625) greater than or equal to .debug_str size (19800455).
#1    Source "/opt/eos/xrootd/include/xrootd/XrdCl/XrdClXRootDResponses.hh", line 873, in  [0x692e3d]
        870:                                             HostList     *hostList )
        871:       {
        872:         delete hostList;
      > 873:         HandleResponse( status, response );
        874:       }
        875: 
        876:       //------------------------------------------------------------------------
#0    Source "/opt/rh/devtoolset-6/root/usr/include/c++/6.3.1/bits/stl_vector.h", line 656, in  [0x69ad3f]
Segmentation fault (Address not mapped to object [0x8])
# umounthandler: executing fusermount -u -z /eos/jeodpp# umounthandler: sighandler received signal 11 - emitting signal 11 again
###### cleaning stale cache directory '/var/cache/eos/fusex/md-cache/jeodpp/f3b1d0f6-51a5-11ea-affe-39c430d8ae18'
200721 17:45:34 t=1595346334.431071 f=mdcommunicate    l=WARN  tid=00007f202f3fa700 s=md:2691                  MGM asked us to set our heartbeat interval to 10 seconds, enable dentry-messaging, enable writesizeflush, accepts appname, accepts mdquery and server-version=4.5.15::1
200721 17:45:34 t=1595346334.529307 f=run              l=WARN  tid=00007f204499fe00 s=eosfuse:1541             ********************************************************************************
200721 17:45:34 t=1595346334.529331 f=run              l=WARN  tid=00007f204499fe00 s=eosfuse:1543             eosxd started version 4.5.15 - FUSE protocol version 28

CERN Accelerating science

Eosxd client regular crashes