Dear All!
I’m trying to initialize an ALICE EOS setup from scratch at KFKI, with no luck so far. As a first step I would like to start the MGM node and add some FST nodes later. I put the config together from different sources (including this forum), and I believe the answer to my problem is pretty trivial, so please excuse me for asking such a beginner question.
Currently I’m trying to start the EOS MGM service as master. The machine has CentOS 7-8.2003, and I try to run the service with the command systemctl start eos@master.
The status of the services:
# systemctl status eos@mgm -l
● eos@mgm.service - EOS mgm
Loaded: loaded (/usr/lib/systemd/system/eos@.service; disabled; vendor preset: disabled)
Active: active (running) since Thu 2020-09-17 17:20:39 UTC; 3min 47s ago
Process: 24263 ExecStartPre=/bin/sh -c /usr/sbin/eos_start_pre.sh eos-start-pre %i (code=exited, status=0/SUCCESS)
Main PID: 24297 (sh)
CGroup: /system.slice/system-eos.slice/eos@mgm.service
├─24297 /bin/sh -c XRDPROG=/usr/bin/xrootd; test -e /opt/eos/xrootd/bin/xrootd && XRDPROG=/opt/eos/xrootd/bin/xrootd; echo $XRDPROG ; $XRDPROG -n mgm -c /etc/xrd.cf.mgm -l /var/log/eos/xrdlog.mgm -s /tmp/xrootd.mgm.pid -Rdaemon
├─24299 /opt/eos/xrootd/bin/xrootd -n mgm -c /etc/xrd.cf.mgm -l /var/log/eos/xrdlog.mgm -s /tmp/xrootd.mgm.pid -Rdaemon
├─24412 /opt/eos/xrootd/bin/xrootd -n mgm -c /etc/xrd.cf.mgm -l /var/log/eos/xrdlog.mgm -s /tmp/xrootd.mgm.pid -Rdaemon
└─24510 eos -b console log _MGMID_
Sep 17 17:20:39 eos-mgm1.alice-af.wigner.hu sh[24297]: /opt/eos/xrootd/bin/xrootd
Sep 17 17:20:40 eos-mgm1.alice-af.wigner.hu sh[24297]: Register objects provide by NsInMemoryPlugin ...
Sep 17 17:20:40 eos-mgm1.alice-af.wigner.hu sh[24297]: =====> XrdAliceTokenAcc: Public key in use is EXPORT PATH:/ VO:* ACCESS:ALLOW CERT:*
Sep 17 17:20:40 eos-mgm1.alice-af.wigner.hu sh[24297]: =====> XrdAliceTokenAcc: Cannot load public key !
Sep 17 17:20:40 eos-mgm1.alice-af.wigner.hu sh[24297]: =====> XrdAliceTokenAcc: Public key in use is RULE PATH:/ AUTHZ:delete|read|write|write-once| NOAUTHZ:| VO:*| CERT:IGNORE
Sep 17 17:20:40 eos-mgm1.alice-af.wigner.hu sh[24297]: =====> XrdAliceTokenAcc: Cannot load public key !
Sep 17 17:20:40 eos-mgm1.alice-af.wigner.hu sh[24297]: =====> XrdAliceTokenAcc: Public key in use is RULE PATH:/eos/alicekfki/grid/ AUTHZ:| NOAUTHZ:delete|read|write|write-once| VO:*| CERT:IGNORE
Sep 17 17:20:40 eos-mgm1.alice-af.wigner.hu sh[24297]: =====> XrdAliceTokenAcc: Cannot load public key !
Sep 17 17:20:40 eos-mgm1.alice-af.wigner.hu sh[24297]: =====> XrdAliceTokenAcc: Public key in use is KEY VO:* PRIVKEY:/etc/grid-security/xrootd/privkey.pem PUBKEY:/etc/grid-security/xrootd/pubkey.pem
Sep 17 17:20:40 eos-mgm1.alice-af.wigner.hu sh[24297]: =====> XrdAliceTokenAcc: Cannot load public key !
# systemctl status eos@master -l
● eos@master.service - EOS Master
Loaded: loaded (/usr/lib/systemd/system/eos@master.service; disabled; vendor preset: disabled)
Active: inactive (dead)
Sep 17 16:35:45 eos-mgm1.alice-af.wigner.hu systemd[1]: Started EOS Master.
Sep 17 16:41:49 eos-mgm1.alice-af.wigner.hu systemd[1]: Starting EOS Master...
Sep 17 16:41:49 eos-mgm1.alice-af.wigner.hu echo[22106]: Configured on localhost as master
Sep 17 16:41:49 eos-mgm1.alice-af.wigner.hu systemd[1]: Started EOS Master.
Sep 17 16:45:16 eos-mgm1.alice-af.wigner.hu systemd[1]: Starting EOS Master...
Sep 17 16:45:16 eos-mgm1.alice-af.wigner.hu echo[22373]: Configured on localhost as master
Sep 17 16:45:16 eos-mgm1.alice-af.wigner.hu systemd[1]: Started EOS Master.
Sep 17 16:46:34 eos-mgm1.alice-af.wigner.hu systemd[1]: Starting EOS Master...
Sep 17 16:46:34 eos-mgm1.alice-af.wigner.hu echo[22467]: Configured on localhost as master
Sep 17 16:46:34 eos-mgm1.alice-af.wigner.hu systemd[1]: Started EOS Master.
Some output of xrdlog.mgm:
....
200917 17:23:30 time=1600363410.761590 func=Recycler level=INFO logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94573fb700 source=Recycle:99 tident= sec=(null) uid=99 gid=99 name=- geo="" snooze-time=30
200917 17:23:30 time=1600363410.786900 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94637fd700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://daemon@localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/report_mq_test?xmqclient.advisory.flushbacklog=0&xmqclient.advisory.query=0&xmqclient.advisory.status=0"
200917 17:23:32 time=1600363412.676406 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f9478f0f700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/mgm_mq_test?xmqclient.advisory.flushbacklog=1&xmqclient.advisory.query=1&xmqclient.advisory.status=1"
200917 17:23:32 time=1600363412.787787 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94637fd700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://daemon@localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/report_mq_test?xmqclient.advisory.flushbacklog=0&xmqclient.advisory.query=0&xmqclient.advisory.status=0"
200917 17:23:34 time=1600363414.677352 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f9478f0f700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/mgm_mq_test?xmqclient.advisory.flushbacklog=1&xmqclient.advisory.query=1&xmqclient.advisory.status=1"
200917 17:23:34 time=1600363414.788693 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94637fd700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://daemon@localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/report_mq_test?xmqclient.advisory.flushbacklog=0&xmqclient.advisory.query=0&xmqclient.advisory.status=0"
200917 17:23:36 time=1600363416.678341 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f9478f0f700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/mgm_mq_test?xmqclient.advisory.flushbacklog=1&xmqclient.advisory.query=1&xmqclient.advisory.status=1"
200917 17:23:36 time=1600363416.789539 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94637fd700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://daemon@localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/report_mq_test?xmqclient.advisory.flushbacklog=0&xmqclient.advisory.query=0&xmqclient.advisory.status=0"
200917 17:23:38 time=1600363418.679123 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f9478f0f700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/mgm_mq_test?xmqclient.advisory.flushbacklog=1&xmqclient.advisory.query=1&xmqclient.advisory.status=1"
200917 17:23:38 time=1600363418.790373 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94637fd700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://daemon@localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/report_mq_test?xmqclient.advisory.flushbacklog=0&xmqclient.advisory.query=0&xmqclient.advisory.status=0"
200917 17:23:40 time=1600363420.680030 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f9478f0f700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/mgm_mq_test?xmqclient.advisory.flushbacklog=1&xmqclient.advisory.query=1&xmqclient.advisory.status=1"
200917 17:23:40 time=1600363420.791211 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94637fd700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://daemon@localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/report_mq_test?xmqclient.advisory.flushbacklog=0&xmqclient.advisory.query=0&xmqclient.advisory.status=0"
200917 17:23:42 time=1600363422.681025 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f9478f0f700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/mgm_mq_test?xmqclient.advisory.flushbacklog=1&xmqclient.advisory.query=1&xmqclient.advisory.status=1"
200917 17:23:42 time=1600363422.792145 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94637fd700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://daemon@localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/report_mq_test?xmqclient.advisory.flushbacklog=0&xmqclient.advisory.query=0&xmqclient.advisory.status=0"
200917 17:23:44 time=1600363424.681865 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f9478f0f700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/mgm_mq_test?xmqclient.advisory.flushbacklog=1&xmqclient.advisory.query=1&xmqclient.advisory.status=1"
200917 17:23:44 time=1600363424.792988 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94637fd700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://daemon@localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/report_mq_test?xmqclient.advisory.flushbacklog=0&xmqclient.advisory.query=0&xmqclient.advisory.status=0"
200917 17:23:46 time=1600363426.682523 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f9478f0f700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/mgm_mq_test?xmqclient.advisory.flushbacklog=1&xmqclient.advisory.query=1&xmqclient.advisory.status=1"
200917 17:23:46 time=1600363426.793828 func=RefreshBrokersEndpoints level=ERROR logid=static.............................. unit=mgm@eos-mgm1.alice-af.wigner.hu:1094 tid=00007f94637fd700 source=XrdMqClient:498 tident= sec=(null) uid=99 gid=99 name=- geo="" msg="failed to contact broker" url="root://daemon@localhost:1097//eos/eos-mgm1.alice-af.wigner.hu/report_mq_test?xmqclient.advisory.flushbacklog=0&xmqclient.advisory.query=0&xmqclient.advisory.status=0"
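Every error above points at the same broker endpoint, localhost:1097. A minimal sketch for checking whether anything accepts TCP connections on that port is shown here; the helper name `port_is_open` is hypothetical, and a successful connection only shows that some process is listening, not that it is a working EOS broker.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts and name resolution failures.
        return False


if __name__ == "__main__":
    # Host and port are taken from the broker URL in the log lines above.
    print("broker port reachable:", port_is_open("localhost", 1097))
```

If this reports that the port is not reachable, the "failed to contact broker" messages would be consistent with nothing listening on 1097 on this machine.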
The relevant part of /etc/sysconfig/eos_env:
#-------------------------------------------------------------------------------
# EOS roles - Systemd Services
#-------------------------------------------------------------------------------
XRD_ROLES="mq mgm fed sync"
#-------------------------------------------------------------------------------
# EOS Configuration
#-------------------------------------------------------------------------------
# The fully qualified hostname of current MGM
EOS_MGM_HOST=mgm-master.localdomain
# The fully qualified hostname of target MGM
EOS_MGM_HOST_TARGET=mgm-slave.localdomain
# The EOS instance name
EOS_INSTANCE_NAME=eosalicekfki
# The EOS configuration to load after daemon start
EOS_AUTOLOAD_CONFIG=default
# The EOS broker URL
EOS_BROKER_URL=root://localhost:1097//eos/
# The EOS host geo location tag used to sort hosts into geographical (rack) locations
EOS_GEOTAG=""
# The alias which selects master 1 or 2
EOS_MGM_ALIAS="eos-mgm1.alice-af.wigner.hu"
# The fully qualified hostname of MGM master1
EOS_MGM_MASTER1="${EOS_MGM_ALIAS}"
# The fully qualified hostname of MGM master2
EOS_MGM_MASTER2="${EOS_MGM_ALIAS}"
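In the excerpt above, EOS_MGM_HOST and EOS_MGM_HOST_TARGET still end in .localdomain, while EOS_MGM_ALIAS carries the real hostname. The following is a rough sketch for listing such values; `parse_env` and `placeholder_hosts` are hypothetical helpers, the parser handles only simple KEY=VALUE lines, and whether these values matter for the broker errors is an assumption, not something established here.

```python
import shlex

# A few lines copied from the eos_env excerpt above.
SAMPLE = '''
EOS_MGM_HOST=mgm-master.localdomain
EOS_MGM_HOST_TARGET=mgm-slave.localdomain
EOS_BROKER_URL=root://localhost:1097//eos/
EOS_MGM_ALIAS="eos-mgm1.alice-af.wigner.hu"
EOS_MGM_MASTER1="${EOS_MGM_ALIAS}"
'''


def parse_env(text: str) -> dict:
    """Parse simple KEY=VALUE lines, skipping comments and blank lines."""
    env = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, raw = line.partition("=")
        parts = shlex.split(raw)  # strips surrounding quotes
        value = parts[0] if parts else ""
        # Expand ${NAME} references to keys defined earlier in the file.
        for name, val in env.items():
            value = value.replace("${" + name + "}", val)
        env[key.strip()] = value
    return env


def placeholder_hosts(env: dict, suffix: str = ".localdomain") -> list:
    """Return the keys whose value still ends with the given suffix."""
    return sorted(k for k, v in env.items() if v.endswith(suffix))


if __name__ == "__main__":
    print(placeholder_hosts(parse_env(SAMPLE)))
```

To check the real file, the text of /etc/sysconfig/eos_env can be passed to `parse_env` instead of `SAMPLE`.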
I’m really stuck, so any help is more than welcome.
Thanks,
Gabor