CERN Accelerating science

Enable third party copy :


(Denis Pugnere) #1

Hi EOS community,

I try to enable TPC on my EOS instance:

$ xrdcp root://np02ss18.cern.ch//eos/users/dpugnere/toto-100M .
[100MB/100MB][100%][==================================================][100MB/s]  
$ xrdcp toto-100M root://eosuser.cern.ch//eos/user/d/dpugnere
[100MB/100MB][100%][==================================================][100MB/s]  
$ xrdfs eosuser.cern.ch rm /eos/user/d/dpugnere/toto-100M
$ xrdcp --debug=2 --tpc only root://np02ss18.cern.ch//eos/users/dpugnere/toto-100M root://eosuser.cern.ch//eos/user/d/dpugnere
Run: [ERROR] Server responded with an error: [3005] TPC open failed

debug extract :

...
[2018-04-05 11:57:56.643883 +0200][Debug  ][Utility           ] Checking if the data server np02ss06.cern.ch:1095 supports tpc
[2018-04-05 11:57:56.644317 +0200][Debug  ][Utility           ] Third party copy supported at: np02ss06.cern.ch:1095
...
[2018-04-05 11:57:56.703536 +0200][Debug  ][File              ] [0x15e0f50@root://eosuser.cern.ch:1094//eos/user/d/dpugnere//toto-100M?oss.asize=104857600&tpc.key=0009d5ab2a9849315ac5f324&tpc.lfn=/eos/users/dpugnere/toto-100M&tpc.src=np02ss06.cern.ch:1095&tpc.stage=copy] Sending a sync command for handle 0x0 to p06253937w63885.cern.ch:1095
[2018-04-05 11:57:56.709286 +0200][Debug  ][XRootD            ] [p06253937w63885.cern.ch:1095] Handling error while processing kXR_sync (handle: 0x00000000): [ERROR] Error response: Input/output error.
[2018-04-05 11:57:56.709341 +0200][Error  ][File              ] [0x15e0f50@root://eosuser.cern.ch:1094//eos/user/d/dpugnere//toto-100M?oss.asize=104857600&tpc.key=0009d5ab2a9849315ac5f324&tpc.lfn=/eos/users/dpugnere/toto-100M&tpc.src=np02ss06.cern.ch:1095&tpc.stage=copy] Fatal file state error. Message kXR_sync (handle: 0x00000000) returned with [ERROR] Server responded with an error: [3005] TPC open failed
[2018-04-05 11:57:57.703757 +0200][Debug  ][File              ] [0x15e0f50@root://eosuser.cern.ch:1094//eos/user/d/dpugnere//toto-100M?oss.asize=104857600&tpc.key=0009d5ab2a9849315ac5f324&tpc.lfn=/eos/users/dpugnere/toto-100M&tpc.src=np02ss06.cern.ch:1095&tpc.stage=copy] Sending a stat command for handle 0x0 to p06253937w63885.cern.ch:1095
^M[0B/100MB][  0%][>                                                 ][0B/s]  [2018-04-05 11:57:57.704467 +0200][Error  ][Utility           ] Third party copy from root://np02ss18.cern.ch:1094//eos/users/dpugnere/toto-100M to root://eosuser.cern.ch:1094//eos/user/d/dpugnere//toto-100M failed: [ERROR] Server responded with an error: [3005] TPC open failed
...
[2018-04-05 11:57:57.709696 +0200][Debug  ][File              ] [0x15e0f50@root://eosuser.cern.ch:1094//eos/user/d/dpugnere//toto-100M?oss.asize=104857600&tpc.key=0009d5ab2a9849315ac5f324&tpc.lfn=/eos/users/dpugnere/toto-100M&tpc.src=np02ss06.cern.ch:1095&tpc.stage=copy] Close returned from p06253937w63885.cern.ch:1095 with: [ERROR] Server responded with an error: [3007] Unable to store file - file has been cleaned because the stored file does not match the provided targetsize /eos/user/d/dpugnere/toto-100M; input/output error
...

on the MGM (np02ss18.cern.ch) /var/log/eos/mgm/xrdlog.mgm :

180405 11:57:56 time=1522922276.636682 func=open                     level=INFO  logid=d0b30b6e-38b7-11e8-8058-f4e9d4cf7750 unit=mgm
@np02ss18.cern.ch:1094 tid=00007f0d785fc700 source=XrdMgmOfsFile:191              tident=dpugnere.18737:108@np02ss19 sec=krb5  uid=2
8122 gid=1395 name=dpugnere geo="" op=read path=/eos/users/dpugnere/toto-100M info=tpc.stage=placement
180405 11:57:56 time=1522922276.636754 func=open                     level=INFO  logid=d0b30b6e-38b7-11e8-8058-f4e9d4cf7750 unit=mgm
@np02ss18.cern.ch:1094 tid=00007f0d785fc700 source=XrdMgmOfsFile:615              tident=dpugnere.18737:108@np02ss19 sec=krb5  uid=2
8122 gid=1395 name=dpugnere geo="" msg="client authenticated as directory owner" path="/eos/users/dpugnere/toto-100M"uid="28122=>139
5" gid="28122=>2841"
180405 11:57:56 time=1522922276.636781 func=open                     level=INFO  logid=d0b30b6e-38b7-11e8-8058-f4e9d4cf7750 unit=mgm
@np02ss18.cern.ch:1094 tid=00007f0d785fc700 source=XrdMgmOfsFile:635              tident=dpugnere.18737:108@np02ss19 sec=krb5  uid=2
8122 gid=2841 name=dpugnere geo="" acl=0 r=0 w=0 wo=0 egroup=0 shared=0 mutable=1
180405 11:57:56 time=1522922276.636943 func=open                     level=INFO  logid=d0b30b6e-38b7-11e8-8058-f4e9d4cf7750 unit=mgm@np02ss18.cern.ch:1094 tid=00007f0d785fc700 source=XrdMgmOfsFile:2338             tident=dpugnere.18737:108@np02ss19 sec=krb5  uid=28122 gid=2841 name=dpugnere geo="" op=read  path=/eos/users/dpugnere/toto-100M info=tpc.stage=placement  redirection=np02ss06.cern.ch?&cap.sym=<...>&cap.msg=<...>&mgm.logid=d0b30b6e-38b7-11e8-8058-f4e9d4cf7750&mgm.replicaindex=0&mgm.replicahead=0&mgm.id=0000021b:1095
180405 11:57:56 time=1522922276.636970 func=open                     level=INFO  logid=d0b30b6e-38b7-11e8-8058-f4e9d4cf7750 unit=mgm@np02ss18.cern.ch:1094 tid=00007f0d785fc700 source=XrdMgmOfsFile:2342             tident=dpugnere.18737:108@np02ss19 sec=krb5  uid=28122 gid=2841 name=dpugnere geo="" info="redirection" hostport=np02ss06.cern.ch?&cap.sym=<...>&cap.msg=<...>&mgm.logid=d0b30b6e-38b7-11e8-8058-f4e9d4cf7750&mgm.replicaindex=0&mgm.replicahead=0&mgm.id=0000021b:1095

On the FST (np02ss06.cern.ch) :

180405 11:57:58 time=1522922278.322226 func=Report level=INFO logid=static.............................. unit=fst@np02ss06.cern.ch:1095 tid=00007fd0f9afe700 source=Report:55 tident= sec=(null) uid=99 gid=99 name=- geo="" log=d0b30b6e-38b7-11e8-8058-f4e9d4cf7750&path=/eos/users/dpugnere/toto-100M&ruid=28122&rgid=2841&td=dpugnere.18737:29@np02ss19&host=np02ss06.cern.ch&lid=1048578&fid=539&fsid=41&ots=1522922276&otms=702&cts=1522922277&ctms=704&nrc=0&nwc=0&rb=0&rb_min=0&rb_max=0&rb_sigma=0.00&rv_op=0&rvb_min=0&rvb_max=0&rvb_sum=0&rvb_sigma=0.00&rs_op=0&rsb_min=0&rsb_max=0&rsb_sum=0&rsb_sigma=0.00&rc_min=0&rc_max=0&rc_sum=0&rc_sigma=0.00&wb=0&wb_min=0&wb_max=0&wb_sigma=0.00&sfwdb=0&sbwdb=0&sxlfwdb=0&sxlbwdb=0&nfwds=0&nbwds=0&nxlfwds=0&nxlbwds=0&rt=0.00&rvt=0.00&wt=0.00&osize=104857600&csize=104857600&sec.prot=krb5&sec.name=dpugnere&sec.host=np02ss19.cern.ch&sec.vorg=&sec.grps=&sec.role=&sec.info=&sec.app=

the FST /etc/sysconfig/eos_env config file is :

DAEMON_COREFILE_LIMIT=unlimited
XRD_ROLES="fst"
LD_PRELOAD=/usr/lib64/libjemalloc.so.1
EOS_BROKER_URL=root://np02ss18.cern.ch:1097//eos/

EOS_MGM_ALIAS=np02ss18.cern.ch
EOS_GEOTAG="np02-daq"
EOS_FST_NO_SSS_ENFORCMENT=1

I don’t know if it is an authentication error of if it is another problem.
I’m looking forward any comment or suggestion.
Denis