Can you please confirm if there will be a new citrine release, i.e 4.8.104? The latest citrine version, 4.8.103 has a timestamp, 2023-06-16 13:42, in our mirror repo.
We would need to coordinate on this with the CTA team, but currently there are many people absent. Therefore, I will try to give you a better estimate on this next week.
Thanks for this Elvin. For my reference, the reason you would like to coordinate with the CTA team is that the above ENODEV change requires a change also in the CTA code?
No, I don’t think there are any changes required on the CTA side. I just want to make sure there is nothing else that CTA would like in this (last) EOS 4 release.
We just released EOS 4.8.104 that includes the two fixes you are interested in. You can get the packages from the usual location: https://storage-ci.web.cern.ch/storage-ci/eos/citrine/tag/testing/el-7/x86_64/eos-server-4.8.104-1.el7.cern.x86_64.rpm
We installed EOS 4.8.104 on our preprod instance. Everything works as expected except WebDAV reads: when I try to copy of out of EOS a file that has been staged from tape I get the following error
Things look fine at the MGM, could you send me the logs for the same transfer (around 10:40:29) from the following FST daemon: antares-eos96.scd.rl.ac.uk with the HTTP port 8001?
Looking over the FST logs, I still don’t see anything wrong there. The open arrives at the FST and the open is done and a reply is sent to the client but then it disconnects.
Also in 4.8.104 there is no code change to the FST part of the EOS setup so I don’t think this is a regression from the previous version.
Can you send me the logs from the command when running with the following options? gfal-copy -vvv --log-file=gfal2.log
Are you actually able to copy out the file with simple xrdcp?
Yes, I can copy out the with xrdcp (and also with gfal-copy root://…) so using XRootD as a protocol. It is the WebDAV protocol that generates the above error.
Just to mention that after the above error - using gfal-copy https://… - the destination local file (random_400MB_local) is a stub, i.e. it has a zero size,
There was indeed a regression in the last version, this is now fixed and a new release is building as we speak. This will be eos-4.8.105.
This issue affects only the FST nodes so if you are in a hurry you can run in a mixed setup with the MGM eos-4.8.104 and the FSTs eos-4.8.98. Otherwise, you can install the upcoming 4.8.105 everywhere.
I will let you know once it’s available in the usual yum repositories. Thank you for the bug report!
The new packages are available in the usual location: https://storage-ci.web.cern.ch/storage-ci/eos/citrine/tag/testing/el-7/x86_64/eos-server-4.8.105-1.el7.cern.x86_64.rpm