Irregularities in raid6 stripe layout, and how to list files with bad stripes/replicas

Dear Team,

Files with the raid6 layout show different numbers of stripes/replicas: 4, 5 or 6 (some have 7).
We have 8 FSTs and 16 groups, running EOS version 4.8.105.
In the EOS attributes, we have defined the layout and nstripes as:
sys.forced.layout="raid6"
sys.forced.nstripes="7"

However, many files still differ in their stripe layout and #Rep values.

[root@eos-mgm tmp]# eos file info /eos/alicekolkata/grid/03/64752/4acd2eec-6381-11e1-b780-2faecd17438c
File: ‘/eos/alicekolkata/grid/03/64752/4acd2eec-6381-11e1-b780-2faecd17438c’ Flags: 0600
Size: 93956852
Status: locations::incomplete
Modify: Sat Mar 30 03:37:24 2019 Timestamp: 1553897244.633960000
Change: Sat Mar 30 03:36:31 2019 Timestamp: 1553897191.807177025
Birth: Thu Jan 1 05:30:00 1970 Timestamp: 0.000000000
CUid: 10367 CGid: 1395 Fxid: 002d9f13 Fid: 2989843 Pid: 1737 Pxid: 000006c9
XStype: adler XS: 52 36 33 18 ETAGs: “802579869073408:52363318”
Layout: raid6 Stripes: 7 Blocksize: 1M LayoutId: 20640642 Redundancy: d2::t0
#Rep: 6
0 91 NA
┌───┬──────┬────────────────────────┬────────────────┬────────────────┬──────────┬──────────────┬────────────┬────────┬────────────────────────┐
│no.│ fs-id│ host│ schedgroup│ path│ boot│ configstatus│ drain│ active│ geotag│
└───┴──────┴────────────────────────┴────────────────┴────────────────┴──────────┴──────────────┴────────────┴────────┴────────────────────────┘
1 90 eos10.tier2-kol.res.in default.12 /xdata6 booted rw nodrain online Kolkata::EOS2
2 86 eos08.tier2-kol.res.in default.12 /xdata6 booted rw nodrain online Kolkata::EOS2
3 87 eos05.tier2-kol.res.in default.12 /xdata6 booted rw nodrain online Kolkata::EOS2
4 125 eos11.tier2-kol.res.in default.12 /xdata6 booted rw nodrain online Kolkata::EOS2
5 141 eos09.tier2-kol.res.in default.12 /xdata6 booted rw nodrain online Kolkata::EOS2


[root@eos-mgm ~]# eos file info /eos/alicekolkata/grid/04/45205/2945e052-db49-11e7-a88c-ffa4a6ea3020
File: ‘/eos/alicekolkata/grid/04/45205/2945e052-db49-11e7-a88c-ffa4a6ea3020’ Flags: 0644
Size: 155858104
Status: locations::incomplete
Modify: Thu Sep 26 04:29:00 2019 Timestamp: 1569452340.549856000
Change: Thu Sep 26 04:21:10 2019 Timestamp: 1569451870.240102165
Birth: Thu Jan 1 05:30:00 1970 Timestamp: 0.000000000
CUid: 10367 CGid: 1395 Fxid: 00ec8328 Fid: 15500072 Pid: 13624 Pxid: 00003538
XStype: adler XS: 8f 15 21 c5 ETAGs: “4160768895352832:8f1521c5”
Layout: raid6 Stripes: 7 Blocksize: 1M LayoutId: 20640642 Redundancy: d1::t0
#Rep: 5
┌───┬──────┬────────────────────────┬────────────────┬────────────────┬──────────┬──────────────┬────────────┬────────┬────────────────────────┐
│no.│ fs-id│ host│ schedgroup│ path│ boot│ configstatus│ drain│ active│ geotag│
└───┴──────┴────────────────────────┴────────────────┴────────────────┴──────────┴──────────────┴────────────┴────────┴────────────────────────┘
0 55 eos10.tier2-kol.res.in default.7 /xdata15 booted rw nodrain online Kolkata::EOS2
1 120 eos11.tier2-kol.res.in default.7 /xdata15 booted rw nodrain online Kolkata::EOS2
2 51 eos08.tier2-kol.res.in default.7 /xdata15 booted rw nodrain online Kolkata::EOS2
3 54 eos06.tier2-kol.res.in default.7 /xdata15 booted rw nodrain online Kolkata::EOS2
4 56 eos09.tier2-kol.res.in default.7 /xdata15 booted rw nodrain online Kolkata::EOS2


[root@eos-mgm ~]# eos file info /eos/alicekolkata/grid/04/65267/fff053ae-1de0-11ea-9a61-ab6f1693a835
File: ‘/eos/alicekolkata/grid/04/65267/fff053ae-1de0-11ea-9a61-ab6f1693a835’ Flags: 0664
Size: 13606514
Status: healthy
Modify: Sat Dec 14 01:15:44 2019 Timestamp: 1576266344.299131000
Change: Sat Dec 14 01:15:43 2019 Timestamp: 1576266343.661693709
Birth: Sat Dec 14 01:15:43 2019 Timestamp: 1576266343.661693709
CUid: 10367 CGid: 1395 Fxid: 01342e61 Fid: 20196961 Pid: 4899 Pxid: 00001323
XStype: adler XS: 78 65 b1 ae ETAGs: “5421580435849216:7865b1ae”
Layout: raid6 Stripes: 7 Blocksize: 1M LayoutId: 20640642 Redundancy: d3::t0
#Rep: 7
0 77 NA
┌───┬──────┬────────────────────────┬────────────────┬────────────────┬──────────┬──────────────┬────────────┬────────┬────────────────────────┐
│no.│ fs-id│ host│ schedgroup│ path│ boot│ configstatus│ drain│ active│ geotag│
└───┴──────┴────────────────────────┴────────────────┴────────────────┴──────────┴──────────────┴────────────┴────────┴────────────────────────┘
1 76 eos10.tier2-kol.res.in default.10 /xdata4 booted rw nodrain online Kolkata::EOS2
2 72 eos08.tier2-kol.res.in default.10 /xdata4 booted rw nodrain online Kolkata::EOS2
3 74 eos07.tier2-kol.res.in default.10 /xdata4 booted rw nodrain online Kolkata::EOS2
4 123 eos11.tier2-kol.res.in default.10 /xdata4 booted rw nodrain online Kolkata::EOS2
5 139 eos09.tier2-kol.res.in default.10 /xdata4 booted rw nodrain online Kolkata::EOS2
6 71 eos04.tier2-kol.res.in default.10 /xdata4 booted rw nodrain online Kolkata::EOS2

(But the table shows only 6 replicas.)


[root@eos-mgm tmp]# eos file info /eos/alicekolkata/grid/08/06690/f2c9f622-b728-11e9-95c4-9fa63bdc7f7a
File: ‘/eos/alicekolkata/grid/08/06690/f2c9f622-b728-11e9-95c4-9fa63bdc7f7a’ Flags: 0664
Size: 24545
Status: locations::incomplete
Modify: Mon Aug 5 08:00:17 2019 Timestamp: 1564972217.528912000
Change: Mon Aug 5 08:00:11 2019 Timestamp: 1564972211.303030278
Birth: Thu Jan 1 05:30:00 1970 Timestamp: 0.000000000
CUid: 10367 CGid: 1395 Fxid: 00de4300 Fid: 14566144 Pid: 5529 Pxid: 00001599
XStype: adler XS: b4 2c 29 e1 ETAGs: “3910069506801664:b42c29e1”
Layout: raid6 Stripes: 7 Blocksize: 1M LayoutId: 20640642 Redundancy: d1::t0
#Rep: 5
0 35 NA
┌───┬──────┬────────────────────────┬────────────────┬────────────────┬──────────┬──────────────┬────────────┬────────┬────────────────────────┐
│no.│ fs-id│ host│ schedgroup│ path│ boot│ configstatus│ drain│ active│ geotag│
└───┴──────┴────────────────────────┴────────────────┴────────────────┴──────────┴──────────────┴────────────┴────────┴────────────────────────┘
1 32 eos07.tier2-kol.res.in default.4 /xdata12 booted rw nodrain online Kolkata::EOS2
2 31 eos05.tier2-kol.res.in default.4 /xdata12 booted rw nodrain online Kolkata::EOS2
3 30 eos08.tier2-kol.res.in default.4 /xdata12 booted rw nodrain online Kolkata::EOS2
4 117 eos11.tier2-kol.res.in default.4 /xdata12 booted rw nodrain online Kolkata::EOS2

(Only 4 replicas are shown.)


[root@eos-mgm tmp]# eos file check /eos/alicekolkata/grid/08/06690/f2c9f622-b728-11e9-95c4-9fa63bdc7f7a
path="/eos/alicekolkata/grid/08/06690/f2c9f622-b728-11e9-95c4-9fa63bdc7f7a" fxid="(null)" size="24545" nrep="5" checksumtype="adler" checksum="b42c29e100000000000000000000000000000000000000000000000000000000"
It shows only the path and the file-level fields. The per-replica parameters (fstpath, fsid, hostname, etc.) are missing, which I take to mean that this file is corrupt.
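
To make "missing parameters" concrete, here is a minimal Python sketch that splits one line of `eos file check` output into key/value pairs and reports which per-replica keys are absent. The helper names and the key names in `expected` are my own assumptions taken from the fields mentioned above; they are not an EOS API and may not match the exact keys EOS prints.

```python
import re

# Matches key="value" pairs; also accepts the curly quotes that forum
# software sometimes substitutes for straight quotes.
PAIR = re.compile(r'(\w+)=["“]([^"”]*)["”]')

def parse_check_line(line):
    """Return the key/value pairs found in one line of output as a dict."""
    return dict(PAIR.findall(line))

def missing_keys(fields, expected=("fsid", "fstpath")):
    """List the expected keys (assumed names) that are not present."""
    return [key for key in expected if key not in fields]

# The line printed for the file above.
LINE = (
    'path="/eos/alicekolkata/grid/08/06690/f2c9f622-b728-11e9-95c4-9fa63bdc7f7a" '
    'fxid="(null)" size="24545" nrep="5" checksumtype="adler" '
    'checksum="b42c29e100000000000000000000000000000000000000000000000000000000"'
)

fields = parse_check_line(LINE)
print(sorted(fields))
print(missing_keys(fields))
```

For the line above, the file-level keys (path, fxid, size, nrep, checksumtype, checksum) are found, while the assumed per-replica keys are reported as missing.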

Among the files listed above, some are healthy and some are incomplete.
As I understand it, for raid6 with 7 stripes, #Rep (the replica count) should also be 7 (5 data + 2 parity).
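
The check I have in mind can be sketched in Python as follows. This is only an illustration that parses the text of `eos file info` as pasted above. The function name and the "missing" field are my own and not part of EOS, and the sketch assumes the `Status:`, `Stripes:` and `#Rep:` lines always look like the ones shown here.

```python
import re

def summarize_file_info(text):
    """Extract status, stripe count and replica count from `eos file info` text."""
    status = re.search(r"^Status:\s*(\S+)", text, re.MULTILINE)
    stripes = re.search(r"Stripes:\s*(\d+)", text)
    nrep = re.search(r"^#Rep:\s*(\d+)", text, re.MULTILINE)
    if not (status and stripes and nrep):
        raise ValueError("text does not look like `eos file info` output")
    n_stripes = int(stripes.group(1))
    n_rep = int(nrep.group(1))
    return {
        "status": status.group(1),
        "stripes": n_stripes,
        "nrep": n_rep,
        # Stripes the layout expects but that are not registered as replicas.
        "missing": n_stripes - n_rep,
    }

# The relevant lines from the second file shown above.
SAMPLE = """\
Status: locations::incomplete
Layout: raid6 Stripes: 7 Blocksize: 1M LayoutId: 20640642 Redundancy: d1::t0
#Rep: 5
"""

info = summarize_file_info(SAMPLE)
print(info)
```

For the second file above, this gives 7 expected stripes, 5 registered replicas, and therefore 2 missing stripes. Running the same function over the saved output for many files would give a rough list of the files where #Rep is lower than Stripes.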

So, how can we fix the stripe and replica issue?
Also, how can we find the list of files that are corrupted or have bad replicas?