RAID6-2018: Difference between revisions

From OrgaMon Wiki
(130 intermediate revisions by 2 users not shown)
Latest revision as of 17:53, 11 November 2021

Info

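Hardware

  • Supermicro X10SRA-F
      https://www.supermicro.com/en/products/motherboard/X10SRA-F
  • 32 GB ECC Memory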

Raid

        Version : 1.2
  Creation Time : Sun Nov  8 00:30:12 2015
     Raid Level : raid6
     Array Size : 46883398656 (44711.49 GiB 48008.60 GB)
  Used Dev Size : 7813899776 (7451.92 GiB 8001.43 GB)
   Raid Devices : 8
  Total Devices : 9
    Persistence : Superblock is persistent
  Intent Bitmap : Internal
    Update Time : Sat Aug  7 12:32:17 2021
          State : clean
 Active Devices : 8
Working Devices : 9
 Failed Devices : 0
  Spare Devices : 1
         Layout : left-symmetric
     Chunk Size : 512K
           Name : Tokio:0
           UUID : 39c0b55f:74c0ab19:5f939236:16921f79
         Events : 399926
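
The Array Size reported above can be cross-checked from the other fields. The following is a minimal sketch, assuming the sizes in the mdadm output are in KiB and using the usual RAID6 rule that two devices' worth of space holds parity; the constant and function names are only illustrative.

```python
# Values copied from the mdadm output above (unit: KiB).
USED_DEV_SIZE_KIB = 7813899776
RAID_DEVICES = 8
PARITY_DEVICES = 2  # RAID6 keeps two devices' worth of parity


def raid6_array_size_kib(used_dev_size_kib: int, raid_devices: int) -> int:
    """Usable RAID6 capacity in KiB: (n - 2) * per-device size."""
    return (raid_devices - PARITY_DEVICES) * used_dev_size_kib


array_size_kib = raid6_array_size_kib(USED_DEV_SIZE_KIB, RAID_DEVICES)
array_size_gib = array_size_kib / 1024**2        # KiB -> GiB
array_size_gb = array_size_kib * 1024 / 1000**3  # KiB -> decimal GB

print(array_size_kib)               # 46883398656
print(f"{array_size_gib:.2f} GiB")  # 44711.49 GiB
print(f"{array_size_gb:.2f} GB")    # 48008.60 GB
```

All three values match the "Array Size" line, so the array uses the full 8 TB of each of the six data-equivalent members.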

Note: in the wiki source every 4TB entry below is struck through (retired disk).
Cage Number Major Minor RaidDevice State      dev       id                                 Size  Since
A3    0      8     81    0      active sync   ata-WDC_WD4000FYYZ-01UL1B2_WD-WMC130F7EL9R   
A3                                            ata-ST8000VN0022-2EL112_ZA1CY63J             8TB   2018-12
A4    1      8     65    1      active sync   ata-WDC_WD4000FYYZ-01UL1B2_WD-WCC134LLA4YE   4TB   2015-12
D3                                            ata-ST8000VN004-2M2101_WKD1RM6S              8TB   2020-07
A1    2      8     49    2      active sync   ata-WDC_WD4000FYYZ-01UL1B2_WD-WCC134HF0CFD   4TB   2015-12
A2                                            ata-ST8000VN004-2M2101_WKD2PZZS              8TB   2020-12
A2    3      8     33    3      active sync   ata-WDC_WD4000FYYZ-01UL1B2_WD-WCC133XLHK2N   4TB   2015-12
A4                                            ata-WDC_WD80EFAX-68KNBN0_VDJK7XTK            8TB   2020-07 
B1    6      8     113   4      active sync   ata-HGST_HDN724040ALE640_PK2338P4HGJ6RC      4TB   2016-03
                                              ata-ST8000VN004-2M2101_WKD3LLY8              8TB   2021-07
B2    5      8     97    5      active sync   ata-HGST_HDN724040ALE640_PK2338P4HGDRYC      4TB   2016-03
                                              ata-ST8000VN004-3CP101_WP001TWL              8TB   2021-08 
B4    4      8     17    6      active sync   ata-HGST_HDN724040ALE640_PK1334PCKNY90X      4TB   2016-03
B4                                            ata-ST8000DM004-2CX188_ZCT40SH2              8TB   2021-07
A1                                            ata-ST8000DM004-2CX188_ZCT3MVCJ              8TB   2020-12 
B3    7      8     1     7      active sync   ata-HGST_HDN724040ALE640_PK1334PEHYT9XS      4TB   2017-11

D4    11     8     129   -      spare         ata-HGST_HDN724040ALE640_PK1334PEKDZDDS      4TB   2020-07
                                              ata-ST8000VN004-3CP101_WP000N57              8TB   2021-08
  • without SATA connection
C1
C2
C3
C4
D1
D2
  • Resync takes 16 hours
[Sat Aug 21 23:19:21 2021] md: requested-resync of RAID array md127
[Sun Aug 22 15:17:38 2021] md: md127: requested-resync done.
  • Mainboard
Supermicro X10SRA-F
BIOS 1.0a
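
The 16-hour resync figure follows from the two "requested-resync" kernel timestamps quoted above. The sketch below recomputes it and derives a rough per-disk rate. It assumes the resync pass reads the full Used Dev Size of every member in parallel, which the page does not state; the helper name is illustrative.

```python
from datetime import datetime

# Timestamps copied from the two "requested-resync" log lines above.
LOG_START = "Sat Aug 21 23:19:21 2021"
LOG_END = "Sun Aug 22 15:17:38 2021"
USED_DEV_SIZE_KIB = 7813899776  # from the mdadm output (assumed KiB)

MONTHS = {name: number for number, name in enumerate(
    ["Jan", "Feb", "Mar", "Apr", "May", "Jun",
     "Jul", "Aug", "Sep", "Oct", "Nov", "Dec"], start=1)}


def parse_kernel_time(stamp: str) -> datetime:
    """Parse 'Sat Aug 21 23:19:21 2021' without relying on the locale."""
    _weekday, month, day, clock, year = stamp.split()
    hour, minute, second = (int(part) for part in clock.split(":"))
    return datetime(int(year), MONTHS[month], int(day), hour, minute, second)


elapsed = parse_kernel_time(LOG_END) - parse_kernel_time(LOG_START)
hours = elapsed.total_seconds() / 3600
# Assumption: each member disk is read once over its full used size.
rate_mib_s = USED_DEV_SIZE_KIB / 1024 / elapsed.total_seconds()

print(elapsed)                              # 15:58:17
print(f"{hours:.2f} h, about {rate_mib_s:.0f} MiB/s per disk")
```

Under that assumption the sustained rate is a little over 130 MiB/s per disk, which is plausible for 8 TB SATA drives.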


Current device names

   Number   Major   Minor   RaidDevice State         Partition   HDD-Serial Location
     10       8      113        0      active sync   /dev/sdh1 | ZA1CY63J | A.3
     12       8       33        1      active sync   /dev/sdc1 | WKD1RM6S | D.3
     14       8       65        2      active sync   /dev/sde1 | WKD2PZZS | A.2
     13       8       97        3      active sync   /dev/sdg1 | VDJK7XTK | A.4
     15       8       17        4      active sync   /dev/sdb1 | ZCT40SH2 | B.4
     16       8      145        5      active sync   /dev/sdj1 | WKD3LLY8 | B.1
      8       8       81        6      active sync   /dev/sdf1 | ZCT3MVCJ | A.1
      9       8      129        7      active sync   /dev/sdi1 | WP001TWL | B.2
     17       8        1        -      spare         /dev/sda1 | WP000N57 | B.3
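
Kernel names such as /dev/sdh change between reboots (see the 24.07.2020 entry under Events), so a table like the one above has to be rebuilt from the stable names. The following is a minimal sketch, assuming the usual udev layout where /dev/disk/by-id contains symlinks named ata-MODEL_SERIAL that point to the kernel device; the function name map_by_id is illustrative.

```python
import os


def map_by_id(by_id_dir: str = "/dev/disk/by-id", prefix: str = "ata-") -> dict:
    """Return {kernel device name: by-id name} for whole disks.

    Partition links (names containing '-part') and links with another
    prefix (for example wwn-) are skipped.
    """
    mapping = {}
    for name in sorted(os.listdir(by_id_dir)):
        if not name.startswith(prefix) or "-part" in name:
            continue
        # Resolve the symlink to the kernel device it currently points to.
        target = os.path.realpath(os.path.join(by_id_dir, name))
        mapping[os.path.basename(target)] = name
    return mapping
```

On the RAID host, `map_by_id()` would yield pairs such as sdh and ata-ST8000VN0022-2EL112_ZA1CY63J, from which the serial column can be refreshed after each reboot.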

Disk locations


  • above Tokio (one row per disk swap, oldest state first)
[A.1   |A.2   |A.3   |A.4   ] [B.1   |B.2   |B.3   |B.4   ]
[HF0CFD|XLHK2N|F7EL9R|LLA4YE] [HGJ6RC|HGDRYC|HYT9XS|KNY90X]
[HF0CFD|XLHK2N|1CY63J|LLA4YE] [HGJ6RC|HGDRYC|HYT9XS|KNY90X]
[HF0CFD|XLHK2N|1CY63J|JK7XTK] [HGJ6RC|HGDRYC|HYT9XS|KNY90X]
[HF0CFD|D2PZZS|1CY63J|JK7XTK] [HGJ6RC|HGDRYC|HYT9XS|KNY90X]
[T3MVCJ|D2PZZS|1CY63J|JK7XTK] [HGJ6RC|HGDRYC|HYT9XS|KNY90X]
[T3MVCJ|D2PZZS|1CY63J|JK7XTK] [HGJ6RC|HGDRYC|HYT9XS|T40SH2]
[T3MVCJ|D2PZZS|1CY63J|JK7XTK] [D3LLY8|HGDRYC|HYT9XS|T40SH2]
[T3MVCJ|D2PZZS|1CY63J|JK7XTK] [D3LLY8|001TWL|HYT9XS|T40SH2]
[T3MVCJ|D2PZZS|1CY63J|JK7XTK] [D3LLY8|001TWL|000N57|T40SH2]
  • below Tokio
[C.1   |C.2   |C.3   |C.4   ] [D.1   |D.2   |D.3   |D.4   ]
[      |      |      |      ] [      |      |D1RM6S|KDZDDS]
[      |      |      |      ] [      |      |D1RM6S|KDZDDS]
[      |      |      |      ] [      |      |D1RM6S|001CVK]

Procurement

All disks are 8 TB models.
https://www.alternate.de/html/listings/1458214498740?order=ASC&lk=8323&showFilter=false&hideFilter=false&disableFilter=false&filter_-1=3500&filter_-1=142900&filter_1021=8000.0&filter_2147482612=1037

Events

May 09 01:22:50 tokio kernel: blk_update_request: I/O error, dev sdf, sector 7010129740
May 09 01:22:53 tokio kernel: blk_update_request: I/O error, dev sdf, sector 7010129740
Jul 26 01:22:52 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6521277045
Jul 26 01:22:54 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6521277045
Jul 26 01:22:57 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6521278897
Jul 26 01:22:59 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6521278897
Oct 18 01:18:41 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6620009643
Oct 18 01:18:44 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6620009643
27.11.2018 9 a.m. failure of 2 disks, looks like a brief "power fail" event
           11 a.m. request about insufficient space; reduced backups from 10 to 7
           deleted backup directories 7+8+9
           noticed the failure of 2 disks
           rebuild of disk sdd
28.11.2018 rebuild of disk sdg
           discovered past I/O errors on disk sdf
           procured a new 8TB disk, intended to replace "sdf"
30.11.2018 new disk attached as the 9th disk
           --add-spare & --replace started
           + ata-ST8000VN0022-2EL112_ZA1CY63J (sdi)
           - ata-WDC_WD4000FYYZ-01UL1B2_WD-WMC130F7EL9R (sdf)
06.06.2020 sde, ata7.00: failed command: READ FPDMA QUEUED, 
           blk_update_request: I/O error, dev sde, sector 6211109969
           blk_update_request: I/O error, dev sde, sector 6211111867
           installation of a spare proposed
22.07.2020 +Spare: HGST HDN724040AL (4 TB), Serial=PK1334PEKDZDDS, installed as /dev/sdi
24.07.2020 system assigned new device names after a reboot
27.07.2020 [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366136 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366144 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366152 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366160 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366168 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366176 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366184 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366192 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366200 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366208 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314736 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314744 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314752 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314760 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314768 on sdf1)
           -> A4, LLA4YE is the replacement candidate
29.07.2020 ST8000VN004-2M21 (WKD1RM6S) installed, /dev/sdj
           created a maximum-size 8TB partition, sdj1, and added it as a spare
           # mdadm /dev/md127 --replace /dev/sdf1 --with /dev/sdj1
           mdadm: Marked /dev/sdf1 (device 1 in /dev/md127) for replacement
           mdadm: Marked /dev/sdj1 in /dev/md127 as replacement for device 1
           sdf1: superblock zeroed, partition removed
           followed by a "scrub"
30.07.2020 +ata-WDC_WD80EFAX-68KNBN0_VDJK7XTK installed, /dev/sdf
           -ata-WDC_WD4000FYYZ-01UL1B2_WD-WCC133XLHK2N
31.07.2020 pulled "XLHK2N", since it is marked "removed"
11.12.2020 +ata-ST8000VN004-2M2101_WKD2PZZS installed, /dev/sdj
           -ata-WDC_WD4000FYYZ-01UL1B2_WD-WCC134HF0CFD
           Replace of "HF0CFD" (sde), as it is 5.2 years old
14.12.2020 +ata-ST8000DM004-2CX188_ZCT3MVCJ installed, /dev/sde
           -ata-HGST_HDN724040ALE640_PK1334PCKNY90X
           Replace of sdb, as it runs at 54 °C and is the hottest disk
29.07.2021 +ata-ST8000DM004-2CX188_ZCT40SH2 (B4)
           -ata-HGST_HDN724040ALE640_PK2338P4HGJ6RC (B1)
           Replace of sdi, as it has been in service for over 5 years
30.07.2021 +ata-ST8000VN004-2M2101_WKD3LLY8 (B1)
           -ata-HGST_HDN724040ALE640_PK2338P4HGDRYC (B2)
           Replace of sdh, as it has been in service for over 5 years
02.08.2021 +ata-ST8000VN004-3CP101_WP001TWL (B2)
           -ata-HGST_HDN724040ALE640_PK1334PEHYT9XS (B3)
           Replace of sda, as it is 4 years old and still a 4 TB disk
03.08.2021 +ata-ST8000VN004-3CP101_WP000N57 (B3)
           -ata-HGST_HDN724040ALE640_PK1334PEKDZDDS (D4)
           Replace of sdd, as it is a 4 TB disk, though with only 9065 operating hours
05.08.2021 Partition grow, now 42.7 TB
           resize2fs hangs at 100%, see Linux.raid#resize2fs_100.25_CPU_Usage
06.08.2021 [Fri Aug  6 02:49:37 2021] sd 1:0:0:0: [sdb] tag#18 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
           [Fri Aug  6 02:49:37 2021] sd 1:0:0:0: [sdb] tag#18 CDB: Read(16) 88 00 00 00 00 03 8e 3f c0 08 00 00 05 40 00 00
           [Fri Aug  6 02:49:37 2021] blk_update_request: I/O error, dev sdb, sector 15271444488 
           +12x
           [Sun Aug  8 02:59:12 2021] md/raid:md127: read error corrected (8 sectors at 3934061840 on sdb1)
           +12x 
           sdb had been the slowest disk so far; now it has bad sectors
11.08.2021 [Wed Aug 11 02:41:28 2021] ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
           [Wed Aug 11 02:41:28 2021] ata2.00: failed command: SMART
           [Wed Aug 11 02:41:28 2021] ata2.00: cmd b0/d0:01:00:4f:c2/00:00:00:00:00/00 tag 19 pio 512 in res 40/00:82:82:00:00/00:00:00:00:00/40 Emask 0x4 (timeout)
           [Wed Aug 11 02:41:28 2021] ata2: hard resetting link
           [Wed Aug 11 09:37:54 2021] ata2.00: failed command: FLUSH CACHE EXT
           [Wed Aug 11 09:37:54 2021] ata2: hard resetting link
           ...
           [Wed Aug 11 09:38:41 2021] ata2: hard resetting link
           [Wed Aug 11 09:38:46 2021] ata2: link is slow to respond, please be patient (ready=0)
           [Wed Aug 11 09:38:51 2021] ata2: COMRESET failed (errno=-16)
           [Wed Aug 11 09:38:51 2021] ata2: hard resetting link
           ... 
           [Wed Aug 11 09:40:15 2021] blk_update_request: I/O error, dev sdb, sector 2064
           [Wed Aug 11 09:40:15 2021] md: super_written gets error=-5, uptodate=0
           [Wed Aug 11 09:40:15 2021] md/raid:md127: Disk failure on sdb1, disabling device. md/raid:md127: Operation continuing on 7 devices.
           we have lost sdb; the resync onto the spare is running
16.08.2021 we are running without a spare, array is clean
           ZCT40SH2 (B4) can be removed
25.08.2021 - T40SH2 (B4)
           + /dev/disk/by-id/ata-ST8000VN004-3CP101_WP001CVK as spare (D4)
??.??.2021 run resize2fs via Knoppix
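
The Read(16) CDB logged on 06.08.2021 can be decoded by hand to confirm that it refers to the same sector as the I/O error printed directly after it. The sketch below assumes the standard READ(16) layout (opcode 0x88, bytes 2 to 9 the big-endian LBA, bytes 10 to 13 the transfer length in blocks); the function name is illustrative.

```python
# CDB bytes copied from the "[sdb] tag#18 CDB: Read(16)" log line.
CDB_HEX = "88 00 00 00 00 03 8e 3f c0 08 00 00 05 40 00 00"


def decode_read16(cdb_hex: str) -> tuple:
    """Return (starting LBA, block count) of a READ(16) command."""
    cdb = bytes.fromhex(cdb_hex)
    if len(cdb) != 16 or cdb[0] != 0x88:
        raise ValueError("not a READ(16) CDB")
    lba = int.from_bytes(cdb[2:10], "big")      # bytes 2..9
    blocks = int.from_bytes(cdb[10:14], "big")  # bytes 10..13
    return lba, blocks


lba, blocks = decode_read16(CDB_HEX)
print(lba, blocks)  # 15271444488 1344
```

The decoded LBA equals the sector in the blk_update_request line, so the timed-out command is the 1344-block read that starts at exactly that sector.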