RAID6-2018: Unterschied zwischen den Versionen

Aus OrgaMon Wiki
Zur Navigation springen Zur Suche springen
Zeile 164: Zeile 164:
             mdadm: Marked /dev/sdf1 (device 1 in /dev/md127) for replacement
             mdadm: Marked /dev/sdf1 (device 1 in /dev/md127) for replacement
             mdadm: Marked /dev/sdj1 in /dev/md127 as replacement for device 1
             mdadm: Marked /dev/sdj1 in /dev/md127 as replacement for device 1
            anschliessendes "scrub"

Version vom 29. Juli 2020, 22:49 Uhr

Info

/dev/md127:
       Version : 1.2
 Creation Time : Sun Nov  8 00:30:12 2015
    Raid Level : raid6
    Array Size : 23441313792 (22355.38 GiB 24003.91 GB)
 Used Dev Size : 3906885632 (3725.90 GiB 4000.65 GB)
  Raid Devices : 8
 Total Devices : 8
   Persistence : Superblock is persistent

 Intent Bitmap : Internal

   Update Time : Tue Oct 16 16:34:23 2018
         State : clean
Active Devices : 8
Wrking Devices : 8
Failed Devices : 0
 Spare Devices : 0

        Layout : left-symmetric
    Chunk Size : 512K

          Name : Tokio:0
          UUID : 39c0b55f:74c0ab19:5f939236:16921f79
        Events : 269988
Cage Number Major Minor RaidDevice State      dev       id                                           Size  Since
oben
A3    0      8     81    0      active sync   /dev/sdf1 ata-WDC_WD4000FYYZ-01UL1B2_WD-WMC130F7EL9R   
                                                        ata-ST8000VN0022-2EL112_ZA1CY63J             8TB   2018-12
A4    1      8     65    1      active sync   /dev/sde1 ata-WDC_WD4000FYYZ-01UL1B2_WD-WCC134LLA4YE   4TB   2015-12
                                                        ata-ST8000VN004-2M2101_WKD1RM6S              8TB   2020-07
A1    2      8     49    2      active sync   /dev/sdd1 ata-WDC_WD4000FYYZ-01UL1B2_WD-WCC134HF0CFD   4TB   2015-12
A2    3      8     33    3      active sync   /dev/sdc1 ata-WDC_WD4000FYYZ-01UL1B2_WD-WCC133XLHK2N   4TB   2015-12
B1    6      8     113   4      active sync   /dev/sdh1 ata-HGST_HDN724040ALE640_PK2338P4HGJ6RC      4TB   2016-03
B2    5      8     97    5      active sync   /dev/sdg1 ata-HGST_HDN724040ALE640_PK2338P4HGDRYC      4TB   2016-03
B4    4      8     17    6      active sync   /dev/sdb1 ata-HGST_HDN724040ALE640_PK1334PCKNY90X      4TB   2016-03
B3    7      8     1     7      active sync   /dev/sda1 ata-HGST_HDN724040ALE640_PK1334PEHYT9XS      4TB   2017-11
Cage Number Major Minor RaidDevice State      dev       id                                           Size
unten
B4    11     8     129   -      spare         /dev/sdi1 ata-HGST_HDN724040ALE640_PK1334PEKDZDDS      4TB   2020-07

aktuelle Device-Names

   Number   Major   Minor   RaidDevice State
     10       8       97        0      active sync   /dev/sdg1
      1       8       81        1      active sync   /dev/sdf1
      8       8       65        2      active sync   /dev/sde1
      3       8       49        3      active sync   /dev/sdd1
      6       8      129        4      active sync   /dev/sdi1
      9       8      113        5      active sync   /dev/sdh1
      4       8       17        6      active sync   /dev/sdb1
      7       8        1        7      active sync   /dev/sda1
     11       8       33        -      spare   /dev/sdc1

Lage der Platten


  • oberhalb von Tokio
[A.1   |A.2   |A.3   |A.4   ] [B.1   |B.2   |B.3   |B.4   ]
[HF0CFD|XLHK2N|F7EL9R|LLA4YE] [HGJ6RC|HGDRYC|HYT9XS|KNY90X]
[HF0CFD|XLHK2N|1CY63J|LLA4YE] [HGJ6RC|HGDRYC|HYT9XS|KNY90X]
[HF0CFD|XLHK2N|1CY63J|NEU   ] [HGJ6RC|HGDRYC|HYT9XS|KNY90X]
  • unterhalb von Tokio
[A.1   |A.2   |A.3   |A.4   ] [B.1   |B.2   |B.3   |B.4   ]
[      |      |      |      ] [      |      |D1RM6S|KDZDDS]

Beschaffung

Es sollen schrittweise alle Platten durch 8 TB Exemplare ersetzt werden.
https://www.alternate.de/html/listings/1458214498740?order=ASC&lk=8323&showFilter=false&hideFilter=false&disableFilter=false&filter_-1=3500&filter_-1=142900&filter_1021=8000.0&filter_2147482612=1037

Temperaturen

  • Situation kurz vor Ersatz von sdf
a:
194 Temperature_Celsius     0x0002   157   157   000    Old_age   Always       -       38 (Min/Max 25/52)
  9 Power_On_Hours          0x0012   099   099   000    Old_age   Always       -       11485
b:
194 Temperature_Celsius     0x0002   153   153   000    Old_age   Always       -       39 (Min/Max 21/54)
  9 Power_On_Hours          0x0012   097   097   000    Old_age   Always       -       23902
c:
194 Temperature_Celsius     0x0022   114   095   000    Old_age   Always       -       38
  9 Power_On_Hours          0x0032   063   063   000    Old_age   Always       -       27533
d:
194 Temperature_Celsius     0x0022   113   100   000    Old_age   Always       -       39
  9 Power_On_Hours          0x0032   063   063   000    Old_age   Always       -       27533
e:
194 Temperature_Celsius     0x0022   112   098   000    Old_age   Always       -       40
  9 Power_On_Hours          0x0032   063   063   000    Old_age   Always       -       27533

  # Interessant: "sdf" zeigt Lesefehler, smartctl unauffällig, aber: "heisseste Platte" 
f:
194 Temperature_Celsius     0x0022   111   095   000    Old_age   Always       -       41
  9 Power_On_Hours          0x0032   063   063   000    Old_age   Always       -       27533

g:
194 Temperature_Celsius     0x0002   157   157   000    Old_age   Always       -       38 (Min/Max 20/53)
  9 Power_On_Hours          0x0012   097   097   000    Old_age   Always       -       23903
h:
194 Temperature_Celsius     0x0002   157   157   000    Old_age   Always       -       38 (Min/Max 20/53)
  9 Power_On_Hours          0x0012   097   097   000    Old_age   Always       -       23902
i:
194 Temperature_Celsius     0x0022   038   040   000    Old_age   Always       -       38 (0 21 0 0 0)
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       6

Ereignisse

May 09 01:22:50 tokio kernel: blk_update_request: I/O error, dev sdf, sector 7010129740
May 09 01:22:53 tokio kernel: blk_update_request: I/O error, dev sdf, sector 7010129740
Jul 26 01:22:52 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6521277045
Jul 26 01:22:54 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6521277045
Jul 26 01:22:57 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6521278897
Jul 26 01:22:59 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6521278897
Oct 18 01:18:41 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6620009643
Oct 18 01:18:44 tokio kernel: blk_update_request: I/O error, dev sdf, sector 6620009643
27.11.2018 9 Uhr Ausfall von 2 Platten, sieht aus wie ein kurzes "Power Fail" Event
           11 Uhr Anfrage wegen zu wenig Platz, Reduziere Sicherungen von 10 auf 7
           Lösche die Sicherungsverzeichnisse 7+8+9
           stelle den Ausfall von 2 Platten fest
           rebuild von Platte sdd
28.11.2018 rebuild von Platte sdg
           I/O Fehler bei Platte sdf aus der Vergangenheit entdeckt
           Beschaffung einer neuen 8TB Platte, soll "sdf" ersetzen
30.11.2018 neue Platte als 9. Platte hinzugehängt
           --add-spare & --replace angestossen
           + ata-ST8000VN0022-2EL112_ZA1CY63J (sdi)
           - ata-WDC_WD4000FYYZ-01UL1B2_WD-WMC130F7EL9R (sdf)
06.06.2020 sde, ata7.00: failed command: READ FPDMA QUEUED, 
           blk_update_request: I/O error, dev sde, sector 6211109969
           blk_update_request: I/O error, dev sde, sector 6211111867
           einbau eines Spare vorgeschlagen
22.07.2020 +Spare: HGST HDN724040AL (4 TB), Serial=PK1334PEKDZDDS, /dev/sdi eingebaut
24.07.2020 neue Device-Names vom System nach einem Neustart vergeben
27.07.2020 [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366136 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366144 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366152 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366160 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366168 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366176 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366184 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366192 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366200 on sdf1)
           [Sat Jul 25 23:58:00 2020] md/raid:md127: read error corrected (8 sectors at 42366208 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314736 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314744 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314752 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314760 on sdf1)
           [Sat Jul 25 23:58:39 2020] md/raid:md127: read error corrected (8 sectors at 46314768 on sdf1)
           -> A4, LLA4YE ist Tauschkandidat
29.07.2020 ST8000VN004-2M21 (WKD1RM6S) wurde eingebaut, /dev/sdj
           maximale 8TB Partition erstellt, sdj1, als spare eingebunden
           # mdadm /dev/md127 --replace /dev/sdf1 --with /dev/sdj1
           mdadm: Marked /dev/sdf1 (device 1 in /dev/md127) for replacement
           mdadm: Marked /dev/sdj1 in /dev/md127 as replacement for device 1
           anschliessendes "scrub"