RootFS nach 3-4 Wochen RO statt RW

Begonnen von Bartimaus, 19 September 2016, 10:03:01

Vorheriges Thema - Nächstes Thema

Bartimaus

Ich habe den Log in Code-Tags gesetzt, so wie das Smart, aber es wurde ignoriert. Vielleicht weil der Text zu lang ist ?

Ich schau da heute Abend mal.

Danke für die Info über den Plattenzustand  :)
LG
B.


FHEM@AMD-Ryzen7-5700U@Debian-LXC (ProxmoxHOST), CUL1101,FS20,IT,DS18B20,DS2413(Heizungslogger),DS2423(Stromlogger)Homematic,HM-LAN,ZWave,MiniCULs,Shelly

Wernieman

Geöffnete/geschlossene Code-Tags beachtet, eventuell ausversehen ein [ gelöscht?
- Bitte um Input für Output
- When there is a Shell, there is a Way
- Wann war Dein letztes Backup?

Wie man Fragen stellt: https://tty1.net/smart-questions_de.html

Bartimaus

K.a., habs nochmal editiert, jetzt sollte es passen
LG
B.


FHEM@AMD-Ryzen7-5700U@Debian-LXC (ProxmoxHOST), CUL1101,FS20,IT,DS18B20,DS2413(Heizungslogger),DS2423(Stromlogger)Homematic,HM-LAN,ZWave,MiniCULs,Shelly

Wernieman

Wie schon geschrieben: Ist das Logfile vom letzten reboot. Könntest Du in einem älteren Gucken?
- Bitte um Input für Output
- When there is a Shell, there is a Way
- Wann war Dein letztes Backup?

Wie man Fragen stellt: https://tty1.net/smart-questions_de.html

Bartimaus

LG
B.


FHEM@AMD-Ryzen7-5700U@Debian-LXC (ProxmoxHOST), CUL1101,FS20,IT,DS18B20,DS2413(Heizungslogger),DS2423(Stromlogger)Homematic,HM-LAN,ZWave,MiniCULs,Shelly

Bartimaus

Hier ml ein älterer Log

Sep 13 12:26:57 bananapi kernel: [1897592.497738] EXT4-fs warning (device sda1): ext4_end_bio:249: I/O error writing to inode 4326714 (offset 3670016 size 524288 start$
Sep 13 12:26:57 bananapi kernel: [1897592.520534] sd 0:0:0:0: [sda] Unhandled error code
Sep 13 12:26:57 bananapi kernel: [1897592.528801] sd 0:0:0:0: [sda]  Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
Sep 13 12:26:57 bananapi kernel: [1897592.548163] sd 0:0:0:0: [sda] CDB: Write(10): 2a 00 01 d0 e8 00 00 04 00 00
Sep 13 12:26:57 bananapi kernel: [1897592.556557] end_request: I/O error, dev sda, sector 30468096
Sep 13 12:26:57 bananapi kernel: [1897592.561703] Buffer I/O error on device sda1, logical block 3808256
Sep 13 12:26:57 bananapi kernel: [1897592.573199] Buffer I/O error on device sda1, logical block 3808257
Sep 13 12:26:57 bananapi kernel: [1897592.596641] Buffer I/O error on device sda1, logical block 3808258
Sep 13 12:26:57 bananapi kernel: [1897592.616474] Buffer I/O error on device sda1, logical block 3808259
Sep 13 12:26:57 bananapi kernel: [1897592.627434] Buffer I/O error on device sda1, logical block 3808260
Sep 13 12:26:57 bananapi kernel: [1897592.638904] Buffer I/O error on device sda1, logical block 3808261
Sep 13 12:26:57 bananapi kernel: [1897592.644053] Buffer I/O error on device sda1, logical block 3808262
Sep 13 12:26:57 bananapi kernel: [1897592.649191] Buffer I/O error on device sda1, logical block 3808263
Sep 13 12:26:57 bananapi kernel: [1897592.654374] Buffer I/O error on device sda1, logical block 3808264
Sep 13 12:26:57 bananapi kernel: [1897592.659512] Buffer I/O error on device sda1, logical block 3808265
Sep 13 12:26:57 bananapi kernel: [1897592.664682] Buffer I/O error on device sda1, logical block 3808266
Sep 13 12:26:57 bananapi kernel: [1897592.669819] Buffer I/O error on device sda1, logical block 3808267
Sep 13 12:26:57 bananapi kernel: [1897592.674991] Buffer I/O error on device sda1, logical block 3808268
Sep 13 12:26:57 bananapi kernel: [1897592.692781] Buffer I/O error on device sda1, logical block 3808269
Sep 13 12:26:57 bananapi kernel: [1897592.704280] Buffer I/O error on device sda1, logical block 3808270
Sep 13 12:26:57 bananapi kernel: [1897592.715766] Buffer I/O error on device sda1, logical block 3808271
Sep 13 12:26:57 bananapi kernel: [1897592.727287] Buffer I/O error on device sda1, logical block 3808272
Sep 13 12:26:57 bananapi kernel: [1897592.738788] Buffer I/O error on device sda1, logical block 3808273
Sep 13 12:26:57 bananapi kernel: [1897592.750282] Buffer I/O error on device sda1, logical block 3808274
Sep 13 12:26:57 bananapi kernel: [1897592.768104] Buffer I/O error on device sda1, logical block 3808275
Sep 13 12:26:57 bananapi kernel: [1897592.779576] Buffer I/O error on device sda1, logical block 3808276
Sep 13 12:26:57 bananapi kernel: [1897592.791050] Buffer I/O error on device sda1, logical block 3808277
Sep 13 12:26:57 bananapi kernel: [1897592.802535] Buffer I/O error on device sda1, logical block 3808278
Sep 13 12:26:57 bananapi kernel: [1897592.807681] Buffer I/O error on device sda1, logical block 3808279
Sep 13 12:26:57 bananapi kernel: [1897592.812818] Buffer I/O error on device sda1, logical block 3808280
Sep 13 12:26:57 bananapi kernel: [1897592.829867] Buffer I/O error on device sda1, logical block 3808281
Sep 13 12:26:57 bananapi kernel: [1897592.842657] Buffer I/O error on device sda1, logical block 3808282
Sep 13 12:26:57 bananapi kernel: [1897592.854149] Buffer I/O error on device sda1, logical block 3808283
Sep 13 12:26:57 bananapi kernel: [1897592.865622] Buffer I/O error on device sda1, logical block 3808284


Das sieht IMO doch garnicht so gut aus. Würde das FSCK helfen ?
LG
B.


FHEM@AMD-Ryzen7-5700U@Debian-LXC (ProxmoxHOST), CUL1101,FS20,IT,DS18B20,DS2413(Heizungslogger),DS2423(Stromlogger)Homematic,HM-LAN,ZWave,MiniCULs,Shelly

Wernieman

Sep 13 12:26:57 bananapi kernel: [1897592.497738] EXT4-fs warning (device sda1): ext4_end_bio:249: I/O error writing to inode 4326714 (offset 3670016 size 524288 start$
Sep 13 12:26:57 bananapi kernel: [1897592.520534] sd 0:0:0:0: [sda] Unhandled error code
Sep 13 12:26:57 bananapi kernel: [1897592.528801] sd 0:0:0:0: [sda]  Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
Sep 13 12:26:57 bananapi kernel: [1897592.548163] sd 0:0:0:0: [sda] CDB: Write(10): 2a 00 01 d0 e8 00 00 04 00 00
Sep 13 12:26:57 bananapi kernel: [1897592.556557] end_request: I/O error, dev sda, sector 30468096

Kommt etwas vor der ersten Meldung?

Mir sieht dieses nach einem Hardwarefehler aus. "I/O error writing to inode" .....

also auf dem Weg vom Kernel in die Platte liegt ein "Defekt" vor. Da Du schriebst, das es eine USB-Platte ist, kannst Dudie an einem anderen Linux-Rechner anschließen und mal ein fsck rüberlaufen lassen?
- Bitte um Input für Output
- When there is a Shell, there is a Way
- Wann war Dein letztes Backup?

Wie man Fragen stellt: https://tty1.net/smart-questions_de.html

Bartimaus

Es ist eine SATA-Platte. Aber ich könnte sie in ein USB-Gehäuse pcken, und am QNAP anschliessen und fsck drüberlaufen lassen....

Ja, vor dem Log stand noch einiges, aber der Ausschnitt erschien mir bemerkenswert...
LG
B.


FHEM@AMD-Ryzen7-5700U@Debian-LXC (ProxmoxHOST), CUL1101,FS20,IT,DS18B20,DS2413(Heizungslogger),DS2423(Stromlogger)Homematic,HM-LAN,ZWave,MiniCULs,Shelly

Wernieman

Was für ein Dateisystem?

Und wenn Du es an die Synology mountest, könntest Du versuchen mehrere GByte hin/herzuschaufeln?

P.S. Bitte nicht Backup vergessen ....
- Bitte um Input für Output
- When there is a Shell, there is a Way
- Wann war Dein letztes Backup?

Wie man Fragen stellt: https://tty1.net/smart-questions_de.html

Bartimaus

Hm, ich denke ext4

Ist ein Qnap.
Backup mache ich täglich
LG
B.


FHEM@AMD-Ryzen7-5700U@Debian-LXC (ProxmoxHOST), CUL1101,FS20,IT,DS18B20,DS2413(Heizungslogger),DS2423(Stromlogger)Homematic,HM-LAN,ZWave,MiniCULs,Shelly

AxelSchweiss

Ist ein ext4  ;)
Sep 13 12:26:57 bananapi kernel: [1897592.497738] EXT4-fs warning (device sda1): ext4_end_bio:249: I/O error writing to inode 4326714 (offset 3670016 size 524288 start$

Hat die Platte sowas wie einen Autopowerdown?
Dann könnte es nämlich sein das sie nicht wieder oder zu langsam aufwacht.

Mach mal einen "Extended Self Test" mit den smartmontools.
Das Ergebnis siehst du dann, wenn er durchgelaufen ist, am Ende des Listings bei smartctl -a

Das sieht dann z.B.: so aus:

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     32383         -
# 2  Short offline       Completed without error       00%     32359         -
# 3  Short offline       Completed without error       00%     32335         -
# 4  Extended offline    Completed without error       00%     32317         -
# 5  Short offline       Completed without error       00%     32287         -
# 6  Short offline       Completed without error       00%     32263         -
# 7  Short offline       Completed without error       00%     32239         -
# 8  Short offline       Completed without error       00%     32215         -


Bartimaus

Der extended-Test soll 106min dauern lt den smarttools. Liegt in der Zeit dann alles brach, bzw. kann man den Test auch stoppen ?
LG
B.


FHEM@AMD-Ryzen7-5700U@Debian-LXC (ProxmoxHOST), CUL1101,FS20,IT,DS18B20,DS2413(Heizungslogger),DS2423(Stromlogger)Homematic,HM-LAN,ZWave,MiniCULs,Shelly

Wernieman

Nee .. der Test läuft im Hintergrund nur auf der Platte. Teoretisch ist diese in der Zeit langsamer, praktisch dürfte dieses bei Dir (wegen USB-Anschluß) nicht meßbar sein.
- Bitte um Input für Output
- When there is a Shell, there is a Way
- Wann war Dein letztes Backup?

Wie man Fragen stellt: https://tty1.net/smart-questions_de.html

Bartimaus

Danke.

Wie kommst Du dauernd auf den USB-Anschluss ????
LG
B.


FHEM@AMD-Ryzen7-5700U@Debian-LXC (ProxmoxHOST), CUL1101,FS20,IT,DS18B20,DS2413(Heizungslogger),DS2423(Stromlogger)Homematic,HM-LAN,ZWave,MiniCULs,Shelly

Bartimaus

#29
Anbei das Ergebnis des HDDchecks.

Irgendwelche Auffälligkeiten ? Kenne mich da nicht so aus...

smartctl 5.41 2011-06-09 r3365 [armv7l-linux-3.4.104+] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Hitachi Travelstar 5K500.B
Device Model:     Hitachi HTS545032B9A300
Serial Number:    xxxxxxxxxxxxxxxxxxx
LU WWN Device Id: 5 000cca 5f1ef0c22
Firmware Version: PB3OC60N
User Capacity:    320.071.851.520 bytes [320 GB]
Sector Size:      512 bytes logical/physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 6
Local Time is:    Wed Sep 21 16:36:50 2016 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (  645) seconds.
Offline data collection
capabilities:                    (0x5b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off supp                                                                                        ort.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        No Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 106) minutes.
SCT capabilities:              (0x003d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_  FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000b   100   100   062    Pre-fail  Always       -  0
  2 Throughput_Performance  0x0005   100   100   040    Pre-fail  Offline      -  0
  3 Spin_Up_Time            0x0007   170   170   033    Pre-fail  Always       -  2
  4 Start_Stop_Count        0x0012   097   097   000    Old_age   Always       -  4780
  5 Reallocated_Sector_Ct   0x0033   094   094   005    Pre-fail  Always       -  0
  7 Seek_Error_Rate         0x000b   100   100   067    Pre-fail  Always       -  0
  8 Seek_Time_Performance   0x0005   100   100   040    Pre-fail  Offline      -  0
  9 Power_On_Hours          0x0012   054   054   000    Old_age   Always       -  20334
10 Spin_Retry_Count        0x0013   100   100   060    Pre-fail  Always       -  0
12 Power_Cycle_Count       0x0032   099   099   000    Old_age   Always       -  1885
191 G-Sense_Error_Rate      0x000a   100   100   000    Old_age   Always       -  0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -  94
193 Load_Cycle_Count        0x0012   018   018   000    Old_age   Always       -  826541
194 Temperature_Celsius     0x0002   166   166   000    Old_age   Always       -  33 (Min/Max 10/50)
196 Reallocated_Event_Count 0x0032   086   086   000    Old_age   Always       -  1066
197 Current_Pending_Sector  0x0022   100   100   000    Old_age   Always       -  0
198 Offline_Uncorrectable   0x0008   100   100   000    Old_age   Offline      -  0
199 UDMA_CRC_Error_Count    0x000a   200   200   000    Old_age   Always       -  0
223 Load_Retry_Count        0x000a   100   100   000    Old_age   Always       -  0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Completed without error       00%     20328         -
# 2  Short offline       Completed without error       00%      8955         -

SMART Selective self-test log data structure revision number 1
SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
LG
B.


FHEM@AMD-Ryzen7-5700U@Debian-LXC (ProxmoxHOST), CUL1101,FS20,IT,DS18B20,DS2413(Heizungslogger),DS2423(Stromlogger)Homematic,HM-LAN,ZWave,MiniCULs,Shelly