Error ZFS: DEGRADED: too many errors
Esta vez un post muy corto para resolver un fallo del zpool del servidor con el que nos hemos encontrado.
Resulta que al hacer un zpool status nos ofrecía esta salida:
root@mordor:~# zpool status
pool: DataPool
state: DEGRADED
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-9P
scan: scrub repaired 0B in 04:15:16 with 0 errors on Mon Oct 17 23:09:45 2022
config:
NAME STATE READ WRITE CKSUM
DataPool DEGRADED 0 0 0
raidz1-0 DEGRADED 0 0 0
scsi-36b8ca3a0f9c2a100256d4b820c206e41 ONLINE 0 0 0
scsi-36b8ca3a0f9c2a100255ad5bf16c51324 DEGRADED 0 0 0 too many errors
scsi-36b8ca3a0f9c2a1002564623b1739ad27 ONLINE 0 0 0
scsi-36b8ca3a0f9c2a1002564625518c10a56 ONLINE 0 0 0
errors: No known data errors
Pues bien, si el error no es muy grave todavía podemos arreglar el fallo ejecutando:
root@mordor:~# zpool clear DataPool
root@mordor:~# zpool scrub DataPool
root@mordor:~# zpool status
pool: DataPool
state: ONLINE
status: Some supported features are not enabled on the pool. The pool can
still be used, but some features are unavailable.
action: Enable all features using 'zpool upgrade'. Once this is done,
the pool may no longer be accessible by software that does not support
the features. See zpool-features(5) for details.
scan: scrub in progress since Thu Oct 20 11:21:00 2022
305G scanned at 76.4G/s, 176K issued at 44K/s, 4.97T total
0B repaired, 0.00% done, no estimated completion time
config:
NAME STATE READ WRITE CKSUM
DataPool ONLINE 0 0 0
raidz1-0 ONLINE 0 0 0
scsi-36b8ca3a0f9c2a100256d4b820c206e41 ONLINE 0 0 0
scsi-36b8ca3a0f9c2a100255ad5bf16c51324 ONLINE 0 0 0
scsi-36b8ca3a0f9c2a1002564623b1739ad27 ONLINE 0 0 0
scsi-36b8ca3a0f9c2a1002564625518c10a56 ONLINE 0 0 0
errors: No known data errors
El zpool clear permite limpiar los errores acumulados y el zpool scrub permite volver a comprobar el estado del zpool
Por supuesto si el error persiste o se repite, contempla como opción substituir el disco afectado.
Como se suele decir es un primer aviso de que el disco está fallando
Comprueba tus backup y verificar que los puedes recuperar. Te van a hacer falta pronto.