Friday, March 16, 2012

ZFS scare : state:UNAVAIL , scsi device files changed


# zpool status
  pool: vault
 state: UNAVAIL
status: One or more devices could not be used because the label is missing
or invalid.  There are insufficient replicas for the pool to continue
functioning.
action: Destroy and re-create the pool from
a backup source.
   see: http://zfsonlinux.org/msg/ZFS-8000-5E
 scan: none requested
config:

NAME        STATE     READ WRITE CKSUM
vault       UNAVAIL      0     0     0  insufficient replicas
 raidz2-0  UNAVAIL      0     0     0  insufficient replicas
   sdc     UNAVAIL      0     0     0
   sdd     UNAVAIL      0     0     0
   sde     FAULTED      0     0     0  corrupted data
   sdf     FAULTED      0     0     0  corrupted data
   sdg     FAULTED      0     0     0  corrupted data
   sdh     FAULTED      0     0     0  corrupted data

Why did this happen?  I have been changing around my disk layouts in the main server, and in doing so the drive order shifted down by two.  After much panicing I took a leap of faith and did an export and import.


# zpool export vault
# zpool import vault
# zpool status
  pool: vault
 state: ONLINE
 scan: scrub repaired 0 in 15h3m with 0 errors on Thu Mar  1 22:09:47 2012
config:

NAME        STATE     READ WRITE CKSUM
vault       ONLINE       0     0     0
 raidz2-0  ONLINE       0     0     0
   sde     ONLINE       0     0     0
   sdf     ONLINE       0     0     0
   sdg     ONLINE       0     0     0
   sdh     ONLINE       0     0     0
   sdi     ONLINE       0     0     0
   sdj     ONLINE       0     0     0

Thank goodness.

Now I need to figure out how to convert to import disk/by-id instead of the scsi device file so this non-sense doesn't happen again.


# zpool import -d /dev/disk/by-id/ vault
# zpool status
  pool: vault
 state: ONLINE
 scan: scrub repaired 0 in 15h3m with 0 errors on Thu Mar  1 22:09:47 2012
config:

NAME                                            STATE     READ WRITE CKSUM
vault                                           ONLINE       0     0     0
 raidz2-0                                      ONLINE       0     0     0
   ata-Hitachi_HDS5C3030ALA630_MJ13....  ONLINE       0     0     0
   ata-WDC_WD30EZRX-00MMMB0_WD-W....    ONLINE       0     0     0
   ata-Hitachi_HDS5C3030ALA630_MJ13...  ONLINE       0     0     0
   ata-WDC_WD30EZRX-00MMMB0_WD-W....    ONLINE       0     0     0
   ata-Hitachi_HDS5C3030ALA630_MJ13....  ONLINE       0     0     0
   ata-WDC_WD30EZRX-00MMMB0_WD-W....    ONLINE       0     0     0


Bingo.
Problem solved.

Special thanks to the following thread for tips:  http://forums.gentoo.org/viewtopic-t-838534-start-0.html

No comments: