Pending sector. Device disabled. What to do?



2 minutes ago, johnnie.black said:

You can start by checking the file system on the emulated disk1 (md1):

 

https://lime-technology.com/wiki/index.php/Check_Disk_Filesystems#Drives_formatted_with_XFS

 

Do you have a spare for the rebuild?

What would be preferred: rebuilding onto a new disk tomorrow and shutting down the server for now?

I don't want to mess with it more than needed at this point.

Link to comment
If you can shut down until you get the new disk, you could get your array protected again before starting the filesystem repair. You can use the drive manufacturer's diagnostic software to test the new disk on another computer rather than preclearing it, since a rebuild does not require a clear disk.

 

Do you have backups of any irreplaceable data?

Link to comment
Yes, I have everything backed up to Crashplan. 

 

1 minute ago, johnnie.black said:

I would fix the file system first; then, if you don't need the server, leave it shut down until the spare is ready.

 

Also keep the old disk intact until the rebuild is done.

OK, so it would do no harm to run the xfs_repair -v /dev/md1 command now? Would that mean the disk's pending sector will be repaired?

Link to comment
19 hours ago, johnnie.black said:

You'd be running xfs_repair on the emulated disk, not on the old disk1. I would do it before the rebuild: although xfs_repair should fix it with no issues, there would be no point in rebuilding a disk with a corrupt file system that can't be repaired.

 

OK, I tried the command and this is the result:

Phase 1 - find and verify superblock...
        - block cache size set to 1085048 entries
Phase 2 - using internal log
        - zero log...
zero_log: head block 228480 tail block 228474
ERROR: The filesystem has valuable metadata changes in a log which needs to
be replayed.  Mount the filesystem to replay the log, and unmount it before
re-running xfs_repair.  If you are unable to mount the filesystem, then use
the -L option to destroy the log and attempt a repair.
Note that destroying the log may cause corruption -- please attempt a mount
of the filesystem before doing this.

What is best to do next? 
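The choice that the xfs_repair message lays out can be sketched as a small decision helper. This is only an illustration of the order of steps described in the error text above; the function name `next_step` and the `mount_succeeded` parameter are hypothetical, and it does not run any command itself.

```python
# Text that xfs_repair prints when the log still holds unreplayed changes
# (taken from the output quoted above).
LOG_REPLAY_MARKER = "valuable metadata changes in a log"


def next_step(repair_output, mount_succeeded=None):
    """Suggest the next action based on xfs_repair output text.

    mount_succeeded is None if no mount has been attempted yet,
    otherwise True/False for the result of the mount attempt.
    """
    if LOG_REPLAY_MARKER not in repair_output:
        return "no pending log reported; review the rest of the output"
    if mount_succeeded is None:
        return ("mount the filesystem to replay the log, unmount it, "
                "then re-run xfs_repair")
    if mount_succeeded:
        return "unmount and re-run xfs_repair"
    # Last resort only: -L destroys the log and may cause corruption.
    return "mount failed: re-run xfs_repair with -L (destroys the log)"
```

The ordering matters because, as the message itself warns, -L throws away the pending log and should only be used when the filesystem cannot be mounted.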

 

 

Link to comment
2 minutes ago, johnnie.black said:

Looks like there was some corruption. Check the lost+found folder; any files there may be corrupt.

 

If there are corrupt files you should be able to copy most of them from the old disk if needed, but only do that after the rebuild.

 

OK, I have restarted the array and looked on Disk1, and I do indeed see a lost+found folder. There are a lot of files I don't recognise. It could be Crashplan incoming backup data or something. There are some large files as well.

 

Now I can swap the disk for the new one, right? Or do I need to preclear it first?

Link to comment

Many errors:

 

Apr  3 18:16:09 Tower kernel: ata13.00: exception Emask 0x1 SAct 0x1f00 SErr 0x0 action 0x6 frozen
Apr  3 18:16:09 Tower kernel: ata13.00: failed command: READ FPDMA QUEUED
Apr  3 18:16:09 Tower kernel: ata13.00: cmd 60/38:00:c8:05:81/02:00:00:00:00/40 tag 8 ncq dma 290816 in
Apr  3 18:16:09 Tower kernel:         res 41/04:17:e0:06:81/00:01:00:00:00/40 Emask 0x5 (timeout)
Apr  3 18:16:09 Tower kernel: ata13.00: status: { DRDY ERR }
Apr  3 18:16:09 Tower kernel: ata13.00: error: { ABRT }
Apr  3 18:16:09 Tower kernel: ata13.00: failed command: READ FPDMA QUEUED
Apr  3 18:16:09 Tower kernel: ata13.00: cmd 60/08:00:00:08:81/04:00:00:00:00/40 tag 9 ncq dma 528384 in
Apr  3 18:16:09 Tower kernel:         res 41/04:17:e0:06:81/00:01:00:00:00/40 Emask 0x5 (timeout)
Apr  3 18:16:09 Tower kernel: ata13.00: status: { DRDY ERR }
Apr  3 18:16:09 Tower kernel: ata13.00: error: { ABRT }
Apr  3 18:16:09 Tower kernel: ata13.00: failed command: READ FPDMA QUEUED
Apr  3 18:16:09 Tower kernel: ata13.00: cmd 60/f8:00:08:0c:81/03:00:00:00:00/40 tag 10 ncq dma 520192 in
Apr  3 18:16:09 Tower kernel:         res 41/04:17:e0:06:81/00:01:00:00:00/40 Emask 0x5 (timeout)
Apr  3 18:16:09 Tower kernel: ata13.00: status: { DRDY ERR }
Apr  3 18:16:09 Tower kernel: ata13.00: error: { ABRT }
Apr  3 18:16:09 Tower kernel: ata13.00: failed command: READ FPDMA QUEUED
Apr  3 18:16:09 Tower kernel: ata13.00: cmd 60/01:00:00:00:00/00:00:00:00:00/40 tag 11 ncq dma 512 in
Apr  3 18:16:09 Tower kernel:         res 41/04:17:e0:06:81/00:01:00:00:00/40 Emask 0x1 (device error)
Apr  3 18:16:09 Tower kernel: ata13.00: status: { DRDY ERR }
Apr  3 18:16:09 Tower kernel: ata13.00: error: { ABRT }
Apr  3 18:16:09 Tower kernel: ata13.00: failed command: READ FPDMA QUEUED
Apr  3 18:16:09 Tower kernel: ata13.00: cmd 60/01:00:40:64:0d/00:00:47:00:00/40 tag 12 ncq dma 512 in
Apr  3 18:16:09 Tower kernel:         res 41/04:17:e0:06:81/00:01:00:00:00/40 Emask 0x1 (device error)
Apr  3 18:16:09 Tower kernel: ata13.00: status: { DRDY ERR }
Apr  3 18:16:09 Tower kernel: ata13.00: error: { ABRT }
Apr  3 18:16:09 Tower kernel: ata13: hard resetting link

 

And it finally dropped offline:

 

Apr  3 18:16:29 Tower kernel: ata13.00: disabled
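For a long syslog it can help to tally these messages rather than read them one by one. The sketch below is a minimal, hypothetical helper (the name `summarize_ata_errors` and the regular expressions are assumptions based only on the line format shown above); it counts failed commands per ATA port, the reason given in parentheses after Emask on the "res" lines, and any ports reported as disabled.

```python
import re
from collections import Counter

# Patterns are assumptions derived from the log lines quoted above.
FAILED_CMD = re.compile(r"kernel: (ata\d+(?:\.\d+)?): failed command: (.+)$")
RESULT = re.compile(r"Emask 0x[0-9a-f]+ \(([^)]+)\)")
DISABLED = re.compile(r"kernel: (ata\d+(?:\.\d+)?): disabled")


def summarize_ata_errors(lines):
    """Tally failed commands, Emask reasons and disabled ports."""
    failed = Counter()    # (port, command) -> count
    reasons = Counter()   # e.g. "timeout", "device error" -> count
    disabled = set()      # ports the kernel gave up on
    for line in lines:
        m = FAILED_CMD.search(line)
        if m:
            failed[(m.group(1), m.group(2).strip())] += 1
            continue
        m = DISABLED.search(line)
        if m:
            disabled.add(m.group(1))
            continue
        m = RESULT.search(line)
        if m:
            reasons[m.group(1)] += 1
    return {"failed": failed, "reasons": reasons, "disabled": sorted(disabled)}
```

It could be fed the syslog from a diagnostics zip, for example `summarize_ata_errors(open("syslog.txt"))`, where the file name is a placeholder.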

 

Edited by johnnie.black
Link to comment

Is there a specific reason for the errors? I connected the disk to my Mac and it formatted fine.

I will try again to preclear it in Unraid. 

 

 

 

Link to comment

OK, well, it did complete the preclear and I think it started the post-read. But after that it disappeared again, I think.

I restarted the server, but it's saying: status precleared. I then formatted it with the Unassigned Devices plugin and mounted it. This did work.

But when checking the SMART status, the pending sector is still there.
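To track the pending-sector count between attempts without opening the SMART page each time, the raw value can be pulled out of the attribute table. This is a rough sketch that assumes the usual ten-column layout printed by `smartctl -A`, with the attribute name in the second column and the raw value in the tenth; the function name `pending_sectors` is hypothetical.

```python
def pending_sectors(smartctl_output):
    """Return the raw Current_Pending_Sector value, or None if absent.

    Assumes the standard `smartctl -A` attribute table layout:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    """
    for line in smartctl_output.splitlines():
        fields = line.split()
        if len(fields) >= 10 and fields[1] == "Current_Pending_Sector":
            return int(fields[9])
    return None
```

The input text could come from `subprocess.run(["smartctl", "-A", "/dev/sdX"], capture_output=True, text=True).stdout`, where `/dev/sdX` is a placeholder for the actual device.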

tower-diagnostics-20170404-0007.zip

 

Edit 1:

At this moment I am running an erase and clear cycle to see if that fixes it.

 

Edit 2:

No, it didn't. The cycle aborted and the disk disappeared.

tower-diagnostics-20170404-1502.zip

 

Edited by rjbaat
Link to comment
