6.3.3 Now Log full of never before seen errors


Good day crew. Strange one here for more knowledgeable eyes than my own: what needs to be done?

 

After the first server was updated without issue (parity check, data use OK, etc) I updated my second server.

Shortly after reboot the log started clogging with a set of errors that I have never before seen.

 

Apr 3 07:18:42 Tower2 kernel: ata1: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen

 

Diagnostics attached

 

Thanks for looking!

 

tower2-diagnostics-20170403-0925.zip

Normally for this kind of error I'd say check your cabling, reseat controllers, etc., but it looks like there is nothing attached to that port.

What unRaid OS version did you upgrade from?

6.3.2

This morning I stopped the array and rebooted. After an hour of idling, the log is full of the same errors again.


Bump... Any idea what to do on this one, crew? Anything else I can try?

 

I have tried restarting, running parity, running mover, spin-ups, and spin-downs. No rhyme or reason that I've noticed to this error cropping up. The clean/safe boot option also throws the errors. Memtest is fine.

 

...

Or do I live with these strange errors, since no device is on the troublesome port, and hope my complacency with them doesn't cause me to miss a real one?


No Joy :(

 

Apr 8 22:19:49 Tower2 kernel: ata1: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen
Apr 8 22:19:49 Tower2 kernel: ata1: irq_stat 0x00000040, connection status changed
Apr 8 22:19:49 Tower2 kernel: ata1: SError: { DevExch }
Apr 8 22:19:49 Tower2 kernel: ata1: hard resetting link
Apr 8 22:19:50 Tower2 kernel: ata1: SATA link down (SStatus 0 SControl 300)
Apr 8 22:19:50 Tower2 kernel: ata1: EH complete
Apr 8 22:27:44 Tower2 kernel: ata1: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen
Apr 8 22:27:44 Tower2 kernel: ata1: irq_stat 0x00000040, connection status changed
Apr 8 22:27:44 Tower2 kernel: ata1: SError: { DevExch }
Apr 8 22:27:44 Tower2 kernel: ata1: hard resetting link
Apr 8 22:27:45 Tower2 kernel: ata1: SATA link down (SStatus 0 SControl 300)
Apr 8 22:27:45 Tower2 kernel: ata1: EH complete
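
For anyone reading these lines later: the kernel already decodes the SErr value as `{ DevExch }`, and the same decoding can be done by hand to see that nothing else is set. The sketch below is only an illustration. The bit-to-label table is my recollection of the labels the Linux libata error handler prints, so treat it as an assumption, and `decode_serr` is a made-up helper name.

```python
# Bit positions and labels as (to my recollection) printed by the Linux
# libata error handler in its "SError: { ... }" line. Assumption, not verified
# against the kernel version in this thread.
SERR_BITS = {
    0: "RecovData",
    1: "RecovComm",
    8: "UnrecovData",
    9: "Persist",
    10: "Proto",
    11: "HostInt",
    16: "PHYRdyChg",
    17: "PHYInt",
    18: "CommWake",
    19: "10B8B",
    20: "Dispar",
    21: "BadCRC",
    22: "Handshk",
    23: "LinkSeq",
    24: "TrStaTrns",
    25: "UnrecFIS",
    26: "DevExch",
}


def decode_serr(value):
    """Return the labels of every known bit that is set in an SErr value."""
    return [name for bit, name in SERR_BITS.items() if value & (1 << bit)]


if __name__ == "__main__":
    # SErr value taken from the log lines above.
    print(decode_serr(0x4000000))
```

0x4000000 is bit 26 alone, so the only flag raised is the "device exchanged" one. That fits the "connection status changed" and "SATA link down" lines around it.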


WTF....

My parity check normally hovers around 150 MB/sec and is done within 6 hours.  

It is going on 11 hours, showing 8 remaining, and is stuck down at 32 MB/sec.

...

edit:  over 20 hours to complete the parity check and pages of these strange errors.

Activity started on Sun 09 Apr 2017 08:48:17 PM CDT (today), finding 0 errors.
Last result: 20 hours, 35 minutes, 18 seconds. Average speed: 54.0 MB/s
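
As a sanity check on those figures, elapsed time multiplied by average speed gives the amount of data read per disk. This is a minimal sketch, assuming the reported "MB" means 10^6 bytes. The function names are made up for illustration.

```python
def elapsed_seconds(hours, minutes, seconds):
    """Convert an h/m/s duration to seconds."""
    return hours * 3600 + minutes * 60 + seconds


def data_read_mb(duration_s, avg_mb_per_s):
    """Data read per disk, in MB, for a run at the given average speed."""
    return duration_s * avg_mb_per_s


if __name__ == "__main__":
    # Figures from the "Last result" line above.
    duration = elapsed_seconds(20, 35, 18)
    total_mb = data_read_mb(duration, 54.0)
    print(duration, "seconds")
    # Assumes 1 TB = 1,000,000 MB (decimal units).
    print(round(total_mb / 1_000_000, 2), "TB")
```

That works out to roughly 4 TB, which would be consistent with a 4 TB parity drive, though the disk size is not stated anywhere in this thread.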

Now I am reverting from the 'previous' folder on flash (6.3.2) and will put the server through its paces, then report back.

 

  • 2 weeks later...

My Tower (D525) has been massively upgraded with an E5-1650v3 :)

 

My Tower2's faulty motherboard has been replaced with the D525 setup (same mobo, ram, Dell Perc H310 controller).

***Attached eSata cable to Sata-3 onboard

***Attached Optical disk drive cable to Sata-4 onboard

***Attached Cache disk drive cable to Sata-0 onboard  ***this SSD is BTRFS***

***2 parity and 4 data disks remain on the Dell Perc H310 (crossflashed to LSI IT mode)

 

MMMM... after boot I have this:

 

Apr 21 13:55:27 Tower2 kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen t4

Apr 21 13:55:27 Tower2 kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen t3

Apr 21 13:56:27 Tower2 kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen t2

Apr 21 13:57:29 Tower2 kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen t1
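
To look for a pattern in when these exceptions fire, the timestamps can be pulled out per port and the gaps between them listed. This is a rough sketch only: the regular expression is written against the line format shown above, the `year` default is an assumption because syslog lines carry no year, and month names are parsed assuming an English locale. The helper names are made up.

```python
import re
from collections import defaultdict
from datetime import datetime

# Matches lines like:
# "Apr 21 13:55:27 Tower2 kernel: ata5: exception Emask 0x10 ..."
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2})\s+\S+\s+kernel:\s+"
    r"(?P<port>ata\d+): exception "
)

# The four lines posted above.
SAMPLE = [
    "Apr 21 13:55:27 Tower2 kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen t4",
    "Apr 21 13:55:27 Tower2 kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen t3",
    "Apr 21 13:56:27 Tower2 kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen t2",
    "Apr 21 13:57:29 Tower2 kernel: ata5: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen t1",
]


def exception_times(lines, year=2017):
    """Map each ATA port to the timestamps of its 'exception' lines."""
    times = defaultdict(list)
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m:
            stamp = datetime.strptime(f"{year} {m['stamp']}", "%Y %b %d %H:%M:%S")
            times[m["port"]].append(stamp)
    return dict(times)


def gaps_in_seconds(stamps):
    """Seconds between consecutive timestamps."""
    return [int((b - a).total_seconds()) for a, b in zip(stamps, stamps[1:])]


if __name__ == "__main__":
    for port, stamps in exception_times(SAMPLE).items():
        print(port, len(stamps), "exceptions, gaps (s):", gaps_in_seconds(stamps))
```

Feeding it the full syslog instead of `SAMPLE` would show whether the errors cluster right after boot or keep recurring at regular intervals.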

 

 


UGG!

Yanked the disc drive.... errors gone

Reverted to original Asus mobo & i5-2500... errors gone

Reverted to original SASLP.... errors gone.

 

Tossed X7SPA and Dell perc back in... as I 'believe' the hardware is better quality than the Asus & i5-2500...

And this is for a basic backup NAS
