Hardware problems log attached.


Recommended Posts

I was told through the unraid gui that i might have hardware problems.

 

 

tower-diagnostics-20171209-1044.zip

 

System:

Model: Custom
M/B: ASRock - EP2C602-4L/D16
CPU: Intel® Xeon® CPU E5-2670 0 @ 2.60GHz
HVM: Enabled
IOMMU: Enabled
Cache: 512 kB, 2048 kB, 20480 kB
Memory: 128 GB (max. installable capacity 192 GB)
Network: eth0: 1000 Mb/s, full duplex, mtu 1500 
 eth1: not connected
 eth2: not connected
 eth3: not connected
Kernel: Linux 4.9.30-unRAID x86_64
OpenSSL: 1.0.2k
 
In the log I notice there was errors regarding Ram Scrubbing?
Edited by deaerator
Link to comment
1 hour ago, deaerator said:

In the log I notice there was errors regarding Ram Scrubbing?

Searching through the syslog, there is no mention of "scrub"

 

1 hour ago, deaerator said:

I was told through the unraid gui that i might have hardware problems.

Any notification would have been through Fix Common Problems, and there is no record of FCP finding anything other than docker updates available.  What is this notification that you were talking about?

Link to comment

Now it feels like a buyer beware kind of deal.

 

I purchased the Motherboard & Ram as a combo; made the assumption that it will work together. 

 

.

 

Turns out the ram is not the approved QVL, but it worked in my system for over a year, without any major problems until now.

y4mY2qLUSKAgtWpjmRUR3vrjS1q8rIVf9ax-Irw8

 

 

 

Edited by deaerator
Link to comment
5 minutes ago, deaerator said:

but it worked in my system for over a year, without any major problems until now.

Memory can (and does) go bad.  No different than anything else.  QVL doesn't necessarily mean that the memory won't work with the mobo.  It simply means that if its on the list that it will work (if its not bad)

Edited by Squid
Link to comment

You could try pulling a couple of sticks (perhaps from each CPU bank) and see if that clears up the problem.  If it is a problem with the line drivers/RAM, that may fix the problem.  (I am not certain from what you wrote if you had any indication of a problem BEFORE you got the notice from Fix Common Problems.  If that is the case, the issue may always have been there but you just were not aware of it.  With EEC, A single bit read error should be detected and a reread should result which may return the proper data.  The system may be a bit slower but it probably won't crash and burn.)

Link to comment

Part of your problem though is that if you can't find this

1 hour ago, johnnie.black said:

ECC errors on Supermicro boards are logged on the boards's event log, I would assume it would be the same with Asrock.

 

then if the errors are being corrected properly a memtest will pass with flying colors and you will have done a ton of work with no results.

Link to comment

Then you need to talk to AsRock regarding this

Dec 11 04:40:08 Tower root: Hardware event. This is not a software error.
Dec 11 04:40:08 Tower root: MCE 0
Dec 11 04:40:08 Tower root: CPU 8 BANK 8 
Dec 11 04:40:08 Tower root: MISC 1221020002000e8c ADDR 1e3872c000 
Dec 11 04:40:08 Tower root: TIME 1512963915 Sun Dec 10 23:45:15 2017
Dec 11 04:40:08 Tower root: MCG status:
Dec 11 04:40:08 Tower root: MCi status:
Dec 11 04:40:08 Tower root: Corrected error
Dec 11 04:40:08 Tower root: MCi_MISC register valid
Dec 11 04:40:08 Tower root: MCi_ADDR register valid
Dec 11 04:40:08 Tower root: MCA: MEMORY CONTROLLER MS_CHANNEL0_ERR
Dec 11 04:40:08 Tower root: Transaction: Memory scrubbing error
Dec 11 04:40:08 Tower root: MemCtrl: Corrected patrol scrub error
Dec 11 04:40:08 Tower root: STATUS 8c000047000800c0 MCGSTATUS 0
Dec 11 04:40:08 Tower root: MCGCAP 1000c14 APICID 20 SOCKETID 1 
Dec 11 04:40:08 Tower root: CPUID Vendor Intel Family 6 Model 45

Memory scrubs are when the system attempts to read & write to every ram location during idle periods to see if the ram is good / bad.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.