whiteatom Posted June 22, 2017 Share Posted June 22, 2017 (edited) Hi there, I'm having an issue with my box not recognizing a new drive and I'm not sure how to troubleshoot the issue. When I plug the drive in (hot swap cage), I get the following in the dmesg: [ 3946.015580] ata10.00: exception Emask 0x10 SAct 0x4000 SErr 0x780000 action 0x6 frozen [ 3946.015582] ata10.00: irq_stat 0x08000000, interface fatal error [ 3946.015584] ata10: SError: { 10B8B Dispar BadCRC Handshk } [ 3946.015586] ata10.00: failed command: READ FPDMA QUEUED [ 3946.015590] ata10.00: cmd 60/08:70:00:00:00/00:00:00:00:00/40 tag 14 ncq dma 4096 in res 40/00:70:00:00:00/00:00:00:00:00/40 Emask 0x10 (ATA bus error) [ 3946.015591] ata10.00: status: { DRDY } [ 3946.015595] ata10: hard resetting link [ 3946.478566] ata10: SATA link up 3.0 Gbps (SStatus 123 SControl 330) [ 3946.481529] ata10.00: configured for UDMA/133 [ 3946.481540] ata10: EH complete But, the drive never gets a /dev/sd-letter assigned. Any ideas where to look next? Edited June 22, 2017 by whiteatom Quote Link to comment
whiteatom Posted June 22, 2017 Author Share Posted June 22, 2017 New cable, no change. SATA card?? Quote Link to comment
whiteatom Posted June 22, 2017 Author Share Posted June 22, 2017 Oh.. now I think about it, this has only been a problem since the most recent update. Any change to the drivers? Quote Link to comment
JorgeB Posted June 22, 2017 Share Posted June 22, 2017 [ 3946.015584] ata10: SError: { 10B8B Dispar BadCRC Handshk } CRC errors 9 times out of 10 are a bad SATA cable, but it can also be a bad SATA port/backplane or even disk Quote Link to comment
whiteatom Posted June 23, 2017 Author Share Posted June 23, 2017 Ok, reseated the card and tried a second replacement SATA cable. Same issue. The problems seem to be isolated to when I connect 2 disks to my HighPoint Rocket 62x controller. This card was working perfectly by running the enable_ahc script. I realize now that this might have stopped working when I upgraded to UnRAID 6. I spent ages working the array when I added the second parity drive, and "a disk failed". It was a redundant disk anyway, so I just shrunk the array and ignored it but now I realize that it was the second disk on this controller so it's likely a driver issue. Does anyone know what's involved in getting this controller working on V6? Quote Link to comment
JorgeB Posted June 23, 2017 Share Posted June 23, 2017 I believe it uses a common Marvell chip should it should work with the standard AHCI driver, post your diagnostics. Quote Link to comment
whiteatom Posted June 24, 2017 Author Share Posted June 24, 2017 (edited) OK! So i think I have found the problem. I replaced the CPU in my system (with a Xeon for Plex transcoding) a few weeks ago - and that required a BIOS upgrade. I have a feeling that the new bios defaulted to AHCI disabled. That means that my HPT card may not be working correctly. In the middle of a big copy now, but as soon as it's over, I'll be reboot and check it. Diagnostics from the other day (nothing has really changed since then) attached. Can you see if the Marvell chip on that card is being recognized? I have a Marvell chipset on the MB as well that is working fine. Thanks for your input! knox-diagnostics-20170621-1918.zip PS: I have tried 3 cables and 3 disks with no major difference. Today I did manage to get a disk to mount, but as soon as I put it under any heavy load, I started getting write errors (cp: cannot create regular file '...': Input/output error) even though it's still mounted rw. Edited June 24, 2017 by whiteatom Quote Link to comment
JorgeB Posted June 24, 2017 Share Posted June 24, 2017 On those diagnostics, and besides the onboard controller, there's only one more controller, one that looks like a SASLP, there are no other, with or without a loaded driver. Quote Link to comment
whiteatom Posted June 24, 2017 Author Share Posted June 24, 2017 Ok, we'll this is clearly the problem... my HPT 62x is not being recognized. The only ACHI setting in the BIOS was for the on board controller and it was enabled. As far as I can tell this happened when I upgraded the CPU/BIOS, so I'm unsure what to do next. Any idea why this would be recognized? Quote Link to comment
JorgeB Posted June 24, 2017 Share Posted June 24, 2017 42 minutes ago, whiteatom said: Any idea why this would be recognized? No, but it's not software/driver related, try a different slot if available. This also means the the earlier ATA errors you posted are from a different disk, one with a bad SATA cable. Quote Link to comment
whiteatom Posted June 24, 2017 Author Share Posted June 24, 2017 34 minutes ago, johnnie.black said: No, but it's not software/driver related, try a different slot if available. This also means the the earlier ATA errors you posted are from a different disk, one with a bad SATA cable. No, they are for sure from this disk - all my disks are 3tb reds, except an SSD, and the drives plugged into this card. When I have one disk plugged into the HPT card, it works perfectly. When I put a second disk in to it, there are intermittent errors like the ones I originally posted. This was the same problem I had when I first got the card until in found the enable_achi script linked above. Now it's not being recognized again so we're back to a basic default operation (that I don't really understand how it works). Ill try a different slot and report back! Quote Link to comment
whiteatom Posted June 25, 2017 Author Share Posted June 25, 2017 OK! I have a bit more info here. Updated the IPMI firmware - read somewhere if you upgrade the BIOS without the IPMI you can get troubles. Moved the HighPoint RocketRaid 62X card to a different slot and I get exactly the same thing. No reference to the card in the dmesg, but a single drive on port 1, will work. The only Hitachi drive in the system is attached to my HPT card, but the card is not recognized in the dmesg. Here is the drive getting recognized and I can use it properly in the OS. [ 11.309998] ata9: SATA link up 6.0 Gbps (SStatus 133 SControl 330) [ 11.311615] ata9.00: ATA-8: Hitachi HDS5C3020ALA632, ML0220F313PSBD, ML6OA580, max UDMA/133 [ 11.311921] ata9.00: 3907029168 sectors, multi 0: LBA48 NCQ (depth 31/32), AA [ 11.313574] ata9.00: configured for UDMA/133 [ 11.313941] scsi 10:0:0:0: Direct-Access ATA Hitachi HDS5C302 A580 PQ: 0 ANSI: 5 [ 11.325619] random: crng init done [ 11.329146] sd 10:0:0:0: Attached scsi generic sg7 type 0 [ 11.329160] sd 10:0:0:0: [sdh] 3907029168 512-byte logical blocks: (2.00 TB/1.82 TiB) [ 11.329238] sdg: sdg1 [ 11.329315] sd 10:0:0:0: [sdh] Write Protect is off [ 11.329317] sd 10:0:0:0: [sdh] Mode Sense: 00 3a 00 00 [ 11.329360] sd 10:0:0:0: [sdh] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA New diagnostics attached. In the lspci I do see the device now as a SATA controller, but it seems to also list a IDE interface - so is there an AHCI issue still? Any more input would be VERY much appreciated! knox-diagnostics-20170624-2221.zip Quote Link to comment
JorgeB Posted June 25, 2017 Share Posted June 25, 2017 It's being detected and loading both AHCI and IDE drivers, this is kind of strange but some HPT controller are, I suspect some incompatibility and probably there's not much you can do about it. 02:00.0 SATA controller [0106]: Marvell Technology Group Ltd. 88SE9120 SATA 6Gb/s Controller [1b4b:9120] (rev 12) Subsystem: Marvell Technology Group Ltd. 88SE9120 SATA 6Gb/s Controller [1b4b:9120] Kernel driver in use: ahci Kernel modules: ahci 02:00.1 IDE interface [0101]: Marvell Technology Group Ltd. 88SE912x IDE Controller [1b4b:91a4] (rev 12) Subsystem: Marvell Technology Group Ltd. 88SE912x IDE Controller [1b4b:91a4] Kernel driver in use: pata_marvell Kernel modules: pata_marvell Marvell controllers are not recommended with v6 anyway so recommend you get a new controller, if 2 ports are enough look for an Asmedia 1601, they can be found for about 15$, and they work great with unRAID. Quote Link to comment
JorgeB Posted June 25, 2017 Share Posted June 25, 2017 Are you still using the ahci script? If so try without it. Quote Link to comment
whiteatom Posted June 25, 2017 Author Share Posted June 25, 2017 (edited) Hmm.. so this is a V6 upgrade issue. Marvell not recommended? Does that mean my MB controller is likely to cause problems at some point as well? I'll try commenting out the ahci script when I get home. Edited June 25, 2017 by whiteatom Quote Link to comment
JorgeB Posted June 25, 2017 Share Posted June 25, 2017 34 minutes ago, whiteatom said: Marvell not recommended? Don't know if it's the chips or the drivers, but they're a source of problems for many users and not recommended for v6, though some people continue to use them without issues 37 minutes ago, whiteatom said: I'll try commenting out the ahci script when I get home. I assumed you removed it after the upgrade, it's not needed for v6. Quote Link to comment
whiteatom Posted June 25, 2017 Author Share Posted June 25, 2017 Ok.. that was not clear from the instructions.. as I said before, I'll comment it out and reboot and see what happens... Thanks for all your input on this! Quote Link to comment
BobPhoenix Posted June 26, 2017 Share Posted June 26, 2017 The Marvel 9230 controller built into my MB drops drives. I had to leave it empty and use the Intel only controllers on the MB and an LSI add it card. Quote Link to comment
whiteatom Posted June 26, 2017 Author Share Posted June 26, 2017 (edited) Ok.. that was not clear from the instructions.. Tried with out the script and I basically get the same thing where the card is found as both SATA and IDE. Between this card and the Marvell controller on the board, I have just ordered a second SASLP, so we'll see if that fixes me up! Thanks again all for the help. Edited June 26, 2017 by whiteatom Quote Link to comment
whiteatom Posted June 26, 2017 Author Share Posted June 26, 2017 (edited) So... I just had a thought.. Isn't the SASLP a Marvell controller? Are these reliable in V6? I haven't had any issues, but before my order ships I better check no one else is having problems!! Edited June 26, 2017 by whiteatom Quote Link to comment
Squid Posted June 26, 2017 Share Posted June 26, 2017 16 minutes ago, whiteatom said: So... I just had a thought.. Isn't the SASLP a Marvell controller? Are these reliable in V6? I have zero problems with them, but unfortunately its not something that can be determined ahead of time Quote Link to comment
whiteatom Posted June 26, 2017 Author Share Posted June 26, 2017 I haven't either, but this comment made me realize there could be issues that I haven't had yet (I have a SASLP, and I just ordered an SAS2LP for continued expansion). On 2017-06-25 at 5:07 AM, johnnie.black said: Marvell controllers are not recommended with v6 anyway so recommend you get a new controller, if 2 ports are enough look for an Asmedia 1601, they can be found for about 15$, and they work great with unRAID. What's the "recommended" 8 port SATA card now? (or 2x SAS for breakout cables) Quote Link to comment
JorgeB Posted June 26, 2017 Share Posted June 26, 2017 Any LSI with the SAS2008 chip, eg, 9211-8i and clones Quote Link to comment
SSD Posted June 26, 2017 Share Posted June 26, 2017 1 hour ago, johnnie.black said: Any LSI with the SAS2008 chip, eg, 9211-8i and clones Johnnie - Notice you routinely recommend the 9211-8i and not the 9201-8i. The latter being a pure HBA with no need for firmware updates. Curious if there is a reason. Is the 9211 superior in some way? I have the 9201 and it works great, and have recommended to several people that have also had a good experience. Just curious. Quote Link to comment
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.