m1a8x2

Members
  • Posts

    37
  • Joined

  • Last visited

m1a8x2's Achievements

Noob

Noob (1/14)

1

Reputation

1

Community Answers

  1. Update: I removed the GPU and USB cards and have now been running for over 2 days without a crash. Still not positive which card was the issue.
  2. I removed the GPU and USB card from the server and went down to four sticks of 8gb ECC memory. That is back to how I had the server before I made all the upgrades apart from the new PSU. It ran for about 5 hours fine, at which point I decided to try a reboot. Again, it never rebooted only shut down.
  3. Yeah, I have no idea what those rsyslog entries are about. I've removed the leads going to the reset switch headers already as I have two kids and a dog (but they're never never in my office with me being in here too). And yes, all of my Memory is ECC.
  4. Sorry, I forgot to attach the syslog. I had already setup to mirror. Attaching now. I forgot to mention I also added my USB PCI-e card for passthrough (which was previously used in my server for a VM passthrough). I did the CPU swap and removed the GPU in one go. I then removed the Nvidia driver and next added the new GPU. Probably not enough time in between, but I did undo each change one at a time. I removed the GPU and USB card first (but didn't put back the old one or the Nvidia driver), then I replaced the power supply (500w to an 800w, which I calculated should be enough). After that, I went back to the old CPUs. Then things seemed okay so I added the GPU back. Then the USB card. Crashes became more frequent during that time. syslog.txt
  5. My server has been running great for a couple years now until recently. I made the mistake of making several changes at once, so I can't pinpoint what the issue is. Things I recently did: replaced a GPU removed Nvidia drivers to use new GPU for passthrough upgraded CPUs upgraded Unraid added new windows 11 VM I noticed this happening after the CPUs and GPU upgrade (which is the same time I removed the Nvidia drivers). The weird thing is I first noticed an issue when rebooting. The server would go down for the reboot, but wouldn't turn back on automatically. I had to manually turn it back on. Fast forward a few hours later and I notice I can't connect to the server. I go and check it and discover it's turned off. I boot it back up and it's showing an unclean shutdown. This happened a few more times so I swapped the power supply thinking that was the issue. Still happening at random. I ended up swapping out the CPUs to my old ones to make sure they weren't the issue. It seems to run more stable for a bit but then another shutdown. Today only I've already had it crash three times while trying to get diagnostics and syslog saved. Any help is greatly appreciated. I've safely powered the server off for now while I wait to troubleshoot more. krieger-diagnostics-20220329-1627.zip
  6. I'm curious for your next update. I'm having similar issues after upgrading my CPUS to "unsupported" models as well. The crashing seems so random, but that's the one thing I'm seeing in common with my issue and yours.
  7. I'm guessing that I'm not in luck here... I can't see my old disk5 or disk4 at this time. I'm able to assign my cloned disk5 but I can't assign anything to disk4. And disk2 is showing SMART error on the dashboard. I was able to backup all my sensitive data to another external drive (and will be backing up to this drive regularly and looking into a cloud backup solution as well). At this point I might just clear the array of assignments, remove the failed drives and populate the server with my two new 8tb drives and this 4tb I bought for the cloning. While it sucks to lose this much data (I lost around 6tb to a lightning strike about 5 years ago, you'd think I would've learned my lesson), I think I'd be better off getting all healthy drives added now and starting fresh. I will also be looking into getting another 8tb drive for a double parity. Let me know if I'm missing any other options here... otherwise I'll start rebuilding in the next hour. Thanks for all your help trying to save my drives!
  8. No, I didn't start the array again. I stopped the array in order to generate the new config. I accidentally change some drive assignments before generating that, so I changed them back and then generated the new config. I was able to assign my cloned drive as disk5 but now my old disk5 is not showing up to assign as disk4.
  9. Also, after refreshing the page I no longer see my old disk5 available to temporarily assign to disk4.
  10. Okay, I've created the new config. While assigning drives I see this next to parity: All existing data on this device will be OVERWRITTEN when array is Started. Is that normal?
  11. What would you suggest I do here? I'm really trying to save whatever I can. I have 10tb of data on this array and don't want to lose it all if I don't have to...
  12. ddrescue has finished. rescued: 1444 GB, tried: 2556 GB, bad-sector: 1175 GB, bad areas: 1765 Current status ipos: 4000 GB, non-trimmed: 0 B, current rate: 0 B/s opos: 4000 GB, non-scraped: 0 B, average rate: 0 B/s non-tried: 0 B, bad-sector: 2556 GB, error rate: 27596 kB/s rescued: 1444 GB, bad areas: 1764, run time: 11h 43m 30s pct rescued: 36.09%, read errors:2697716168, remaining time: n/a time since last successful read: n/a Finished What's next? I now have SMART warnings for disk 2 about current pending sectors. Is that drive failing now too or is that read errors from running ddrescue or something else?
  13. Good call. I'm going to dump as much as I can onto another external drive.
  14. No I don't have any email notifications unfortunately. I need to get those setup once I'm out of the woods here. I can live with some data loss I suppose. I'm okay with losing media files, but it's pictures and files from school/work I don't want to lose. So far, when I connect to my shares on another PC it appears most all of my important files are safe. I'm assuming this parity would need to be the same size as the other parity drive? Right now I have a 5tb parity, and five 4tb data drives. I just added a 4tb drive for this recovery and I have two new 8tb drives to swap into the array/parity.
  15. So what's my best approach here? What's my endgame - will I be able to restore my array in any way or am I completely screwed here?