Server going non responsive after a few minutes // IRQ error


Recommended Posts

I posted here this morning about an issue i was having which turned into something else so i figured that this warranted a different thread since the issue no longer matched. What started out as my machine being unresponsive then upon reboot it would work for a few minutes then throw an Disabling IRQ #18 error which made the GUI and any containers/vms no longer function although i could still access the array. One user told me to move my second NIC to another slot which i did but it didnt help, so i removed the 2nd card completely and am now running onboard and im experiencing the same issues without seeing the IRQ #18 error. Since the machine is non responsive i cannot run diagnostics from the gui, tried running it from cmd but it never actually finishes. I thought perhaps it was an issue with 6.3.4 so i downgraded to 6.3.3 but that didnt help either. The best i can do is give you my syslog that i copied via cmd and hopefully that helps to some extent.

 

Not sure if this helps now either but here are the results of cat /proc/interrupts - prior to pulling my other NIC line 18 had eth0 listed

 

EDIT: Diagnostics finally worked so they are attached as well!

root@Storage:~# cat /proc/interrupts
           CPU0       CPU1       CPU2       CPU3       CPU4       CPU5       CPU6       CPU7
  0:         29          0          0          0          0          0          0          0  IR-IO-APIC   2-edge      timer
  1:          1          0          0          0          0          0          0          1  IR-IO-APIC   1-edge      i8042
  8:         53          0          0          1          1          0          0          0  IR-IO-APIC   8-edge      rtc0
  9:          0          0          0          0          0          0          0          0  IR-IO-APIC   9-fasteoi   acpi
 12:          4          0          0          0          0          0          0          0  IR-IO-APIC  12-edge      i8042
 16:         85          2          4          1         13          3          2          9  IR-IO-APIC  16-fasteoi   ehci_hcd:usb1
 18:          0          0          0          0          0          0          0          0  IR-IO-APIC  18-fasteoi   i801_smbus
 19:     282611      26913      18752      13007     103143     102744      81838      40706  IR-IO-APIC  19-fasteoi   ata_piix, ata_piix
 23:       3520        623        597        423       1951       1380        744        694  IR-IO-APIC  23-fasteoi   ehci_hcd:usb2
 24:          0          0          0          0          0          0          0          0  DMAR-MSI   0-edge      dmar0
 25:          0          0          0          0          0          0          0          0  DMAR-MSI   1-edge      dmar1
 28:        289         33         19         23        126         33         25         34  IR-PCI-MSI 2097152-edge      xhci_hcd
 29:          0          0          0          0          0          0          0          0  IR-PCI-MSI 2097153-edge      xhci_hcd
 30:          0          0          0          0          0          0          0          0  IR-PCI-MSI 2097154-edge      xhci_hcd
 31:          0          0          0          0          0          0          0          0  IR-PCI-MSI 2097155-edge      xhci_hcd
 32:          0          0          0          0          0          0          0          0  IR-PCI-MSI 2097156-edge      xhci_hcd
 33:          0          0          0          0          0          0          0          0  IR-PCI-MSI 2097157-edge      xhci_hcd
 34:          0          0          0          0          0          0          0          0  IR-PCI-MSI 2097158-edge      xhci_hcd
 35:          0          0          0          0          0          0          0          0  IR-PCI-MSI 2097159-edge      xhci_hcd
 36:      70877       6830       4978       3240      20018      12940       9281       6576  IR-PCI-MSI 524288-edge      mpt2sas0-msix0
 37:     372337      43280      25194      19402     123527      81982      60890      46134  IR-PCI-MSI 409600-edge      eth0
 38:      58305       4623       3762       2280      12847       9519       7089       5222  IR-PCI-MSI 1048576-edge      mpt2sas1-msix0
 39:      58951       4757       4022       2465      12296       9615       7070       5059  IR-PCI-MSI 1572864-edge      mpt2sas2-msix0
NMI:          0          0          0          0          0          0          0          0   Non-maskable interrupts
LOC:     557747     555080     546294     658922     636080     486156     435353     406441   Local timer interrupts
SPU:          0          0          0          0          0          0          0          0   Spurious interrupts
PMI:          0          0          0          0          0          0          0          0   Performance monitoring interrupts
IWI:          0          0          0          0          0          0          0          0   IRQ work interrupts
RTR:          0          0          0          0          0          0          0          0   APIC ICR read retries
RES:     210030     211826     198842     171955     289050     257279     328987     316112   Rescheduling interrupts
CAL:       8458       7961       8816       7957       9253       8715       7401       6328   Function call interrupts
TLB:       7845       7495       8179       7344       8682       8082       6781       5816   TLB shootdowns
TRM:          0          0          0          0          0          0          0          0   Thermal event interrupts
THR:          0          0          0          0          0          0          0          0   Threshold APIC interrupts
DFR:          0          0          0          0          0          0          0          0   Deferred Error APIC interrupts
MCE:          0          0          0          0          0          0          0          0   Machine check exceptions
MCP:          5          5          5          5          5          5          5          5   Machine check polls
ERR:          0
MIS:          0
PIN:          0          0          0          0          0          0          0          0   Posted-interrupt notification event
PIW:          0          0          0          0          0          0          0          0   Posted-interrupt wakeup event

 

syslog.txt

storage-diagnostics-20170522-1331.zip

Edited by hermy65
Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.