[Plugin] IPMI for unRAID 6.1+


Recommended Posts

I was talking about the bottom of the page and I think you are correct that 10 would likely be too many. Would it be possible to have a variable display of fan speeds based on speed or maybe associated temp? They would need to have a label though so everyone would know what fan they were looking at.

 

Just an idea.

Link to comment

Thanks for this!  Setup was very straightforward.

 

Also if anyone had input on the number of sensors to display in the footer, I could add more. 2 temps & 2 fans or 3 & 2. I'm not sure how many is too much though.

 

Once we start controlling the fans, I think it will be important to have quick access to all of them.  For me, that would mean 2 temps and 3 fans :) but if we want to cover the 10 fan use case...

 

What do you think about displaying one fan in the footer, but when you mouseover it, there is a flyout div showing details on all of them?  Not sure how hard that would be, but I thought I'd throw it out there

Link to comment

Give it a try now. I just filtered any sensors without the proper data. Also if your using the dynamix temp plugin you should uninstall it and set up temps through this plugin. Who wants to use that lm_sensors witchcraft anyway when you can use ipmi data.

 

Also if anyone had input on the number of sensors to display in the footer, I could add more. 2 temps & 2 fans or 3 & 2. I'm not sure how many is too much though.

All looking good now (Dynamix temp uninstalled as well). Many thanks!

Link to comment

Thanks for this, great tool and I am looking forward to Fan Control!

 

A quick question about my IPMI setup:

Motherboard: Intel S5500WB

CPUs: Intel Xeon CPU L5630  (x2)

 

output of "/usr/bin/ipmitool -vc sdr":

 

root@Tower:~# /usr/bin/ipmitool -vc sdr
Running Get PICMG Properties my_addr 0x20, transit 0, target 0
Error Response 0xc1 from Get PICMG Properities
No PICMG Extenstion discovered
BB +1.1V IOH,1.084,Volts,ok,7.1,System Board,Voltage,1.102,1.067,1.133,,1.177,1.146,,1.027,1.054,0.068,1.190
BB +1.1V P1 Vccp,0.921,Volts,ok,7.1,System Board,Voltage,1.098,0.714,1.482,,1.543,1.501,,0.683,0.708,0.006,1.562
BB +1.5V P1 DDR3,,,ns,7.1,System Board,Voltage,1.492,1.420,1.570,,1.635,1.589,,1.362,1.401,-0.003,1.654
BB +1.8V AUX,1.781,Volts,ok,7.1,System Board,Voltage,1.797,1.743,1.851,,1.927,1.874,,1.673,1.720,-0.013,1.951
BB +3.3V,3.296,Volts,ok,7.1,System Board,Voltage,3.296,3.125,3.454,,3.625,3.525,,2.982,3.067,0.022,3.668
BB +3.3V STBY,3.239,Volts,ok,7.1,System Board,Voltage,3.296,3.125,3.454,,3.625,3.525,,2.982,3.067,0.022,3.668
BB +3.3V Vbat,2.997,Volts,ok,7.1,System Board,Voltage,2.997,2.542,3.139,,3.283,3.192,,2.178,2.502,0.007,3.322
BB +5.0V,4.994,Volts,ok,7.1,System Board,Voltage,4.994,4.744,5.243,,5.538,5.380,,4.471,4.630,-0.182,5.607
BB +5.0V STBY,4.972,Volts,ok,7.1,System Board,Voltage,4.908,4.672,5.143,,5.421,5.272,,4.416,4.565,0.029,5.486
BB +12.0V,12.247,Volts,ok,7.1,System Board,Voltage,11.987,11.363,12.559,,13.287,12.923,,10.739,11.103,0.183,13.443
BB +1.35v P1 MEM,1.378,Volts,ok,7.1,System Board,Voltage,1.354,1.289,1.425,,1.484,1.443,,1.236,1.271,-0.003,1.502
Baseboard Temp,25,degrees C,ok,66.1,Baseboard/Main System Board,Temperature,38.000,20.000,55.000,,66.000,61.000,,5.000,10.000,-128.000,127.000
IOH Therm Margin,-40,degrees C,ok,7.1,System Board,Temperature,-20.000,-40.000,0.000,,10.000,5.000,,,,-128.000,127.000
Mem P1 Thrm Mrgn,-47,degrees C,ok,7.1,System Board,Temperature,-20.000,-40.000,0.000,,10.000,5.000,,,,-128.000,127.000
Processor 1 Fan,3136,RPM,ok,29.1,Fan Device,Fan,8640.000,960.000,16320.000,,,,,640.000,768.000,0.000,16320.000
Processor 1A Fan,640,RPM,Lower Critical,29.2,Fan Device,Fan,8640.000,960.000,16320.000,,,,,640.000,768.000,0.000,16320.000
Processor 2 Fan,3136,RPM,ok,29.5,Fan Device,Fan,8640.000,960.000,16320.000,,,,,640.000,768.000,0.000,16320.000
P1 Therm Margin,-47,degrees C,ok,65.1,Processor,Temperature,-15.000,-70.000,0.000,,,,,,,-128.000,127.000
P1 Therm Ctrl %,0,percent,ok,3.1,Processor,Temperature,0.000,0.000,7.800,,19.500,11.700,,,,0.000,39.000
Pwr Unit Status,01h,ok,21.1,
IPMI Watchdog,03h,ok,7.1,
Physical Scrty,04h,ok,23.1,
FP NMI Diag Int,05h,ok,7.1,
SMI Timeout,06h,ok,7.1,
System Event Log,07h,ok,7.1,
System Event,08h,ok,7.1,
P1 Status,60h,ok,3.1,Presence detected
P1 VRD Hot,66h,ok,3.1,
CATERR,68h,ok,3.1,
CPU Missing,69h,ok,3.1,
IOH Therm Trip,6Ah,ok,7.1,
NM Capabilities,1Ah,ok,46.1,

 

In the IMPI settings page I only get the following options for CPU temp (see attachment):

 

Is there anything I should be doing to enable CPU temp?

 

Thanks again!

 

 

screenshot.png.535ded215d18d425c12331a4be526937.png

Link to comment

From what I read, it seems that some Intel boards don't show CPU temps in ipmi.  They either show a percentage or a difference from the CPU temp limit. Can you connect to your boards ipmi web interface? What it show for cpu temp and does it have a sensor name?

 

This is going to sound like a stupid question I'm sure but; do all IPMI systems have a web-interface?

I have googled "intel ipmi web interface" and found nothing. Could you give me any pointers at all as how to access the IPMI web interface?

 

Currently I CAN get cpu temps through the coretemp sensors.lm module:

 

root@Tower:~# modprobe coretemp
root@Tower:~# sensors
coretemp-isa-0000
Adapter: ISA adapter
Core 0:       +31.0°C  (high = +61.0°C, crit = +71.0°C)
Core 1:       +28.0°C  (high = +61.0°C, crit = +71.0°C)
Core 9:       +25.0°C  (high = +61.0°C, crit = +71.0°C)
Core 10:      +31.0°C  (high = +61.0°C, crit = +71.0°C)

coretemp-isa-0001
Adapter: ISA adapter
Core 0:       +34.0°C  (high = +61.0°C, crit = +71.0°C)
Core 1:       +28.0°C  (high = +61.0°C, crit = +71.0°C)
Core 9:       +33.0°C  (high = +61.0°C, crit = +71.0°C)
Core 10:      +28.0°C  (high = +61.0°C, crit = +71.0°C)

 

but I would prefer to use the IPMI route!

 

Thanks for any help!

 

 

Link to comment

'ipmitool lan print' will give you the ip address. Then open try the ip in a browser. Also I've been looking at freeipmi again instead of ipmitool. I has more options

 

root@Tower:~# ipmitool lan print
Set in Progress         : Set Complete
Auth Type Support       : MD5 PASSWORD
Auth Type Enable        : Callback : MD5 PASSWORD
                        : User     : MD5 PASSWORD
                        : Operator : MD5 PASSWORD
                        : Admin    : MD5
                        : OEM      :
IP Address Source       : Static Address
IP Address              : 192.168.1.15
Subnet Mask             : 255.255.255.0
MAC Address             : 00:15:17:9e:58:f0
SNMP Community String   : INTEL
IP Header               : TTL=0x00 Flags=0x00 Precedence=0x00 TOS=0x00
BMC ARP Control         : ARP Responses Enabled, Gratuitous ARP Disabled
Gratituous ARP Intrvl   : 0.0 seconds
Default Gateway IP      : 192.168.1.1
Default Gateway MAC     : c0:3e:0f:4e:17:70
Backup Gateway IP       : 0.0.0.0
Backup Gateway MAC      : 00:00:00:00:00:00
802.1q VLAN ID          : Disabled
802.1q VLAN Priority    : 0
RMCP+ Cipher Suites     : 1,2,3,6,7,8,11,12,0
Cipher Suite Priv Max  : caaaXXaaaXXaaXX
                        :     X=Cipher Suite Unused
                        :     c=CALLBACK
                        :     u=USER
                        :     o=OPERATOR
                        :     a=ADMIN
                        :     O=OEM

 

I have tried to access 192.168.1.15 as above but I get nothing, I also tried 192.168.1.15:623 because I read somewhere that IPMI tends to act over port 623 but again get nothing. I followed a guide here: https://www.thomas-krenn.com/en/wiki/Configuring_IPMI_under_Linux_using_ipmitool. still nothing!

 

 

Finally, I downloaded SuperMicro IPMIView and I can connect to the server IPMI over 192.168.1.15, I can see some things about the server (power status, Motherboard Details, IPMI firmware version etc.), but IPMIView freezes when I try to look at sensors or use remote shell. I have read somewhere that it is likely due to java updates breaking IPMIView.

 

Does intel offer a program similar to IPMIView? I have searched and found nothing!!

 

Any other help will be much appreciated!!

Link to comment

IMHO IPMIView is way too buggy and a pain matching up versions with java versions. I'm not sure about Intel ipmi implementation. If you have a manual it might say.  It's called Intel Node Manager.

 

You could try installing freeipmi. Here's a version that's compatible with unRAID.

https://github.com/dmacias72/unRAID-plugins/raw/master/source/packages/freeipmi-1.4.11-x86_64-2.txz

Run ipmi-sensors.

If that doesn't work there's a newer 1.5.1 release that specifically mentions Intel Node Manager support for 3.0 specification. I'd have to compile it. A newer ipmitool might help too. I'll try and compile those anyway.

Link to comment

I've noticed what appears to be a fault in the latest version of the plugin.

 

It appears that Basic View and Advanced View Columns and data are mixed up. When Basic View is selected it shows Basic View Columns BUT Advanced View Data AND when Advanced View is selected it shows Advanced View Columns but Basic View Data.

 

Tested in Chrome and Safari and the issue appears in both. I am running the following OS and browser versions:

 

OS X El Capitan Version 10.11.2

Safari Version 9.0.2 (11601.3.9)

Chrome Version 48.0.2564.103 (64-bit)

 

See Screenshots:

 

Screen_Shot_2016_02_22_at_11_53_24_PM.png

 

Screen_Shot_2016_02_22_at_11_53_34_PM.png

Link to comment

Hey dmacias, I have 976 events in my event log and it takes a while to load.  What do you think of exporting older entries to a text file and deleting them from the IPMI system?  The plugin should be able to read them from the text file much faster.

I can add an archive function that would save them to your flash drive and clear the events. Ipmitool just saves it in a format I can't parse back into a table easily. You can run 'ipmitool save /boot/path-to-file/event.log' then 'ipmitool sel clear' or delete all from the webgui.

Link to comment

I've noticed what appears to be a fault in the latest version of the plugin.

 

It appears that Basic View and Advanced View Columns and data are mixed up. When Basic View is selected it shows Basic View Columns BUT Advanced View Data AND when Advanced View is selected it shows Advanced View Columns but Basic View Data.

 

Tested in Chrome and Safari and the issue appears in both. I am running the following OS and browser versions:

 

OS X El Capitan Version 10.11.2

Safari Version 9.0.2 (11601.3.9)

Chrome Version 48.0.2564.103 (64-bit)

 

See Screenshots:

 

Screen_Shot_2016_02_22_at_11_53_24_PM.png

 

Screen_Shot_2016_02_22_at_11_53_34_PM.png

I'll check it out. The switch is cookie based. So it could be a problem with browser settings.

Link to comment

I've noticed what appears to be a fault in the latest version of the plugin.

 

It appears that Basic View and Advanced View Columns and data are mixed up. When Basic View is selected it shows Basic View Columns BUT Advanced View Data AND when Advanced View is selected it shows Advanced View Columns but Basic View Data.

 

Tested in Chrome and Safari and the issue appears in both. I am running the following OS and browser versions:

 

OS X El Capitan Version 10.11.2

Safari Version 9.0.2 (11601.3.9)

Chrome Version 48.0.2564.103 (64-bit)

 

See Screenshots:

 

Screen_Shot_2016_02_22_at_11_53_24_PM.png

 

Screen_Shot_2016_02_22_at_11_53_34_PM.png

I'll check it out. The switch is cookie based. So it could be a problem with browser settings.

 

I thought so too, which is why i tried it in Chrome. I don't use Chrome on my Mac's except for the odd exception web site which Safari doesn't display properly (which TBH these days is few and far between). Safari is pretty much default apart from basic changes (such as homepage etc) BUT the settings in Chrome are just default as it is hardly every used. Indeed the previous versions of the plugin worked perfectly and nothing has changed on my clients since previous versions.

 

For kicks - just tried it on the iPad (Safari), Windows 10 VM (Edge - Default Settings), Ubuntu 15.10 VM (Firefox - Default Settings) and iPhone (Safari) too.

 

Exactly the same in each browser.

Link to comment

IMHO IPMIView is way too buggy and a pain matching up versions with java versions. I'm not sure about Intel ipmi implementation. If you have a manual it might say.  It's called Intel Node Manager.

 

You could try installing freeipmi. Here's a version that's compatible with unRAID.

https://github.com/dmacias72/unRAID-plugins/raw/master/source/packages/freeipmi-1.4.11-x86_64-2.txz

Run ipmi-sensors.

If that doesn't work there's a newer 1.5.1 release that specifically mentions Intel Node Manager support for 3.0 specification. I'd have to compile it. A newer ipmitool might help too. I'll try and compile those anyway.

 

I just tried with your build of freeipmi 1.4.11:

 

root@Tower:/tmp# ipmi-sensors
ID | Name             | Type                   | Reading    | Units | Event
1  | BB +1.1V IOH     | Voltage                | 1.08       | V     | 'OK'
2  | BB +1.1V P1 Vccp | Voltage                | 0.87       | V     | 'OK'
3  | BB +1.5V P1 DDR3 | Voltage                | N/A        | V     | N/A
4  | BB +1.8V AUX     | Voltage                | 1.78       | V     | 'OK'
5  | BB +3.3V         | Voltage                | 3.30       | V     | 'OK'
6  | BB +3.3V STBY    | Voltage                | 3.24       | V     | 'OK'
7  | BB +3.3V Vbat    | Voltage                | 2.98       | V     | 'OK'
8  | BB +5.0V         | Voltage                | 4.99       | V     | 'OK'
9  | BB +5.0V STBY    | Voltage                | 4.97       | V     | 'OK'
10 | BB +12.0V        | Voltage                | 12.25      | V     | 'OK'
11 | BB +1.35v P1 MEM | Voltage                | 1.38       | V     | 'OK'
12 | Baseboard Temp   | Temperature            | 28.00      | C     | 'OK'
13 | IOH Therm Margin | Temperature            | -33.00     | C     | 'OK'
14 | Mem P1 Thrm Mrgn | Temperature            | -43.00     | C     | 'OK'
15 | Processor 1 Fan  | Fan                    | 3200.00    | RPM   | 'OK'
16 | Processor 1A Fan | Fan                    | 640.00     | RPM   | 'At or Below (<=) Lower Critical Threshold'
17 | Processor 2 Fan  | Fan                    | 3136.00    | RPM   | 'OK'
18 | P1 Therm Margin  | Temperature            | -44.00     | C     | 'OK'
19 | P1 Therm Ctrl %  | Temperature            | 0.00       | %     | 'OK'
20 | Pwr Unit Status  | Power Unit             | N/A        | N/A   | 'OK'
21 | IPMI Watchdog    | Watchdog 2             | N/A        | N/A   | 'OK'
22 | Physical Scrty   | Physical Security      | N/A        | N/A   | 'OK'
23 | FP NMI Diag Int  | Critical Interrupt     | N/A        | N/A   | 'OK'
24 | SMI Timeout      | OEM Reserved           | N/A        | N/A   | 'OK'
25 | System Event Log | Event Logging Disabled | N/A        | N/A   | 'Log Area Reset/Cleared'
26 | System Event     | System Event           | N/A        | N/A   | 'OK'
27 | P1 Status        | Processor              | N/A        | N/A   | 'Processor Presence detected'
28 | P1 VRD Hot       | Temperature            | N/A        | N/A   | 'OK'
29 | CATERR           | Processor              | N/A        | N/A   | 'OK'
30 | CPU Missing      | Processor              | N/A        | N/A   | 'OK'
31 | IOH Therm Trip   | Temperature            | N/A        | N/A   | 'OK'
32 | NM Capabilities  | OEM Reserved           | N/A        | N/A   | N/A

 

No further luck with CPU temps!!

 

Note: nevermind the error with "Processor 1A Fan", that fan header is connected to a fan controller so incorrect speeds are reported!!

 

Thanks for this help so far!

Link to comment

The thresholds for the fan are probably not set or incorrect. From what I have read the temp for the processor may be one of the negative Therm Margins. Where 0 is actually 80C or whatever your processor limit is. So a negative -30C will be 50C.

See if that corresponds with the coretemp.

Link to comment

The thresholds for the fan are probably not set or incorrect. From what I have read the temp for the processor may be one of the negative Therm Margins. Where 0 is actually 80C or whatever your processor limit is. So a negative -30C will be 50C.

See if that corresponds with the coretemp.

 

I ran ipmi-sensors and sensors one after the other and got the following:

 

root@Tower:/tmp# ipmi-sensors
Caching SDR repository information: /root/.freeipmi/sdr-cache/sdr-cache-Tower.localhost
Caching SDR record 146 of 146 (current record ID 146)
ID | Name             | Type                   | Reading    | Units | Event
1  | BB +1.1V IOH     | Voltage                | 1.08       | V     | 'OK'
2  | BB +1.1V P1 Vccp | Voltage                | 0.93       | V     | 'OK'
3  | BB +1.5V P1 DDR3 | Voltage                | N/A        | V     | N/A
4  | BB +1.8V AUX     | Voltage                | 1.78       | V     | 'OK'
5  | BB +3.3V         | Voltage                | 3.30       | V     | 'OK'
6  | BB +3.3V STBY    | Voltage                | 3.24       | V     | 'OK'
7  | BB +3.3V Vbat    | Voltage                | 2.98       | V     | 'OK'
8  | BB +5.0V         | Voltage                | 4.99       | V     | 'OK'
9  | BB +5.0V STBY    | Voltage                | 4.97       | V     | 'OK'
10 | BB +12.0V        | Voltage                | 12.25      | V     | 'OK'
11 | BB +1.35v P1 MEM | Voltage                | 1.37       | V     | 'OK'
12 | Baseboard Temp   | Temperature            | 28.00      | C     | 'OK'
13 | IOH Therm Margin | Temperature            | -33.00     | C     | 'OK'
14 | Mem P1 Thrm Mrgn | Temperature            | -42.00     | C     | 'OK'
15 | Processor 1 Fan  | Fan                    | 3200.00    | RPM   | 'OK'
16 | Processor 1A Fan | Fan                    | 640.00     | RPM   | 'At or Below (<=) Lower Critical Threshold'
17 | Processor 2 Fan  | Fan                    | 3200.00    | RPM   | 'OK'
18 | P1 Therm Margin  | Temperature            | -44.00     | C     | 'OK'
19 | P1 Therm Ctrl %  | Temperature            | 0.00       | %     | 'OK'
20 | Pwr Unit Status  | Power Unit             | N/A        | N/A   | 'OK'
21 | IPMI Watchdog    | Watchdog 2             | N/A        | N/A   | 'OK'
22 | Physical Scrty   | Physical Security      | N/A        | N/A   | 'OK'
23 | FP NMI Diag Int  | Critical Interrupt     | N/A        | N/A   | 'OK'
24 | SMI Timeout      | OEM Reserved           | N/A        | N/A   | 'OK'
25 | System Event Log | Event Logging Disabled | N/A        | N/A   | 'OK'
26 | System Event     | System Event           | N/A        | N/A   | 'OK'
27 | P1 Status        | Processor              | N/A        | N/A   | 'Processor Presence detected'
28 | P1 VRD Hot       | Temperature            | N/A        | N/A   | 'OK'
29 | CATERR           | Processor              | N/A        | N/A   | 'OK'
30 | CPU Missing      | Processor              | N/A        | N/A   | 'OK'
31 | IOH Therm Trip   | Temperature            | N/A        | N/A   | 'OK'
32 | NM Capabilities  | OEM Reserved           | N/A        | N/A   | N/A
root@Tower:/tmp# sensors
coretemp-isa-0000
Adapter: ISA adapter
CPU 1 Core 1:  +26.0°C  (high = +61.0°C, crit = +71.0°C)
CPU 1 Core 2:  +25.0°C  (high = +61.0°C, crit = +71.0°C)
CPU 1 Core 3:  +22.0°C  (high = +61.0°C, crit = +71.0°C)
CPU 1 Core 4:  +26.0°C  (high = +61.0°C, crit = +71.0°C)

coretemp-isa-0001
Adapter: ISA adapter
CPU 2 Core 1:  +28.0°C  (high = +61.0°C, crit = +71.0°C)
CPU 2 Core 2:  +24.0°C  (high = +61.0°C, crit = +71.0°C)
CPU 2 Core 3:  +28.0°C  (high = +61.0°C, crit = +71.0°C)
CPU 2 Core 4:  +24.0°C  (high = +61.0°C, crit = +71.0°C)

 

I have also looked online for a build of freeipmi v1.5.1 for slackware but I cannot find one anywhere to see if any further info given.

Link to comment

Join the conversation

You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.

Guest
Reply to this topic...

×   Pasted as rich text.   Restore formatting

  Only 75 emoji are allowed.

×   Your link has been automatically embedded.   Display as a link instead

×   Your previous content has been restored.   Clear editor

×   You cannot paste images directly. Upload or insert images from URL.