Friday, April 24, 2009

HP Blade Server rebooting for no apparent reason

Just a bit less than a week after applying an HP Proliant Support pack to our BL460c G5 blades one of them running Windows 2003 begain rebooting randomly. The only real errors I could find were in the System log. These errors are about Event Source: hpqilo2 with an Event ID: 57. They had to do with Timeouts causing a ASR (reboot)

Description: The system has rebooted from a Automatic Server Recovery (ASR) event.
ProbableCause: 111 0x6f (Timeout)
ProbableCauseDescription: "ASR Reboot Occurred"

I did a full hardware swap since everything I found on google pointed to hardware. After that didn't resolve anything, I found that there is a new driver for HP ProLiant iLO 2 Management Controller Driver available. One of the known fixes is
"Resolved a problem where system could spontaneously reboot (ASR) if all CPU's were under continuous 100% load, and iLO 2 was reset (e.g. due to firmware update, changes to network settings, etc.)."

http://h20000.www2.hp.com/bizsupport/TechSupport/SoftwareDescription.jsp?lang=en&cc=us&prodTypeId=3709945&prodSeriesId=3808910&swItem=MTX-b016a4092d95486b88c4ebe86d&prodNameId=3808911&swEnvOID=1113&swLang=13&taskId=135&mode=4&idx=2

This looks like it could be our issue, only time will tell, but if history is my guide then I should see within 48 hours if this machine will be stable or keep blowing up.

5 comments:

g said...

Can you tell me if this was your issue? I am having the same problem with BL460C G6 servers. There do seem to be quite a few updates available on the HP support site for the blades.

ALEX said...

hi i have the same problem with servers bl465 and bl460 with windows server 2003 32 and 64bit y install than patch but some of servers still rebooting, the only servers that no restart has windows server 2008.
i dont know what happend.

Ian Witts said...

An update from HP on the 13th April fixed this for us. The details are here:

Critical Customer Advisory
Advisory: (Revision) Integrated Lights-Out 2 (iLO 2) Firmware Version 1.81 (Or Earlier) And iLO 2 Management Controller Driver Version 1.11.1.0 (Or Earlier) - ProLiant Server May Unexpectedly Reboot And Display Event ID 57 Error Messages (c01802766)

Products: ProLiant Servers, BladeSystem, Storage, Software, Desktops & Workstations
OS: not applicable

Description: Advisory: (Revision) Integrated Lights-Out 2 (iLO 2) Firmware Version 1.81 (Or Earlier) And iLO 2 Management Controller Driver Version 1.11.1.0 (Or Earlier) - ProLiant Server May Unexpectedly Reboot And Display Event ID 57 Error Messages

http://alerts.hp.com/r?2.1.3KT.2ZR.15m1iU.ERodnS..N.U1kq.39kg.bW89MQ%5f%5fDITeFSF0

andygen said...

HP BladeSystem Matrix using HP ProLiant BLS) Server powered by Intel® Xeon® Processor 5600*, your business benefits from shared services and a private cloud environment with the industry’s first integrated infrastructure platform, enabling you to reduce capital costs and energy consumption. Even better, you can more efficiently utilize the talent of your server administration teams for business innovation, rather than for operations and maintenance.

Detail article at:
http://www.kidbuxblog.com/talk-technology/hp-bladesystem-and-matrix-tco

valli p said...




Thank you so much! That did the trick, you saved me more endless hours of searching for a fix.



HP server