Problem
The server is unable to complete Power On Self Test (POST) and powers off with Light Path Diagnostics (LPD) BRD and FAN Light Emitting Diodes (LEDs) lit. Logged events from the Integrated Management Module (IMM) shows the following.
1. I — 12/22/2011:14:18:45 — «Host Power» has been turned off
2. E — 12/22/2011:14:18:45 — Non-redundant:Insufficient Resources for «Cooling Zone 3» has asserted
3. E — 12/22/2011:14:18:40 — Non-redundant:Insufficient Resources for «Cooling Zone 1» has asserted
4. E — 12/22/2011:14:14:7 — Redundancy Lost for «Cooling Zone 2» has asserted
5. E — 12/22/2011:14:14:1 — Redundancy Lost for «Cooling Zone 3» has asserted
6. E — 12/22/2011:14:14:0 — Redundancy Lost for «Cooling Zone 1» has asserted
Resolving The Problem
Source
RETAIN tip: H205178
Symptom
The server is unable to complete Power On Self Test (POST) and powers off with Light Path Diagnostics (LPD) BRD and FAN Light Emitting Diodes (LEDs) lit. Logged events from the Integrated Management Module (IMM) shows the following.
|
Affected configurations
The system may be any of the following IBM servers:
- System x3550 M3, type 7944, any model
- System x3650 M3, type 7945, any model
This tip is not software specific.
This tip is not option specific.
Workaround
The fix for this issue is to check the number of PCI Riser card assemblies installed in the server and make sure there is a PCI Riser card installed in connector 2.
Additional information
The root cause for this issue is a PCI Riser card assembly must be installed in connector 2, even if you do not have an option adapter to install.
To identify this issue, when the BRD LED is lit and all cooling zones are asserted at the same time, remove the server cover and check if PCI Riser card 2 LED is on.
Document Location
Worldwide
Operating System
System x:Operating system independent / None
[{«Type»:»HW»,»Business Unit»:{«code»:»BU016″,»label»:»Multiple Vendor Support»},»Product»:{«code»:»QU04SLL»,»label»:»System x->System x3650 M3->7945″},»Platform»:[{«code»:»PF025″,»label»:»Platform Independent»}],»Line of Business»:{«code»:»»,»label»:»»}},{«Type»:»HW»,»Business Unit»:{«code»:»BU016″,»label»:»Multiple Vendor Support»},»Product»:{«code»:»QU04SMA»,»label»:»System x->System x3550 M3->7944″},»Platform»:[{«code»:»PF025″,»label»:»Platform Independent»}],»Line of Business»:{«code»:»»,»label»:»»}}]
About Lenovo
-
Our Company
-
News
-
Investor Relations
-
Sustainability
-
Product Compliance
-
Product Security
-
Lenovo Open Source
-
Legal Information
-
Jobs at Lenovo
Shop
-
Laptops & Ultrabooks
-
Tablets
-
Desktops & All-in-Ones
-
Workstations
-
Accessories & Software
-
Servers
-
Storage
-
Networking
-
Laptop Deals
-
Outlet
Support
-
Drivers & Software
-
How To’s
-
Warranty Lookup
-
Parts Lookup
-
Contact Us
-
Repair Status Check
-
Imaging & Security Resources
Resources
-
Where to Buy
-
Shopping Help
-
Sales Order Status
-
Product Specifications (PSREF)
-
Forums
-
Registration
-
Product Accessibility
-
Environmental Information
-
Gaming Community
-
LenovoEDU Community
-
LenovoPRO Community
©
Lenovo.
|
|
|
|
The following table describes the error codes that the diagnostic programs might
generate and suggested actions to correct the detected problems.
If the diagnostic programs generate error codes that are not listed in the table,
make sure that the latest levels of BIOS, Remote Supervisor Adapter II SlimLine,
and ServeRAID code are installed.
In the error codes, x can be any numeral or letter. However, if the three-digit
number in the central position of the code is 000, 195, or 197, do not replace a
CRU or FRU. These numbers appearing in the central position of the code have the
following meanings:
000
195
197
v Follow the suggested actions in the order in which they are listed in the Action column until the problem
is solved.
v See Chapter 3, «Parts listing, Type 7978 and 1913 server,» on page 29 to determine which components are
customer replaceable units (CRU) and which components are field replaceable units (FRU).
v If an action step is preceded by «(Trained service technician only),» that step must be performed only by a
trained service technician.
Error code
Description
001-250-000
Failed microprocessor board ECC.
001-xxx-000
Failed core tests.
001-xxx-001
Failed core tests.
001-292-000
Failed microprocessor board ECC.
005-xxx-000
Failed video test.
011-xxx-000
Failed COM1 serial port test.
144
IBM System x3550 Type 7978 and 1913: Problem Determination and Service Guide
The server passed the test. Do not replace a CRU or FRU.
The Esc key was pressed to end the test. Do not replace a CRU or FRU.
This is a warning error, but it does not indicate a hardware failure; do not
replace a CRU or FRU. Take the action that is indicated in the Action
column but do not replace a CRU or a FRU. See the description of
Warning in «Diagnostic text messages» on page 143 for more information.
Action
1. Check the system-error log and the BMC log for
messages that indicate the cause of the error
(see «Error logs» on page 107).
2. From the diagnostic programs, run Quick Memory
Test All Banks (see «Running the diagnostic
programs» on page 142).
3. From the diagnostic programs, run the ECC test
again (see «Running the diagnostic programs» on
page 142).
4. (Trained service technician only) Replace the
system board.
(Trained service technician only) Replace the system
board.
(Trained service technician only) Replace the system
board.
Load BIOS code defaults and run the test again.
1. Reseat the optional video adapter, if one is
installed.
2. (Trained service technician only) Replace the
system board.
1. Check the loopback plug that is connected to the
serial port.
2. (Trained service technician only) Replace the
system board.
