IBM NeXtScale nx360 M4 Installation and maintenance instructions

IBM NeXtScale nx360 M4
Type 5455
Installation and Service Guide


Note
Before using this information and the product it supports, read the general information in Appendix D, “Getting help and
technical assistance,” and “Notices,” the Warranty Information document, and the Safety Information and
Environmental Notices and User Guide documents on the IBM Documentation CD.
Fourth Edition (June 2014)
© Copyright IBM Corporation 2014.
US Government Users Restricted Rights – Use, duplication or disclosure restricted by GSA ADP Schedule Contract
with IBM Corp.

Contents
Safety
  Guidelines for trained service technicians
  Inspecting for unsafe conditions
  Guidelines for servicing electrical equipment
  Safety statements
Chapter 1. The IBM NeXtScale nx360 M4 Compute Node Type 5455
  The IBM Documentation CD
  Hardware and software requirements
  The Documentation Browser
  Related documentation
  Notices and statements in this document
  Features and specifications
  What your compute node offers
  Reliability, availability, and serviceability features
  Major components of the compute node
  Major components of the storage tray
  Major components of the GPU tray
  Power, controls, and indicators
  Compute node controls, connectors, and LEDs
  Console breakout cable
  Turning on the compute node
  Turning off the compute node
  System-board layouts
  System-board internal connectors
  System-board external connectors
  System-board switches and jumpers
  System-board LEDs and controls
Chapter 2. Configuration information and instructions
  Updating the firmware
  Configuring the server
  Using the ServerGuide Setup and Installation CD
  Using the Setup utility
  Using the Boot Manager
  Starting the backup server firmware
  The UpdateXpress System Pack Installer
  Changing the Power Policy option to the default settings after loading UEFI defaults
  Using the integrated management module
  Using the remote presence and blue-screen capture features
  Using the embedded hypervisor
  Configuring the Ethernet controller
  Enabling Features on Demand Ethernet software
  Enabling Features on Demand RAID software
  Configuring RAID arrays
  IBM Advanced Settings Utility program
  Updating IBM Systems Director
  Updating the Universal Unique Identifier (UUID)
  Updating the DMI/SMBIOS data
Chapter 3. Troubleshooting
  Start here
  Diagnosing a problem
  Undocumented problems
  Service bulletins
  Checkout procedure
  About the checkout procedure
  Performing the checkout procedure
  Diagnostic tools
  Power-supply LEDs
  System pulse LEDs
  Event logs
  POST
  IBM Dynamic System Analysis
  Automated service request (call home)
  IBM Electronic Service Agent
  Error messages
  Error messages
  Troubleshooting by symptom
  General problems
  Hard disk drive problems
  Hypervisor problems
  Intermittent problems
  Keyboard, mouse, or USB-device problems
  Memory problems
  Microprocessor problems
  Monitor and video problems
  Network connection problems
  Optional-device problems
  Power problems
  Serial-device problems
  ServerGuide problems
  Software problems
  Universal Serial Bus (USB) port problems
  Video problems
  Solving power problems
  Solving Ethernet controller problems
  Solving undetermined problems
  Problem determination tips
  Recovering the server firmware (UEFI update failure)
  In-band manual recovery method
  In-band automated boot recovery method
  Out-of-band method
  Automated boot recovery (ABR)
  Nx-boot failure
Chapter 4. Parts listing, IBM NeXtScale nx360 M4 Compute Node Type 5455
  Replaceable server components
  Structural parts
  Power cords
Chapter 5. Removing and replacing components
  Installation tools
  Installing an optional device
  Installation guidelines
  System reliability guidelines
  Handling static-sensitive devices
  Returning a device or component
  Updating the compute node configuration
  Removing a compute node from a chassis
  Installing a compute node in a chassis
  Removing a storage tray from a compute node
  Installing a storage tray into a compute node
  Removing a GPU tray from a compute node
  Installing a GPU tray into a compute node
  Removing and replacing structural parts
  Removing the compute node cover
  Installing the compute node cover
  Removing the air baffle
  Replacing the air baffle
  Removing a RAID adapter battery holder
  Replacing a RAID adapter battery holder
  Removing the PCI riser filler
  Replacing the PCI riser filler
  Removing the filler from the GPU tray
  Replacing the filler on to the GPU tray
  Removing the front handle
  Installing the front handle
  Removing the hard disk drive cage
  Installing the hard disk drive cage
  Removing and replacing Tier 1 CRUs
  Removing the operator information panel
  Installing the operator information panel
  Removing the power paddle card from the GPU tray
  Replacing the power paddle card on to the GPU tray
  Removing the system battery
  Replacing the system battery
  Removing a memory module
  Installing a memory module
  Removing the optional 3.5-inch hard disk drive hardware RAID cage
  Installing the optional 3.5-inch hard disk drive hardware RAID cage
  Removing the hard disk drive backplate
  Installing the hard disk drive backplate
  Removing and installing drives
  Removing a PCI riser-cage assembly
  Replacing a PCI riser-cage assembly
  Removing a PCI riser-cage assembly in the GPU tray
  Replacing a PCI riser-cage assembly in the GPU tray
  Removing an adapter/GPU adapter
  Replacing an adapter/GPU adapter
  Removing the USB flash drive
  Installing the USB flash drive
  Removing and replacing Tier 2 CRUs
  Removing a microprocessor and heat sink
  Replacing a microprocessor and heat sink
  Removing the compute node
  Installing the compute node
  Internal cable routing and connectors
  Cabling hard disk drive with software RAID signal cable
  Cabling hard disk drive with ServeRAID SAS/SATA controller
Appendix A. Integrated Management Module II (IMM2) error messages
Appendix B. UEFI (POST) error codes
Appendix C. DSA diagnostic test results
  DSA Broadcom network test results
  DSA Brocade test results
  DSA checkpoint panel test results
  DSA CPU stress test results
  DSA Emulex adapter test results
  DSA EXA port ping test results
  DSA hard drive test results
  DSA Intel network test results
  DSA LSI hard drive test results
  DSA Mellanox adapter test results
  DSA memory isolation test results
  DSA memory stress test results
  DSA Nvidia GPU test results
  DSA optical drive test results
  DSA system management test results
  DSA tape drive test results
Appendix D. Getting help and technical assistance
  Before you call
  Using the documentation
  Getting help and information from the World Wide Web
  How to send DSA data to IBM
  Creating a personalized support web page
  Software service and support
  Hardware service and support
  IBM Taiwan product service
Notices
  Trademarks
  Important notes
  Particulate contamination
  Documentation format
  Telecommunication regulatory statement
  Electronic emission notices
  Federal Communications Commission (FCC) statement
  Industry Canada Class A emission compliance statement
  Avis de conformité à la réglementation d'Industrie Canada
  Australia and New Zealand Class A statement
  European Union EMC Directive conformance statement
  Germany Class A statement
  Japan VCCI Class A statement
  Japan Electronics and Information Technology Industries Association (JEITA) statement
  Korea Communications Commission (KCC) statement
  Russia Electromagnetic Interference (EMI) Class A statement
  People's Republic of China Class A electronic emission statement
  Taiwan Class A compliance statement
  German Ordinance for Work gloss statement
Index

Safety
Before installing this product, read the Safety Information.
Antes de instalar este produto, leia as Informações de Segurança.
Læs sikkerhedsforskrifterne, før du installerer dette produkt.
Lees voordat u dit product installeert eerst de veiligheidsvoorschriften.
Ennen kuin asennat tämän tuotteen, lue turvaohjeet kohdasta Safety Information.
Avant d'installer ce produit, lisez les consignes de sécurité.
Vor der Installation dieses Produkts die Sicherheitshinweise lesen.
Prima di installare questo prodotto, leggere le Informazioni sulla Sicurezza.
Les sikkerhetsinformasjonen (Safety Information) før du installerer dette produktet.
Antes de instalar este produto, leia as Informações sobre Segurança.
Antes de instalar este producto, lea la información de seguridad.
Läs säkerhetsinformationen innan du installerar den här produkten.
Bu ürünü kurmadan önce güvenlik bilgilerini okuyun.
Guidelines for trained service technicians
This section contains information for trained service technicians.
Inspecting for unsafe conditions
Use this information to help you identify potential unsafe conditions in an IBM®
product that you are working on.
Each IBM product, as it was designed and manufactured, has required safety items
to protect users and service technicians from injury. The information in this section
addresses only those items. Use good judgment to identify potential unsafe
conditions that might be caused by non-IBM alterations or attachment of non-IBM
features or optional devices that are not addressed in this section. If you identify
an unsafe condition, you must determine how serious the hazard is and whether
you must correct the problem before you work on the product.
Consider the following conditions and the safety hazards that they present:
• Electrical hazards, especially primary power. Primary voltage on the frame can
  cause serious or fatal electrical shock.
• Explosive hazards, such as a damaged CRT face or a bulging capacitor.
• Mechanical hazards, such as loose or missing hardware.
To inspect the product for potential unsafe conditions, complete the following
steps:
1. Make sure that the power is off and the power cords are disconnected.
2. Make sure that the exterior cover is not damaged, loose, or broken, and observe
any sharp edges.
3. Check the power cords:
   • Make sure that the third-wire ground connector is in good condition. Use a
     meter to measure third-wire ground continuity for 0.1 ohm or less between
     the external ground pin and the frame ground.
   • Make sure that the power cords are the correct type.
   • Make sure that the insulation is not frayed or worn.
4. Remove the cover.
5. Check for any obvious non-IBM alterations. Use good judgment as to the safety
of any non-IBM alterations.
6. Check inside the system for any obvious unsafe conditions, such as metal
filings, contamination, water or other liquid, or signs of fire or smoke damage.
7. Check for worn, frayed, or pinched cables.
8. Make sure that the power-supply cover fasteners (screws or rivets) have not
been removed or tampered with.
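The ground-continuity criterion in step 3 (0.1 ohm or less between the external ground pin and the frame ground) can be recorded as a simple pass/fail check. The following is a minimal sketch only; the function and constant names are hypothetical and are not part of any IBM tool. It does not replace the meter measurement itself.

```python
# Hypothetical helper for logging the step 3 ground-continuity reading.
# The 0.1 ohm limit comes from the inspection procedure; all names are illustrative.
GROUND_CONTINUITY_LIMIT_OHMS = 0.1


def ground_continuity_ok(measured_ohms: float) -> bool:
    """Return True when the measured third-wire ground resistance is within the limit."""
    if measured_ohms < 0:
        raise ValueError("resistance cannot be negative")
    return measured_ohms <= GROUND_CONTINUITY_LIMIT_OHMS


if __name__ == "__main__":
    # Example readings in ohms, as they might be read from a meter.
    for reading in (0.05, 0.1, 0.25):
        status = "pass" if ground_continuity_ok(reading) else "fail"
        print(f"{reading} ohm: {status}")
```

A reading that fails this check means the power cord should not be used until the ground path is corrected.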
Guidelines for servicing electrical equipment
Observe these guidelines when you service electrical equipment.
• Check the area for electrical hazards such as moist floors, nongrounded power
  extension cords, and missing safety grounds.
• Use only approved tools and test equipment. Some hand tools have handles that
  are covered with a soft material that does not provide insulation from live
  electrical current.
• Regularly inspect and maintain your electrical hand tools for safe operational
  condition. Do not use worn or broken tools or testers.
• Do not touch the reflective surface of a dental mirror to a live electrical circuit.
  The surface is conductive and can cause personal injury or equipment damage if
  it touches a live electrical circuit.
• Some rubber floor mats contain small conductive fibers to decrease electrostatic
  discharge. Do not use this type of mat to protect yourself from electrical shock.
• Do not work alone under hazardous conditions or near equipment that has
  hazardous voltages.
• Locate the emergency power-off (EPO) switch, disconnecting switch, or electrical
  outlet so that you can turn off the power quickly in the event of an electrical
  accident.
• Disconnect all power before you perform a mechanical inspection, work near
  power supplies, or remove or install main units.
• Before you work on the equipment, disconnect the power cord. If you cannot
  disconnect the power cord, have the customer power off the wall box that
  supplies power to the equipment and lock the wall box in the off position.
• Never assume that power has been disconnected from a circuit. Check it to
  make sure that it has been disconnected.
• If you have to work on equipment that has exposed electrical circuits, observe
  the following precautions:
  – Make sure that another person who is familiar with the power-off controls is
    near you and is available to turn off the power if necessary.
  – When you work with powered-on electrical equipment, use only one hand.
    Keep the other hand in your pocket or behind your back to avoid creating a
    complete circuit that could cause an electrical shock.
  – When you use a tester, set the controls correctly and use the approved probe
    leads and accessories for that tester.
  – Stand on a suitable rubber mat to insulate you from grounds such as metal
    floor strips and equipment frames.
• Use extreme care when you measure high voltages.
• To ensure proper grounding of components such as power supplies, pumps,
  blowers, fans, and motor generators, do not service these components outside of
  their normal operating locations.
• If an electrical accident occurs, use caution, turn off the power, and send another
  person to get medical aid.
Safety statements
These statements provide the caution and danger information that is used in this
documentation.
Important:
Each caution and danger statement in this documentation is labeled with a
number. This number is used to cross reference an English-language caution or
danger statement with translated versions of the caution or danger statement in
the Safety Information document.
For example, if a caution statement is labeled Statement 1, translations for that
caution statement are in the Safety Information document under Statement 1.
Be sure to read all caution and danger statements in this documentation before you
perform the procedures. Read any additional safety information that comes with
your system or optional device before you install the device.
Statement 1
DANGER
Electrical current from power, telephone, and communication cables is
hazardous.
To avoid a shock hazard:
• Do not connect or disconnect any cables or perform installation,
  maintenance, or reconfiguration of this product during an electrical storm.
• Connect all power cords to a properly wired and grounded electrical outlet.
• Connect to properly wired outlets any equipment that will be attached to
  this product.
• When possible, use one hand only to connect or disconnect signal cables.
• Never turn on any equipment when there is evidence of fire, water, or
  structural damage.
• Disconnect the attached power cords, telecommunications systems,
  networks, and modems before you open the device covers, unless
  instructed otherwise in the installation and configuration procedures.
• Connect and disconnect cables as described in the following table when
  installing, moving, or opening covers on this product or attached devices.
To Connect:
1. Turn everything OFF.
2. First, attach all cables to devices.
3. Attach signal cables to connectors.
4. Attach power cords to outlet.
5. Turn device ON.
To Disconnect:
1. Turn everything OFF.
2. First, remove power cords from outlet.
3. Remove signal cables from connectors.
4. Remove all cables from devices.
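The connect and disconnect sequences above are strictly ordered, so a service checklist can verify that no step is skipped or performed out of turn. The following is a minimal sketch, assuming the steps are tracked as plain text labels; the names CONNECT_ORDER, DISCONNECT_ORDER, and follows_order are hypothetical and do not come from any IBM tool.

```python
# Illustrative encoding of the two cable-handling sequences from Statement 1.
# All identifiers here are assumptions made for this sketch.
CONNECT_ORDER = (
    "Turn everything OFF",
    "Attach all cables to devices",
    "Attach signal cables to connectors",
    "Attach power cords to outlet",
    "Turn device ON",
)

DISCONNECT_ORDER = (
    "Turn everything OFF",
    "Remove power cords from outlet",
    "Remove signal cables from connectors",
    "Remove all cables from devices",
)


def follows_order(performed, reference):
    """Return True when `performed` is a prefix of `reference`.

    A prefix means every step so far was done in the documented order,
    with nothing skipped and nothing reordered.
    """
    performed = tuple(performed)
    return performed == tuple(reference)[: len(performed)]


if __name__ == "__main__":
    done = ["Turn everything OFF", "Remove power cords from outlet"]
    print(follows_order(done, DISCONNECT_ORDER))
    skipped = ["Remove signal cables from connectors"]
    print(follows_order(skipped, DISCONNECT_ORDER))
```

The prefix comparison is deliberately strict: removing signal cables before the power cords is reported as out of order, which matches the intent of the table.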
Statement 2
CAUTION:
When replacing the lithium battery, use only IBM Part Number 33F8354 or an
equivalent type battery recommended by the manufacturer. If your system has a
module containing a lithium battery, replace it only with the same module type
made by the same manufacturer. The battery contains lithium and can explode if
not properly used, handled, or disposed of.
Do not:
• Throw or immerse into water
• Heat to more than 100°C (212°F)
• Repair or disassemble
Dispose of the battery as required by local ordinances or regulations.
Statement 3
CAUTION:
When laser products (such as CD-ROMs, DVD drives, fiber optic devices, or
transmitters) are installed, note the following:
• Do not remove the covers. Removing the covers of the laser product could
  result in exposure to hazardous laser radiation. There are no serviceable parts
  inside the device.
• Use of controls or adjustments or performance of procedures other than those
  specified herein might result in hazardous radiation exposure.
DANGER
Some laser products contain an embedded Class 3A or Class 3B laser diode.
Note the following.
Laser radiation when open. Do not stare into the beam, do not view directly
with optical instruments, and avoid direct exposure to the beam.
Class 1 Laser Product
Laser Klasse 1
Laser Klass 1
Luokan 1 Laserlaite
Appareil A Laser de Classe 1
Statement 4
CAUTION:
Use safe practices when lifting.
≥18 kg (39.7 lb) ≥32 kg (70.5 lb) ≥55 kg (121.2 lb)
Statement 5
CAUTION:
The power control button on the device and the power switch on the power
supply do not turn off the electrical current supplied to the device. The device
also might have more than one power cord. To remove all electrical current from
the device, ensure that all power cords are disconnected from the power source.
Statement 6
CAUTION:
If you install a strain-relief bracket option over the end of the power cord that is
connected to the device, you must connect the other end of the power cord to an
easily accessible power source.
Statement 8
CAUTION:
Never remove the cover on a power supply or any part that has the following
label attached.
Hazardous voltage, current, and energy levels are present inside any component
that has this label attached. There are no serviceable parts inside these
components. If you suspect a problem with one of these parts, contact a service
technician.
Statement 12
CAUTION:
The following label indicates a hot surface nearby.
Statement 26
CAUTION:
Do not place any object on top of rack-mounted devices.
Statement 27
CAUTION:
Hazardous moving parts are nearby.
Rack Safety Information, Statement 2
DANGER
• Always lower the leveling pads on the rack cabinet.
• Always install stabilizer brackets on the rack cabinet.
• Always install servers and optional devices starting from the bottom of the
  rack cabinet.
• Always install the heaviest devices in the bottom of the rack cabinet.

Chapter 1. The IBM NeXtScale nx360 M4 Compute Node Type 5455
The IBM NeXtScale nx360 M4 Compute Node Type 5455 is a high-availability,
scalable compute node that is optimized to support the next-generation
microprocessor technology and is ideally suited for medium and large businesses.
The IBM NeXtScale nx360 M4 Compute Node Type 5455 is supported in the IBM
NeXtScale n1200 Enclosure only.
This documentation provides the following information about setting up and
troubleshooting the compute node:
• Starting and configuring the compute node
• Installing the operating system
• Diagnosing problems
• Installing, removing, and replacing components
Packaged with the compute node are software CDs that help you configure
hardware, install device drivers, and install the operating system.
If firmware and documentation updates are available, you can download them
from the IBM website. The server might have features that are not described in the
documentation that comes with the server. The documentation might be
updated occasionally to include information about those features, and technical
updates might be available to provide additional information that is not included
in the server documentation. To check for updates, go to
http://www.ibm.com/supportportal.
The compute node comes with a limited warranty. For information about the terms
of the warranty and getting service and assistance, see the Warranty Information
document for your compute node.
You can download the IBM ServerGuide Setup and Installation CD to help you
configure the hardware, install device drivers, and install the operating system.
For a list of supported optional devices for the server, see
http://www.ibm.com/systems/info/x86servers/serverproven/compat/us.
See the Rack Installation Instructions document on the IBM System x Documentation
CD for complete rack installation and removal instructions.
You can obtain up-to-date information about the server and other IBM server
products at http://www.ibm.com/systems/x. At http://www.ibm.com/supportportal,
you can create a personalized support page by identifying IBM
products that are of interest to you. From this personalized page, you can subscribe
to weekly email notifications about new technical documents, search for
information and downloads, and access various administrative services.
The compute node might have features that are not described in the
documentation that comes with the compute node. The documentation might be
updated occasionally to include information about those features. Technical
updates might also be available to provide additional information that is not
included in the compute node documentation. To obtain the most up-to-date
documentation for this product, go to
http://publib.boulder.ibm.com/infocenter/flexsys/information/index.jsp.
You can subscribe to information updates that are specific to your compute node at
http://www.ibm.com/support/mynotifications.
The model number and serial number are on the ID label on the bezel on the front
of the compute node, and on a label on the bottom of the compute node that is
visible when the compute node is not in the IBM NeXtScale n1200 Enclosure. If the
compute node comes with an RFID tag, the RFID tag covers the ID label on the
bezel on the front of the compute node, but you can open the RFID tag to see the
ID label behind it.
Figure 1. NeXtScale nx360 M4 compute node (callout: node serial number)
Note: The illustrations in this document might differ slightly from your hardware.
In addition, the system service label, which is on the cover of the server, provides a
QR code for mobile access to service information. You can scan the QR code with
a QR code reader on a mobile device to get quick access to the
IBM Service Information website. The website provides additional information,
such as parts installation and replacement videos and error codes for server support.
The following illustration shows the QR code (http://ibm.co/1hrOZP0):
Figure 2. QR code
The IBM Documentation CD
The IBM Documentation CD contains documentation for the server in Portable
Document Format (PDF) and includes the IBM Documentation Browser to help
you find information quickly.