
Contents
About This Document................................................................................................................ ii
1 Safety Instructions.................................................................................................................. 1
2 Troubleshooting Process........................................................................................................ 5
3 Preparing for Troubleshooting............................................................................................. 6
4 Collecting Information.........................................................................................................10
4.1 Collecting Basic Information............................................................................................................................................. 10
4.2 Collecting OS Logs................................................................................................................................................................11
4.3 Collecting Hardware Logs.................................................................................................................................................. 12
5 Diagnosing and Rectifying Faults......................................................................................13
5.1 Fault Diagnosis Rules.......................................................................................................................................................... 13
5.2 Using Tools to Diagnose Faults........................................................................................................................................14
5.3 Handling Alarms................................................................................................................................................................... 14
5.4 Using Error Codes to Locate Faults................................................................................................................................ 15
5.5 Using Indicators to Locate Faults.................................................................................................................................... 16
5.6 Handling Faults Based on Symptoms............................................................................................................................ 41
5.6.1 Power Failures.................................................................................................................................................................... 42
5.6.2 KVM Login Faults.............................................................................................................................................................. 46
5.6.3 POST Faults......................................................................................................................................................................... 49
5.6.4 Memory Faults....................................................................................................................................................................53
5.6.5 Drive I/O Faults.................................................................................................................................................................. 55
5.6.6 Ethernet Controller Faults.............................................................................................................................................. 58
5.6.7 OS Faults.............................................................................................................................................................................. 63
6 Software and Firmware Upgrade...................................................................................... 68
7 Preventive Maintenance...................................................................................................... 69
7.1 Inspecting the Equipment Room Environment and Cable Layout.......................................................................69
7.1.1 Precautions.......................................................................................................................................................................... 69
7.1.2 Inspecting the Equipment Room Environment....................................................................................................... 70
7.1.3 Inspecting Cable Layout.................................................................................................................................................. 70
7.2 Inspecting Servers................................................................................................................................................................. 71
7.2.1 Precautions.......................................................................................................................................................................... 71
TaiShan Servers
Troubleshooting Contents
Issue 12 (2022-08-30) Copyright © Huawei Technologies Co., Ltd. v