博客
关于我
强烈建议你试试无所不能的chatGPT,快点击我
记录下测试服务器频繁死机问题解决
阅读量:5961 次
发布时间:2019-06-19

本文共 3606 字,大约阅读时间需要 12 分钟。

hot3.png

问题

测试服务器频繁死机,刚开始一周一次,后面应用服务启动就死机。

服务器系统: CentOS 6.5
内核版本:2.6.32-431.el6.x86_64

服务器系统日志分析

查看日志:/var/log/message ,下面是出错比较多的

Dec  4 14:11:46 localhost abrtd: Init complete, entering main loopDec  4 14:11:53 localhost modem-manager: (ttyS1) closing serial device...Dec  4 14:11:53 localhost modem-manager: (ttyS1) opening serial device...Dec  4 14:11:59 localhost modem-manager: (ttyS1) closing serial device...Dec  4 14:12:16 localhost kernel: {1}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 1Dec  4 14:12:16 localhost kernel: {1}[Hardware Error]: APEI generic hardware error statusDec  4 14:12:16 localhost kernel: {1}[Hardware Error]: severity: 2, correctedDec  4 14:12:16 localhost kernel: {1}[Hardware Error]: section: 0, severity: 2, correctedDec  4 14:12:16 localhost kernel: {1}[Hardware Error]: flags: 0x01Dec  4 14:12:16 localhost kernel: {1}[Hardware Error]: primaryDec  4 14:12:16 localhost kernel: {1}[Hardware Error]: fru_text: CorrectedErrDec  4 14:12:16 localhost kernel: {1}[Hardware Error]: section_type: memory errorDec  4 14:12:16 localhost kernel: {1}[Hardware Error]: node: 15424Dec  4 14:12:16 localhost kernel: {1}[Hardware Error]: device: 12343Dec  4 14:12:16 localhost kernel: {1}[Hardware Error]: error_type: 2, single-bit ECCDec  4 14:12:16 localhost kernel: [Hardware Error]: Machine check events logged 【死机】Dec  9 04:05:06 localhost kernel: imklog 5.8.10, log source = /proc/kmsg started. 【重启】Dec  9 04:05:06 localhost rsyslogd: [origin software="rsyslogd" swVersion="5.8.10" x-pid="1601" x-info="http://www.rsyslog.com"] startDec  9 04:05:06 localhost kernel: Initializing cgroup subsys cpusetDec  9 04:05:11 localhost abrtd: Init complete, entering main loopDec  9 04:05:19 localhost modem-manager: (ttyS1) closing serial device...Dec  9 04:05:19 localhost modem-manager: (ttyS1) opening serial device...Dec  9 04:05:25 localhost modem-manager: (ttyS1) closing serial device...Dec  9 04:05:52 localhost kernel: {1}[Hardware Error]: Hardware error from APEI Generic Hardware Error Source: 1Dec  9 04:05:52 localhost kernel: {1}[Hardware Error]: APEI generic hardware error statusDec  9 04:05:52 localhost kernel: {1}[Hardware Error]: severity: 2, correctedDec  9 04:05:52 localhost kernel: {1}[Hardware Error]: section: 0, severity: 2, correctedDec  9 04:05:52 localhost kernel: {1}[Hardware Error]: flags: 0x01Dec  9 04:05:52 localhost kernel: {1}[Hardware Error]: primaryDec  9 04:05:52 localhost kernel: {1}[Hardware Error]: fru_text: CorrectedErrDec  9 04:05:52 localhost kernel: {1}[Hardware Error]: section_type: memory errorDec  9 04:05:52 localhost kernel: {1}[Hardware Error]: node: 24208Dec  9 04:05:52 localhost kernel: {1}[Hardware Error]: device: 12343Dec  9 04:05:52 localhost kernel: {1}[Hardware Error]: error_type: 2, single-bit ECCDec  9 04:05:52 localhost kernel: [Hardware Error]: Machine check events logged 【死机】Dec 11 10:40:00 localhost kernel: imklog 5.8.10, log source = /proc/kmsg started. 【重启】Dec 11 10:40:00 localhost rsyslogd: [origin software="rsyslogd" swVersion="5.8.10" x-pid="1603" x-info="http://www.rsyslog.com"] startDec 11 10:40:00 localhost kernel: Initializing cgroup subsys cpusetDec 11 10:40:00 localhost kernel: Initializing cgroup subsys cpu

当时看到这些错误还是比较懵,Hardware Error硬件错误,以为无法挽救。

解决办法

在bing搜索关键“Hardware error from APEI Generic Hardware Error Source: 1”找到一篇匹配度还算比较高的: 大致是系统与ECC 内存相关的问题导致

后面我进行了2个操作:

  • 1.内存条拔出来清理灰尘换个插槽重新插入【重启后问题没解决】
  • 2.升级内核 (内核从 2.6.32-431.el6.x86_64 升级到 )

目前服务器已经运行一周多,暂没出现死机现象,/var/log/message 无任何报错出现。

事后思考

服务器出现这个问题,可能与前几次突然停电有关。

资料参考

转载于:https://my.oschina.net/wenjinglian/blog/1591609

你可能感兴趣的文章
EXTJS 4.0 核心代码分析 (一)
查看>>
如何让路由器使用起来更加便捷
查看>>
实现自动为用户映射网络驱动器
查看>>
Kali 2/3中启动带数据库支持的MSF
查看>>
java调用新浪微博API发布第一条微博
查看>>
django实用技巧:template模板的使用
查看>>
python正则表达式基础
查看>>
Git Flow
查看>>
Objective-C --- - UICollectionView (梳理总结)
查看>>
我的友情链接
查看>>
jsoup将外部样式修改为内嵌样式
查看>>
存储方案与存储产品之NAS篇
查看>>
鸟哥学习笔记---网络基本管理
查看>>
鸟哥学习笔记---SAMBA
查看>>
JSON.parse()和JSON.stringify()
查看>>
完整的WordPress函数大全
查看>>
Citrix xenapp记录
查看>>
2011,我的IT我的梦
查看>>
×××lamp
查看>>
LAMP+Postfix+Dovecot+Postfixadmin搭建邮件管理系统(七)
查看>>