寻觅生命中的那一片浅草......

ccsd is not running

终于解决了,这个问题困扰了我一天。

今天通过网络安装了centos5,将虚拟化,集群,GFS都装上了,准备学习下集群的东西,ricci,luci都起来了,但在浏览器输入 https://localhost:8084/时,说没初始化admin的密码,就用luci_admin init初始化,但报错,看了错误信息,说是luci在运行,不能初始化,把luci停了后,初始化就好了,这里写下,算是一个备忘吧。

接下来,在luci里添加了本机作为一个node,但里面提示cman服务没有起来。就用service cman start启动,但报如下错误:

Starting cluster:
Enabling workaround for Xend bridged networking… done
Loading modules… done
Mounting configfs… done
Starting ccsd… done
Starting cman… failed
/usr/sbin/cman_tool: ccsd is not running
[失败]

说是ccsd没有运行,但用ps aux | grep ccsd是在运行的,然后man ccsd,发现有个-n参数是使ccsd在前台运行,在终端输入ccsd -n,没想到它会报告一些即时的运行信息,在另一个终端运行 service cman start,还是报以上错误,但ccsd -n这边也报了错误,如下:
Starting ccsd 2.0.84:
Built: May 25 2008 02:03:49
Copyright (C) Red Hat, Inc.  2004  All rights reserved.
No Daemon:: SET

I/O warning : failed to load external entity “/etc/cluster/cluster.conf”
Unable to parse /etc/cluster/cluster.conf
Searching cluster for valid copy.
The appropriate config file could not be loaded.
Error while processing connect: No data available
Unable to connect to cluster infrastructure after 30 seconds.
Unable to connect to cluster infrastructure after 60 seconds.
Unable to connect to cluster infrastructure after 90 seconds.
Unable to connect to cluster infrastructure after 120 seconds.

提示说不能解释到/etc/cluster/cluster.conf,ls /etc/cluster目录,发现里面什么都没有,用touch /etc/cluster/cluster.conf新建配置文件,杀掉ccsd进程后,重启,再service cman start,继续报错:

Unable to parse /etc/cluster/cluster.conf.
You should either:
1. Correct the XML mistakes, or
2. (Re)move the file and attempt to grab a valid copy from the network.
service cman start
Starting cluster:
Enabling workaround for Xend bridged networking… done
Loading modules… done
Mounting configfs… done
Starting ccsd… done
Starting cman… done
Starting daemons… done
Starting fencing… done

估计语法不符合xml规范吧,将cluster.conf删除,用system-config-cluster图形化界面新建了一个cluster,并且命名为oracletest,与luci建立的名字一样,但没有添加任何内容,杀ccsd,再service cman start,继续报错:

Unable to parse /etc/cluster/cluster.conf
Searching cluster for valid copy.
The appropriate config file could not be loaded.
Error while processing connect: No data available
Unable to connect to cluster infrastructure after 150 seconds.
Unable to connect to cluster infrastructure after 180 seconds.
Unable to connect to cluster infrastructure after 210 seconds.
Unable to connect to cluster infrastructure after 240 seconds.
Stopping ccsd, SIGINT received.

继续用system-config-cluster,为oracletest添加节点,添加fence设备,保存配置文件,杀ccsd,再service cman start,成功,哈哈。

另外,看/var/log/message也可以看到ccsd报错的信息。

尚无评论

发表评论

2024年四月
« 5月    
1234567
891011121314
15161718192021
22232425262728
2930