Entries in the 'linux' category

Written May 13, 2009 in linux

If you’re running a cluster environment with shared resources, you need to have STONITH or some sort of fencing running.

A cluster is a complicated beast. It’s a community of machines that makes decisions. The decisions can be simple and only affect the cluster (i.e. which service runs where and …

Continue Reading »

Written May 11, 2009 in linux, sysadmin

You can always use LSB scripts (/etc/init.d) with pacemaker, but it’s better to make them OCF-compliant.

Continue Reading »

Written May 10, 2009 in linux, sysadmin

Seeing this error message in /var/log/xen/xend.log?

ERROR (SrvDaemon:347) Exception starting xend (no element found: line 1, column 0)

You’ve got a corrupt xen status base. Go into /var/lib/xend/ and remove any XML files in the directories under it. Don’t delete the directories or sockets themselves.
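The cleanup described above can be sketched in Python. This is only a sketch: the function names and the dry-run default are my own assumptions, not anything shipped with xen. It treats every file ending in .xml anywhere below /var/lib/xend as a removal candidate, and it only ever deletes regular files, so directories and sockets stay put.

```python
from pathlib import Path


def find_xend_xml(root="/var/lib/xend"):
    """Return the regular files ending in .xml anywhere below root, sorted."""
    return sorted(p for p in Path(root).rglob("*.xml") if p.is_file())


def remove_xend_xml(root="/var/lib/xend", dry_run=True):
    """Delete the XML files below root and return the list of targets.

    With dry_run=True (the default, an assumption of this sketch) nothing
    is deleted; the caller only gets the list of files that would go.
    Directories and sockets are never touched.
    """
    targets = find_xend_xml(root)
    if not dry_run:
        for path in targets:
            path.unlink()
    return targets


if __name__ == "__main__":
    # Dry run first: print what would be removed, delete nothing.
    for path in remove_xend_xml(dry_run=True):
        print(path)
```

Run it as is to check the list, then call remove_xend_xml(dry_run=False) once it looks right.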

I had this happen after a …

Continue Reading »

Written May 7, 2009 in linux, sysadmin

I had a weirdness happen as I was feeling my way through this node configuration with crm (pacemaker) 1.0.3.

It turns out that as I configured my resources and created locations and constraints, the crm created a bunch of lrm_resource (local resource manager) objects in the XML CIB. You can’t …

Continue Reading »

Written May 6, 2009 in linux, sysadmin

With the new email features in the most recent crm_mon daemon, it shouldn’t be too difficult to get a service set up so that Nagios will alert us when a stonith event happens, and maybe even some details about why.

It’ll take much longer for me to decide on …

Continue Reading »

Written May 5, 2009 in howto, linux

With Dell kit of 1950/860 and newer, I’m using the built-in IPMI-over-LAN in the BIOS for stonith instead of messing with DRAC5 or more complicated means. It’s easy to configure on its own IP and it just plain works.
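For a quick check that IPMI-over-LAN answers before wiring it into stonith, something like the following Python wrapper around ipmitool may help. It is a sketch under assumptions: ipmitool has to be installed on the machine running it, the helper names are made up for this example, and the address and credentials in the usage line are placeholders. Whether your BMC wants the lan or lanplus interface depends on how it is configured.

```python
import subprocess

# Power actions this sketch allows; ipmitool itself accepts a few more.
POWER_ACTIONS = {"status", "on", "off", "cycle", "reset"}


def ipmi_power_command(host, user, password, action="status", interface="lan"):
    """Build the ipmitool argument list for a chassis power action."""
    if action not in POWER_ACTIONS:
        raise ValueError(f"unsupported power action: {action}")
    # Note: a password passed with -P is visible in the local process list.
    return [
        "ipmitool", "-I", interface,
        "-H", host, "-U", user, "-P", password,
        "chassis", "power", action,
    ]


def run_ipmi_power(host, user, password, action="status", interface="lan"):
    """Run ipmitool and return its output; raises if the command fails."""
    cmd = ipmi_power_command(host, user, password, action, interface)
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout.strip()


if __name__ == "__main__":
    # Placeholder address and credentials; this only prints the command.
    print(" ".join(ipmi_power_command("192.0.2.10", "stonith", "secret")))
```

If "status" works from another node in the cluster, the same host and credentials should be usable by an IPMI-based stonith plugin.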

First, a security note for people who have their machines …

Continue Reading »

Written May 4, 2009 in linux, sysadmin

Random notes … It’s nice to have manuals, but maybe these will help someone.

  • Hmm, you should probably add rules to keep the stonith service for a node off of it. It’ll probably also save time if you set its failcount low.
  • stonith is fun. BOOM, HEADSHOT!
  • For some reason, monitoring a

Continue Reading »

Written April 26, 2009 in linux

The High Availability Extension for SuSE Linux Enterprise Server is now … … available. From the download center at Novell’s website.

I’m going to miss the irony of it not having been released yet. *sniffle*

Continue Reading »

Written April 24, 2009 in howto, linux, sysadmin

You can use pacemaker with OCFS2 1.4, but when you’re running service o2cb configure, you need to specify the “cluster stack” pcmk instead of the default o2cb. I haven’t tested this myself yet, but if I don’t blog it, I know I’ll forget it. :-P

Continue Reading »

Written April 17, 2009 in sysadmin, webdev

I’ve installed Semantic MediaWiki in our existing MediaWiki instance at work to start tracking some of the data we need to retain for compliance and audit documentation purposes.

As the list of things that we support has grown, the number of servers has grown, and our documentation overhead has …

Continue Reading »