Entries in the ' sysadmin ' category

Written May 10, 2009 in linux, sysadmin

Seeing this error message in /var/log/xen/xend.log?

ERROR (SrvDaemon:347) Exception starting xend (no
element found: line 1, column 0)

You’ve got a corrupt xen status base. Go under /var/lib/xend/ and remove any xml files under any of those directories. Don’t delete the directories or sockets themselves.

I had this happen after a …

Continue Reading »

Written May 7, 2009 in linux, sysadmin

I had a weirdness happen as I was feeling my way through this node configuration with crm (pacemaker) 1.0.3.

It turns out that as I configured my resources and created locations and constraints, the crm created a bunch of lrm_resource (location resource manager) objects in the xml cib. You can’t …

Continue Reading »

Written May 6, 2009 in linux, sysadmin

With the new email features in the most recent crm_mon daemon, it shouldn’t be too difficult to get a service set up so that Nagios will alert us when a stonith event happens, and maybe even some details about why.

It’ll take much longer for me to decide on …

Continue Reading »

Written May 4, 2009 in linux, sysadmin

Random notes … It’s nice to have manuals, but maybe these will help someone.

  • Hmn, you should probably add rules to keep the stonith service for a node off of it. It’ll probably also save time if you set it’s failcount low.
  • stonith is fun. BOOM, HEADSHOT!
  • For some reason, monitoring a

Continue Reading »

Written April 24, 2009 in howto, linux, sysadmin

You can use pacemaker with ocfs 1.4, but when you’re running service o2cb configure, you need to specify the “cluster stack” pcmk instead of the default o2cb. I haven’t tested this myself yet, but if I don’t blog it, I know I’ll forget it. :-P

Continue Reading »

Written April 17, 2009 in sysadmin, webdev

I’ve installed Semantic Mediawiki in our existing mediawiki instance at work to start tracking some of the data we need to retain for compliance and audit documentation purposes.

As the list of things that we support has grown, the number of servers has grown, and our documentation overhead has …

Continue Reading »

Written April 12, 2009 in linux, opensuse, sysadmin

Mm, alphabet soup.

I hadn’t looked into the cluster world since the last time we configured our cluster for SuSE 10.3 last year. Major groundswells, sinkholes, and shifts seem to have opened up … to the point where things aren’t even called the same anymore.

The big difference is that EVMS …

Continue Reading »

Written April 8, 2009 in linux, opensuse, sysadmin

After months of throwing books at one another and not talking despite sitting almost back to back, my co-sysadmin and I are starting the next buildout/migration of our Xen virtual cluster at work. One of the big decisions that we’re making is the OS. In the past, we’ve skipped every …

Continue Reading »

Written December 10, 2008 in sysadmin

It’s been quiet around here for the past few months. I’ve been busy working on central syslogging of all kinds of different things, and the project’s been killing my free time because it’s downright interesting to put together this many things all focused on one task.

The end goal …

Continue Reading »

Written December 3, 2008 in linux, sysadmin

Apache errors can use a syslog facility as a destination (by default in this mode, local7:info). Apache access logs, rewrite logs, and php error logs can not use syslog. This is annoying if you’re doing things like combining apache logs from a cluster of web servers for stats purposes or …

Continue Reading »