I've been happily running a cluster for several weeks, but yesterday it fell apart and I can't get it working again.
The first thing that happened was that at least one node stopped listening on port 1100. I can't remember what state the other node was in, but there were no exceptions in the logs and the servers were still performing their scheduled mbean tasks.
I restarted both nodes in the cluster, but neither seems to come up properly, and each one fails in a different way.
I'm using the default 'all' configuration, including the default cluster-service.xml. When I start one of the servers I get:
2004-06-10 09:05:34,142 INFO [org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Initializing
2004-06-10 09:05:34,163 DEBUG [DefaultPartition:ReplicantManager] registerRPCHandler
2004-06-10 09:05:34,163 DEBUG [DefaultPartition:ReplicantManager] subscribeToStateTransferEvents
2004-06-10 09:05:34,163 DEBUG [DefaultPartition:ReplicantManager] registerMembershipListener
2004-06-10 09:05:34,302 DEBUG [org.javagroups.DefaultPartition] [Thu Jun 10 09:05:34 BST 2004] [ERROR] JChannel.connect(): exception: java.net.BindException: Cannot assign requested address
2004-06-10 09:05:34,305 ERROR [org.jboss.ha.framework.server.ClusterPartition] Starting failed
ChannelException: java.net.BindException: Cannot assign requested address
    at org.jgroups.JChannel.connect(JChannel.java:224)
Sorry, an update: I just got the second machine to start up normally, so now I only need to fix the first one.
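In case it helps narrow this down: as far as I understand it, "Cannot assign requested address" usually means the process tried to bind a socket to an IP address that is not currently configured on any local interface. I don't know which address the channel is trying to bind on the failing machine, so the following is only a rough diagnostic sketch that can be run outside JBoss. The class name BindCheck and its two helper methods are made up for illustration and are not part of JBoss or JGroups. It checks whether a given address belongs to a local interface and whether a UDP socket can be bound to it on an ephemeral port.

```java
import java.net.DatagramSocket;
import java.net.InetAddress;
import java.net.InetSocketAddress;
import java.net.NetworkInterface;
import java.net.SocketException;

// Hypothetical stand-alone diagnostic, not part of JBoss or JGroups.
public class BindCheck {

    // True if the address is assigned to some network interface on this machine.
    static boolean isLocal(InetAddress addr) throws SocketException {
        return NetworkInterface.getByInetAddress(addr) != null;
    }

    // True if a UDP socket can be bound to the address on an ephemeral port (port 0).
    static boolean canBind(InetAddress addr) {
        try (DatagramSocket socket = new DatagramSocket(new InetSocketAddress(addr, 0))) {
            return true;
        } catch (SocketException e) {
            // java.net.BindException is a subclass of SocketException.
            System.out.println("Bind failed: " + e.getMessage());
            return false;
        }
    }

    public static void main(String[] args) throws Exception {
        // With no argument, test whatever address the local host name resolves to.
        InetAddress addr = args.length > 0
                ? InetAddress.getByName(args[0])
                : InetAddress.getLocalHost();
        System.out.println("Address:  " + addr);
        System.out.println("Is local: " + isLocal(addr));
        System.out.println("Can bind: " + canBind(addr));
    }
}
```

Running it with no arguments on the broken machine tests the address that the machine's own host name resolves to. If that address is not local or cannot be bound, a stale hosts entry or a changed interface address would be one place to look, though that is a guess on my part rather than something the log confirms.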