4 Replies Latest reply on Apr 26, 2011 4:21 AM by wdfink

    HAPartition can not be started on cluster

    emrehe

      Hello,

       

      i have 2 nodes as one cluster environment,

      1. first server can start up successfully without any exception;

      2. then when i start the second server, i met below ERROR logs.

       

      any idea about this issue?

       

      many thanks,

      Emre

       

       

      {code}

      04/21 16:06:38,626 ERROR [org.jboss.kernel.plugins.dependency.AbstractKernelController] main: Error installing to Start: name=HAPartition state=Create

      java.lang.IllegalStateException: Could not flush the cluster and proceed with state retrieval

      at org.jgroups.JChannel.getState(JChannel.java:1045)

      at org.jgroups.JChannel.getState(JChannel.java:977)

      at org.jgroups.JChannel.getState(JChannel.java:931)

      at org.jboss.ha.framework.server.ClusterPartition.fetchState(ClusterPartition.java:616)

      at org.jboss.ha.framework.server.ClusterPartition.startService(ClusterPartition.java:422)

      at org.jboss.system.ServiceMBeanSupport.jbossInternalStart(ServiceMBeanSupport.java:376)

      at org.jboss.system.ServiceMBeanSupport.pojoStart(ServiceMBeanSupport.java:216)

      at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

      at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)

      at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)

      at java.lang.reflect.Method.invoke(Method.java:585)

      at org.jboss.reflect.plugins.introspection.ReflectionUtils.invoke(ReflectionUtils.java:59)

      at org.jboss.reflect.plugins.introspection.ReflectMethodInfoImpl.invoke(ReflectMethodInfoImpl.java:150)

      at org.jboss.joinpoint.plugins.BasicMethodJoinPoint.dispatch(BasicMethodJoinPoint.java:66)

      at org.jboss.kernel.plugins.dependency.KernelControllerContextAction$JoinpointDispatchWrapper.execute(KernelControllerContextAction.java:241)

       

      {code}
        • 1. Re: HAPartition can not be started on cluster
          emrehe

          and here are more ERROR logs happened before above lines:

           

           

          {code}

          04/21 16:05:59,783 ERROR [org.jgroups.protocols.UDP] Incoming-14,192.168.100.2:60243: failed handling incoming message

          java.io.EOFException

          at java.io.DataInputStream.readFully(DataInputStream.java:178)

          at java.io.DataInputStream.readUTF(DataInputStream.java:565)

          at java.io.DataInputStream.readUTF(DataInputStream.java:522)

          at org.jgroups.Message.readFrom(Message.java:671)

          at org.jgroups.protocols.TP.readMessage(TP.java:1506)

          at org.jgroups.protocols.TP.access$700(TP.java:49)

          at org.jgroups.protocols.TP$IncomingPacket.run(TP.java:1808)

          at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:650)

          at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:675)

          at java.lang.Thread.run(Thread.java:595)

          04/21 16:06:00,085 ERROR [org.jgroups.protocols.UDP] Incoming-15,192.168.100.2:60243: failed handling incoming message

          java.io.EOFException

          at java.io.DataInputStream.readFully(DataInputStream.java:178)

           

          {code}
          • 2. HAPartition can not be started on cluster
            wdfink

            Looks like a strange network problem to me.  (EOF) I suppose that part of the message are lost.

             

            How your cluster environment is?

            Which system OS?

            both nodes same system?

            any network hardware between if deployed at two different systems, mean switch/firewall etc

            • 3. HAPartition can not be started on cluster
              emrehe

              cluster has 2 nodes: 192.168.100.1, 192.168.100.2, and they all have other real ip addresses.

              system os RHEL 5.3

              both nodes have the same system

               

              both server connected in same switch in same cabinet, ping showes they are accessable from each side.

              • 4. HAPartition can not be started on cluster
                wdfink

                Could you check the JGroups function with the test described here http://community.jboss.org/wiki/TestingJBoss