4 Replies Latest reply on Apr 26, 2011 4:21 AM by Wolf-Dieter Fink

    HAPartition can not be started on cluster

    Emre He Newbie

      Hello,

       

      i have 2 nodes as one cluster environment,

      1. first server can start up successfully without any exception;

      2. then when i start the second server, i met below ERROR logs.

       

      any idea about this issue?

       

      many thanks,

      Emre

       

       

      {code}

      04/21 16:06:38,626 ERROR [org.jboss.kernel.plugins.dependency.AbstractKernelController] main: Error installing to Start: name=HAPartition state=Create

      java.lang.IllegalStateException: Could not flush the cluster and proceed with state retrieval

      at org.jgroups.JChannel.getState(JChannel.java:1045)

      at org.jgroups.JChannel.getState(JChannel.java:977)

      at org.jgroups.JChannel.getState(JChannel.java:931)

      at org.jboss.ha.framework.server.ClusterPartition.fetchState(ClusterPartition.java:616)

      at org.jboss.ha.framework.server.ClusterPartition.startService(ClusterPartition.java:422)

      at org.jboss.system.ServiceMBeanSupport.jbossInternalStart(ServiceMBeanSupport.java:376)

      at org.jboss.system.ServiceMBeanSupport.pojoStart(ServiceMBeanSupport.java:216)

      at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)

      at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)

      at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)

      at java.lang.reflect.Method.invoke(Method.java:585)

      at org.jboss.reflect.plugins.introspection.ReflectionUtils.invoke(ReflectionUtils.java:59)

      at org.jboss.reflect.plugins.introspection.ReflectMethodInfoImpl.invoke(ReflectMethodInfoImpl.java:150)

      at org.jboss.joinpoint.plugins.BasicMethodJoinPoint.dispatch(BasicMethodJoinPoint.java:66)

      at org.jboss.kernel.plugins.dependency.KernelControllerContextAction$JoinpointDispatchWrapper.execute(KernelControllerContextAction.java:241)

       

      {code}
        • 1. Re: HAPartition can not be started on cluster
          Emre He Newbie

          and here are more ERROR logs happened before above lines:

           

           

          {code}

          04/21 16:05:59,783 ERROR [org.jgroups.protocols.UDP] Incoming-14,192.168.100.2:60243: failed handling incoming message

          java.io.EOFException

          at java.io.DataInputStream.readFully(DataInputStream.java:178)

          at java.io.DataInputStream.readUTF(DataInputStream.java:565)

          at java.io.DataInputStream.readUTF(DataInputStream.java:522)

          at org.jgroups.Message.readFrom(Message.java:671)

          at org.jgroups.protocols.TP.readMessage(TP.java:1506)

          at org.jgroups.protocols.TP.access$700(TP.java:49)

          at org.jgroups.protocols.TP$IncomingPacket.run(TP.java:1808)

          at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:650)

          at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:675)

          at java.lang.Thread.run(Thread.java:595)

          04/21 16:06:00,085 ERROR [org.jgroups.protocols.UDP] Incoming-15,192.168.100.2:60243: failed handling incoming message

          java.io.EOFException

          at java.io.DataInputStream.readFully(DataInputStream.java:178)

           

          {code}
          • 2. HAPartition can not be started on cluster
            Wolf-Dieter Fink Master

            Looks like a strange network problem to me.  (EOF) I suppose that part of the message are lost.

             

            How your cluster environment is?

            Which system OS?

            both nodes same system?

            any network hardware between if deployed at two different systems, mean switch/firewall etc

            • 3. HAPartition can not be started on cluster
              Emre He Newbie

              cluster has 2 nodes: 192.168.100.1, 192.168.100.2, and they all have other real ip addresses.

              system os RHEL 5.3

              both nodes have the same system

               

              both server connected in same switch in same cabinet, ping showes they are accessable from each side.

              • 4. HAPartition can not be started on cluster
                Wolf-Dieter Fink Master

                Could you check the JGroups function with the test described here http://community.jboss.org/wiki/TestingJBoss