5 Replies Latest reply on Sep 21, 2007 12:18 PM by Bela Ban

    Cluster merge issues

    mk jboss Newbie

      We have a cluster of nodes deployed on the same machine. The cluster-service.xml contains the following snippet:

      <TCP bind_addr="localhost" start_port="${jboss.cluster.tcp.port:7800}" loopback="true"/>
      <TCPPING initial_hosts="localhost[${jboss.cluster.tcp.port:7800}]" port_range="${jboss.cluster.tcp.port.range:5}" timeout="3500"
          num_initial_members="${jboss.cluster.tcp.members:5}" up_thread="true" down_thread="true"/>
      <MERGE2 min_interval="5000" max_interval="10000"/>
      <FD shun="true" timeout="5000" max_tries="5" up_thread="false" down_thread="false"/>
      <VERIFY_SUSPECT timeout="4000" down_thread="false" up_thread="false"/>
      <pbcast.NAKACK down_thread="true" up_thread="true" gc_lag="100"
          retransmit_timeout="3000"/>
      <pbcast.STABLE desired_avg_gossip="20000" down_thread="false" up_thread="false"/>
      <pbcast.GMS join_timeout="5000" join_retry_timeout="2000" shun="false"
          print_local_addr="true" down_thread="true" up_thread="true"/>
      <pbcast.STATE_TRANSFER up_thread="true" down_thread="true"/>
      



      When a split happens, the nodes in the secondary partition never merge back into the cluster. We end up restarting the affected node every time this happens.
      Could someone tell me if there is anything wrong in the configuration?
      Would setting shun="true" in GMS change the behavior? I have also heard that the JGroups channel's AUTO_RECONNECT option should be set to true programmatically. How do we do that declaratively?
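
      To be concrete about the shun question, the change would be limited to the shun attribute on the pbcast.GMS element from the snippet above, with every other attribute left as posted. This is only the variant being asked about, not a confirmed fix:

      <pbcast.GMS join_timeout="5000" join_retry_timeout="2000" shun="true"
          print_local_addr="true" down_thread="true" up_thread="true"/>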

      Your help is appreciated.

      Thanks.