0 Replies Latest reply on Sep 2, 2008 4:31 AM by agohar

    JBoss-4.2.2 Clustering

    agohar

      Hi,

      Starting a new topic just for visibility.

      I am trying to build a jboss cluster and having some issues. I've downloaded fresh copy of jboss-4.2.2 and deployed on 3 servers. I configured cluster-service.xml and jboss-web-cluster/META-INF/jboss-service.xml and put udp ip and port in there. Here are the portions of my xml files:

      cluster-service.xml

      <Config>
       <UDP mcast_addr="${jboss.partition.udpGroup:228.20.20.35}"
       mcast_port="${jboss.hapartition.mcast_port:45667}"
       tos="8"
       ucast_recv_buf_size="20000000"
       ucast_send_buf_size="640000"
       mcast_recv_buf_size="25000000"
       mcast_send_buf_size="640000"
       loopback="false"
       discard_incompatible_packets="true"
       enable_bundling="false"
       max_bundle_size="64000"
       max_bundle_timeout="30"
       use_incoming_packet_handler="true"
       use_outgoing_packet_handler="false"
       ip_ttl="${jgroups.udp.ip_ttl:8}"
       down_thread="false" up_thread="false"/>
       <PING timeout="2000"
       down_thread="false" up_thread="false" num_initial_members="3"/>
       <MERGE2 max_interval="100000"
       down_thread="false" up_thread="false" min_interval="20000"/>
       <FD_SOCK down_thread="false" up_thread="false"/>
       <FD timeout="10000" max_tries="5" down_thread="false" up_thread="false" shun="true"/>
       <VERIFY_SUSPECT timeout="1500" down_thread="false" up_thread="false"/>
       <pbcast.NAKACK max_xmit_size="60000"
       use_mcast_xmit="false" gc_lag="0"
       retransmit_timeout="300,600,1200,2400,4800"
       down_thread="false" up_thread="false"
       discard_delivered_msgs="true"/>
       <UNICAST timeout="300,600,1200,2400,3600"
       down_thread="false" up_thread="false"/>
       <pbcast.STABLE stability_delay="1000" desired_avg_gossip="50000"
       down_thread="false" up_thread="false"
       max_bytes="400000"/>
       <pbcast.GMS print_local_addr="true" join_timeout="3000"
       down_thread="false" up_thread="false"
       join_retry_timeout="2000" shun="true"
       view_bundling="true"/>
       <FRAG2 frag_size="60000" down_thread="false" up_thread="false"/>
       <pbcast.STATE_TRANSFER down_thread="false" up_thread="false" use_flush="false"/>
       </Config>
      


      jboss-web-cluster.sar/META-INF/jboss-service.xml
      <config>
       <UDP mcast_addr="${jboss.partition.udpGroup:230.20.20.74}"
       mcast_port="${jboss.webpartition.mcast_port:45679}"
       tos="8"
       ucast_recv_buf_size="20000000"
       ucast_send_buf_size="640000"
       mcast_recv_buf_size="25000000"
       mcast_send_buf_size="640000"
       loopback="false"
       discard_incompatible_packets="true"
       enable_bundling="false"
       max_bundle_size="64000"
       max_bundle_timeout="30"
       use_incoming_packet_handler="true"
       use_outgoing_packet_handler="false"
       ip_ttl="${jgroups.udp.ip_ttl:8}"
       down_thread="false" up_thread="false"/>
       <PING timeout="2000"
       down_thread="false" up_thread="false" num_initial_members="3"/>
       <MERGE2 max_interval="100000"
       down_thread="false" up_thread="false" min_interval="20000"/>
       <FD_SOCK down_thread="false" up_thread="false"/>
       <FD timeout="10000" max_tries="5" down_thread="false" up_thread="false" shun="true"/>
       <VERIFY_SUSPECT timeout="1500" down_thread="false" up_thread="false"/>
       <pbcast.NAKACK max_xmit_size="60000"
       use_mcast_xmit="false" gc_lag="0"
       retransmit_timeout="300,600,1200,2400,4800"
       down_thread="false" up_thread="false"
       discard_delivered_msgs="true"/>
       <UNICAST timeout="300,600,1200,2400,3600"
       down_thread="false" up_thread="false"/>
       <pbcast.STABLE stability_delay="1000" desired_avg_gossip="50000"
       down_thread="false" up_thread="false"
       max_bytes="400000"/>
       <pbcast.GMS print_local_addr="true" join_timeout="3000"
       down_thread="false" up_thread="false"
       join_retry_timeout="2000" shun="true"
       view_bundling="true"
       view_ack_collection_timeout="5000"/>
       <FC max_credits="2000000" down_thread="false" up_thread="false"
       min_threshold="0.10"/>
       <FRAG2 frag_size="60000" down_thread="false" up_thread="false"/>
       <pbcast.STATE_TRANSFER down_thread="false" up_thread="false" use_flush="false"/>
       </config>


      When i start the servers in cluster, 2 of the servers join the cluster fine but adding third one creates problem and is very slow during startup and gets halt for some time at:
      2008-09-01 13:39:51,150 INFO [org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Number of cluster members: 3
      2008-09-01 13:39:51,150 INFO [org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Other members: 2
      2008-09-01 13:39:51,151 INFO [org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] Fetching state (will wait for 30000 milliseconds):
      2008-09-01 13:39:51,194 INFO [org.jboss.ha.framework.interfaces.HAPartition.DefaultPartition] state was retrieved successfully (in 43 milliseconds)
      


      I didn't deploy any application on the servers, these are just fresh servers. I've tried the same with jboss-4.0.2 and it works fine on the same network with same clustering configs.

      Is there anything changed in new jboss-4.2.2 for the way jboss servers communicate for clustering? any network configurations i can check? Please help

      Thanks