0 Replies Latest reply on Aug 6, 2008 7:31 AM by abhinav_solan

    Problem in enabling clustering on 2 nodes in JBoss 3.2.8SP1

    abhinav_solan

      Hi I have been trying to enable clustering on 2 machines on JBoss 3.2.8SP1( I cant move to JBoss 4 as my application is configurable for 3.2.8SP1 only) but the 2 machines are not able to discover each other, other machine is showing.

      11:23:10,526 INFO [DefaultPartition] Number of cluster members: 1
      11:23:10,526 INFO [DefaultPartition] Other members: 0

      can some one please help me in this ...

      Here is the log on my machine

      =========================================================================

      JBoss Bootstrap Environment

      JBOSS_HOME: /home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1

      JAVA: /usr/lib/jvm/java-6-sun-1.6.0.06//bin/java

      JAVA_OPTS: -server -Dprogram.name=run.sh

      CLASSPATH: /home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/bin/run.jar:/usr/lib/jvm/java-6-sun-1.6.0.06//lib/tools.jar

      =========================================================================

      11:22:20,032 INFO [Server] Starting JBoss (MX MicroKernel)...
      11:22:20,199 INFO [Server] Release ID: JBoss [WonderLand] 3.2.8.SP1 (build: CVSTag=JBoss_3_2_8_SP1 date=200603031235)
      11:22:20,200 INFO [Server] Home Dir: /home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1
      11:22:20,200 INFO [Server] Home URL: file:/home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/
      11:22:20,201 INFO [Server] Patch URL: null
      11:22:20,202 INFO [Server] Server Name: all
      11:22:20,202 INFO [Server] Server Home Dir: /home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/server/all
      11:22:20,202 INFO [Server] Server Home URL: file:/home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/server/all/
      11:22:20,202 INFO [Server] Server Temp Dir: /home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/server/all/tmp
      11:22:20,203 INFO [Server] Root Deployment Filename: jboss-service.xml
      11:22:24,687 INFO [ServerInfo] Java version: 1.6.0_06,Sun Microsystems Inc.
      11:22:24,687 INFO [ServerInfo] Java VM: Java HotSpot(TM) Server VM 10.0-b22,Sun Microsystems Inc.
      11:22:24,687 INFO [ServerInfo] OS-System: Linux 2.6.24-19-generic,i386
      11:22:28,063 INFO [Server] Core system initialized
      11:22:45,167 INFO [Log4jService$URLWatchTimerTask] Configuring from URL: resource:log4j.xml
      11:22:51,550 INFO [WebService] Using RMI server codebase: http://abhinav-desktop:8083/
      11:22:55,210 INFO [NamingService] JNDI bootstrap JNP=/0.0.0.0:1099, RMI=/0.0.0.0:1098, backlog=50, no client SocketFactory, Server SocketFactory=class org.jboss.net.sockets.DefaultSocketFactory
      11:23:06,249 INFO [SnmpAgentService] SNMP agent going active
      11:23:06,486 INFO [RARMetaData] Required license terms present. See deployment descriptor.
      11:23:06,571 INFO [RARMetaData] Loading JBoss Resource Adapter for JDBC 2 XA drivers
      11:23:06,571 INFO [RARMetaData] Required license terms present. See deployment descriptor.
      11:23:06,645 INFO [RARMetaData] Required license terms present. See deployment descriptor.
      11:23:08,344 INFO [DefaultPartition] Initializing
      11:23:08,496 INFO [UDP] unicast sockets will use interface 127.0.1.1
      11:23:08,503 INFO [UDP] socket information:
      local_addr=abhinav-desktop:56543 (additional data: 14 bytes), mcast_addr=228.1.2.3:45566, bind_addr=/127.0.1.1, ttl=8
      sock: bound to 127.0.1.1:56543, receive buffer size=131071, send buffer size=131071
      mcast_recv_sock: bound to 127.0.1.1:45566, send buffer size=131071, receive buffer size=131071
      mcast_send_sock: bound to 127.0.1.1:54431, send buffer size=131071, receive buffer size=131071
      11:23:08,506 INFO [STDOUT]
      -------------------------------------------------------
      GMS: address is abhinav-desktop:56543 (additional data: 14 bytes)
      -------------------------------------------------------
      11:23:10,526 INFO [DefaultPartition] Number of cluster members: 1
      11:23:10,526 INFO [DefaultPartition] Other members: 0
      11:23:10,527 INFO [DefaultPartition] Fetching state (will wait for 60000 milliseconds):
      11:23:10,528 INFO [DefaultPartition] New cluster view for partition DefaultPartition (id: 0, delta: 0) : [127.0.1.1:1099]
      11:23:10,626 INFO [DefaultPartition] I am (127.0.1.1:1099) received membershipChanged event:
      11:23:10,678 INFO [DefaultPartition] Dead members: 0 ([])
      11:23:10,678 INFO [DefaultPartition] New Members : 0 ([])
      11:23:10,678 INFO [DefaultPartition] All Members : 1 ([127.0.1.1:1099])
      11:23:10,701 INFO [HANamingService] Listening on /0.0.0.0:1100
      11:23:10,751 INFO [DetachedHANamingService$AutomaticDiscovery] Listening on /0.0.0.0:1102, group=230.0.0.4, HA-JNDI address=127.0.1.1:1100
      11:23:13,838 INFO [orb] ORB run
      11:23:14,303 INFO [CorbaNamingService] Naming: [IOR:000000000000002B49444C3A6F6D672E6F72672F436F734E616D696E672F4E616D696E67436F6E746578744578743A312E300000000000010000000000000064000102000000000A3132372E302E312E31000DC8000000114A426F73732F4E616D696E672F726F6F74000000000000020000000000000008000000004A414300000000010000001C00000000000100010000000105010001000101090000000105010001]
      11:23:14,527 INFO [MailService] Mail Service bound to java:/Mail
      11:23:14,982 INFO [TreeCache] setting cluster properties from xml to: UDP(ip_mcast=true;ip_ttl=8;loopback=false;mcast_addr=230.1.2.3;mcast_port=45577;mcast_recv_buf_size=80000;mcast_send_buf_size=150000;ucast_recv_buf_size=80000;ucast_send_buf_size=150000):PING(down_thread=false;num_initial_members=3;timeout=2000;up_thread=false):MERGE2(max_interval=20000;min_interval=10000):FD_SOCK:VERIFY_SUSPECT(down_thread=false;timeout=1500;up_thread=false):pbcast.NAKACK(down_thread=false;gc_lag=50;max_xmit_size=8192;retransmit_timeout=600,1200,2400,4800;up_thread=false):UNICAST(down_thread=false;min_threshold=10;timeout=600,1200,2400;window_size=100):pbcast.STABLE(desired_avg_gossip=20000;down_thread=false;up_thread=false):FRAG(down_thread=false;frag_size=8192;up_thread=false):pbcast.GMS(join_retry_timeout=2000;join_timeout=5000;print_local_addr=true;shun=true):pbcast.STATE_TRANSFER(down_thread=true;up_thread=true)
      11:23:14,998 INFO [TreeCache] interceptor chain is:
      class org.jboss.cache.interceptors.CallInterceptor
      class org.jboss.cache.interceptors.LockInterceptor
      class org.jboss.cache.interceptors.UnlockInterceptor
      class org.jboss.cache.interceptors.ReplicationInterceptor
      11:23:14,999 INFO [TreeCache] cache mode is REPL_ASYNC
      11:23:15,055 INFO [UDP] unicast sockets will use interface 127.0.1.1
      11:23:15,059 INFO [UDP] socket information:
      local_addr=abhinav-desktop:43561, mcast_addr=230.1.2.3:45577, bind_addr=/127.0.1.1, ttl=8
      sock: bound to 127.0.1.1:43561, receive buffer size=80000, send buffer size=131071
      mcast_recv_sock: bound to 127.0.1.1:45577, send buffer size=131071, receive buffer size=80000
      mcast_send_sock: bound to 127.0.1.1:50021, send buffer size=131071, receive buffer size=80000
      11:23:15,060 INFO [STDOUT]
      -------------------------------------------------------
      GMS: address is abhinav-desktop:43561
      -------------------------------------------------------
      11:23:17,129 INFO [TreeCache] state could not be retrieved (must be first member in group)
      11:23:17,129 INFO [TreeCache] viewAccepted(): new members: [abhinav-desktop:43561]
      11:23:17,223 INFO [TreeCache] new cache is null (maybe first member in cluster)
      11:23:17,431 INFO [Embedded] Catalina naming disabled
      11:23:17,960 INFO [Http11Protocol] Initializing Coyote HTTP/1.1 on http-0.0.0.0-8080
      11:23:18,001 INFO [Catalina] Initialization processed in 523 ms
      11:23:18,028 INFO [StandardService] Starting service jboss.web
      11:23:18,031 INFO [StandardEngine] Starting Servlet Engine: Apache Tomcat/5.0.30
      11:23:18,044 INFO [StandardHost] XML validation disabled
      11:23:18,093 INFO [Catalina] Server startup in 66 ms
      11:23:18,177 INFO [TomcatDeployer] deploy, ctxPath=/invoker, warUrl=file:/home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/server/all/deploy/http-invoker.sar/invoker.war/
      11:23:19,204 INFO [TomcatDeployer] deploy, ctxPath=/jboss-net, warUrl=file:/home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/server/all/deploy/jboss-net.sar/jboss-net.war/
      11:23:19,396 INFO [TomcatDeployer] deploy, ctxPath=/, warUrl=file:/home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/server/all/deploy/jbossweb-tomcat50.sar/ROOT.war/
      11:23:19,608 INFO [TomcatDeployer] deploy, ctxPath=/jbossmq-httpil, warUrl=file:/home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/server/all/deploy-hasingleton/jms/jbossmq-httpil.sar/jbossmq-httpil.war/
      11:23:19,801 INFO [DefaultDS] Bound connection factory for resource adapter for ConnectionManager 'jboss.jca:service=LocalTxCM,name=DefaultDS to JNDI name 'java:/DefaultDS'
      11:23:20,047 INFO [A] Bound to JNDI name: queue/A
      11:23:20,048 INFO [B] Bound to JNDI name: queue/B
      11:23:20,050 INFO [C] Bound to JNDI name: queue/C
      11:23:20,051 INFO [D] Bound to JNDI name: queue/D
      11:23:20,052 INFO [ex] Bound to JNDI name: queue/ex
      11:23:20,078 INFO [testTopic] Bound to JNDI name: topic/testTopic
      11:23:20,080 INFO [securedTopic] Bound to JNDI name: topic/securedTopic
      11:23:20,082 INFO [testDurableTopic] Bound to JNDI name: topic/testDurableTopic
      11:23:20,085 INFO [testQueue] Bound to JNDI name: queue/testQueue
      11:23:20,125 INFO [UILServerILService] JBossMQ UIL service available at : /0.0.0.0:8093
      11:23:20,215 INFO [DLQ] Bound to JNDI name: queue/DLQ
      11:23:20,239 INFO [JmsXA] Bound connection factory for resource adapter for ConnectionManager 'jboss.jca:service=TxCM,name=JmsXA to JNDI name 'java:/JmsXA'
      11:23:20,331 INFO [TomcatDeployer] deploy, ctxPath=/jmx-console, warUrl=file:/home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/server/all/deploy/jmx-console.war/
      11:23:20,535 INFO [TomcatDeployer] deploy, ctxPath=/web-console, warUrl=file:/home/abhinav/Work/jboss_server/trial/jboss-3.2.8.SP1/server/all/deploy/management/web-console.war/
      11:23:21,524 INFO [Http11Protocol] Starting Coyote HTTP/1.1 on http-0.0.0.0-8080
      11:23:21,779 INFO [ChannelSocket] JK2: ajp13 listening on /0.0.0.0:8009
      11:23:21,909 INFO [JkMain] Jk running ID=0 time=0/153 config=null
      11:23:21,931 INFO [Server] JBoss (MX MicroKernel) [3.2.8.SP1 (build: CVSTag=JBoss_3_2_8_SP1 date=200603031235)] Started in 1m:1s:726ms

      Here is my cluster-service.xml
      <?xml version="1.0" encoding="UTF-8"?>

      <!-- ===================================================================== -->
      <!-- -->
      <!-- Sample Clustering Service Configuration -->
      <!-- -->
      <!-- ===================================================================== -->





      <!-- ==================================================================== -->
      <!-- Cluster Partition: defines cluster -->
      <!-- ==================================================================== -->



      <!-- Name of the partition being built -->
      ${jboss.partition.name:DefaultPartition}
      <!-- The address used to determine the node name -->
      ${jboss.bind.address}
      <!-- Determine if deadlock detection is enabled -->
      False

      <!-- Time in milliseconds to wait for state to be transferred -->
      60000

      <!-- The JGroups protocol configuration -->


      <!-- UDP: if you have a multihomed machine,
      set the bind_addr attribute to the appropriate NIC IP address -->
      <!-- UDP: On Windows machines, because of the media sense feature
      being broken with multicast (even after disabling media sense)
      set the loopback attribute to true -->
      <UDP mcast_addr="228.1.2.3" mcast_port="45566"
      ip_ttl="8" ip_mcast="true"
      mcast_send_buf_size="800000" mcast_recv_buf_size="150000"
      ucast_send_buf_size="800000" ucast_recv_buf_size="150000"
      loopback="false" />
      <PING timeout="2000" num_initial_members="3"
      up_thread="true" down_thread="true" />
      <MERGE2 min_interval="10000" max_interval="20000" />
      <FD shun="true" up_thread="true" down_thread="true"
      timeout="2500" max_tries="5" />
      <VERIFY_SUSPECT timeout="3000" num_msgs="3"
      up_thread="true" down_thread="true" />
      <pbcast.NAKACK gc_lag="50" retransmit_timeout="300,600,1200,2400,4800"
      max_xmit_size="8192"
      up_thread="true" down_thread="true" />
      <UNICAST timeout="300,600,1200,2400,4800" window_size="100" min_threshold="10"
      down_thread="true" />
      <pbcast.STABLE desired_avg_gossip="20000"
      up_thread="true" down_thread="true" />
      <FRAG frag_size="8192"
      down_thread="true" up_thread="true" />
      <pbcast.GMS join_timeout="5000" join_retry_timeout="2000"
      shun="true" print_local_addr="true" />
      <pbcast.STATE_TRANSFER up_thread="true" down_thread="true" />





      <!-- ==================================================================== -->
      <!-- HA Session State Service for SFSB -->
      <!-- ==================================================================== -->


      jboss:service=${jboss.partition.name:DefaultPartition}
      <!-- Name of the partition to which the service is linked -->
      ${jboss.partition.name:DefaultPartition}
      <!-- JNDI name under which the service is bound -->
      /HASessionState/Default
      <!-- Max delay before cleaning unreclaimed state.
      Defaults to 30*60*1000 => 30 minutes -->
      0


      <!-- ==================================================================== -->
      <!-- HA JNDI -->
      <!-- ==================================================================== -->


      jboss:service=${jboss.partition.name:DefaultPartition}
      <!-- Name of the partition to which the service is linked -->
      ${jboss.partition.name:DefaultPartition}
      <!-- bind address of HA JNDI RMI endpoint -->
      ${jboss.bind.address}
      <!-- RmiPort to be used by the HA-JNDI service
      once bound. 0 => auto. -->
      1101
      <!-- Port on which the HA-JNDI stub is made available -->
      1100
      <!-- Backlog to be used for client-server RMI
      invocations during JNDI queries -->
      50

      <!-- Multicast Address and Group used for auto-discovery -->
      ${jboss.partition.udpGroup:230.0.0.4}
      1102

      <!-- IP Address to which should be bound: the Port, the RmiPort and
      the AutoDiscovery multicast socket. -->
      <!-- Client socket factory to be used for client-server
      RMI invocations during JNDI queries -->
      <!--attribute name="ClientSocketFactory">custom</attribute-->
      <!-- Server socket factory to be used for client-server
      RMI invocations during JNDI queries -->
      <!--attribute name="ServerSocketFactory">custom</attribute-->



      ${jboss.bind.address}
      4447
      <!--
      0
      custom
      custom
      -->


      <!-- ==================================================================== -->
      <!-- Distributed cache invalidation -->
      <!-- ==================================================================== -->


      jboss:service=${jboss.partition.name:DefaultPartition}
      jboss.cache:service=InvalidationManager
      jboss.cache:service=InvalidationManager
      ${jboss.partition.name:DefaultPartition}
      DefaultJGBridge





      Here is my tc5-cluster-service.xml
      <?xml version="1.0" encoding="UTF-8"?>

      <!-- ===================================================================== -->
      <!-- -->
      <!-- Customized TreeCache Service Configuration for Tomcat 5 Clustering -->
      <!-- -->
      <!-- ===================================================================== -->





      <!-- ==================================================================== -->
      <!-- Defines TreeCache configuration -->
      <!-- ==================================================================== -->



      jboss:service=Naming
      jboss:service=TransactionManager

      <!-- Configure the TransactionManager -->
      org.jboss.cache.JBossTransactionManagerLookup

      <!--
      Isolation level : SERIALIZABLE
      REPEATABLE_READ (default)
      READ_COMMITTED
      READ_UNCOMMITTED
      NONE
      -->
      REPEATABLE_READ

      <!--
      Valid modes are LOCAL, REPL_ASYNC and REPL_SYNC
      -->
      REPL_ASYNC

      <!-- Name of cluster. Needs to be the same for all clusters, in order
      to find each other
      -->
      Tomcat-Cluster

      <!-- JGroups protocol stack properties. Can also be a URL,
      e.g. file:/home/bela/default.xml

      -->



      <!-- UDP: if you have a multihomed machine,
      set the bind_addr attribute to the appropriate NIC IP address, e.g bind_addr="192.168.0.2"
      -->
      <!-- UDP: On Windows machines, because of the media sense feature
      being broken with multicast (even after disabling media sense)
      set the loopback attribute to true -->
      <UDP mcast_addr="230.1.2.3" mcast_port="45577"
      ip_ttl="8" ip_mcast="true"
      mcast_send_buf_size="150000" mcast_recv_buf_size="80000"
      ucast_send_buf_size="150000" ucast_recv_buf_size="80000"
      loopback="false"/>
      <PING timeout="2000" num_initial_members="3"
      up_thread="false" down_thread="false"/>
      <MERGE2 min_interval="10000" max_interval="20000"/>
      <FD_SOCK/>
      <VERIFY_SUSPECT timeout="1500"
      up_thread="false" down_thread="false"/>
      <pbcast.NAKACK gc_lag="50" retransmit_timeout="600,1200,2400,4800"
      max_xmit_size="8192" up_thread="false" down_thread="false"/>
      <UNICAST timeout="600,1200,2400" window_size="100" min_threshold="10"
      down_thread="false"/>
      <pbcast.STABLE desired_avg_gossip="20000"
      up_thread="false" down_thread="false"/>
      <FRAG frag_size="8192"
      down_thread="false" up_thread="false"/>
      <pbcast.GMS join_timeout="5000" join_retry_timeout="2000"
      shun="true" print_local_addr="true"/>
      <pbcast.STATE_TRANSFER up_thread="true" down_thread="true"/>



      <!-- Max number of milliseconds to wait for a lock acquisition -->
      15000