2 Replies Latest reply on Dec 23, 2008 2:52 AM by navkalp

    Session replication, Please help

      Hi,

      I have two sun solaris 10 machines, each machine has two nodes configured in cluster as defined in the following link.
      http://www.jboss.org/community/docs/DOC-9384

      Please note that session replication is working on second machine.

      On the first machine where session replication is not working,when both nodes started, I am seeing following message in the log file. In the cluster-service.xml for all the four nodes I am using TCP, as it seems that UDP is not supported on machines in the question,
      First node:
      GMS: address is 172.16.64.21:32804
      -------------------------------------------------------
      16:36:51,208 INFO [TreeCache] viewAccepted(): [172.16.64.21:32804|0] [172.16.64.21:32804]
      16:36:51,291 INFO [TreeCache] TreeCache local address is 172.16.64.21:32804
      16:36:51,291 INFO [TreeCache] State could not be retrieved (we are the first member in group)
      16:36:51,292 INFO [TreeCache] parseConfig(): PojoCacheConfig is empty
      16:36:51,610 INFO [STDOUT] no object for null
      16:36:51,621 INFO [STDOUT] no object for null
      16:36:51,710 INFO [STDOUT] no object for null
      16:36:51,800 INFO [STDOUT] no object for {urn:jboss:bean-deployer}supplyType
      16:36:51,842 INFO [STDOUT] no object for {urn:jboss:bean-deployer}dependsType
      61.070: [GC [PSYoungGen: 196608K->8128K(229376K)] 201156K->12677K(1015808K), 0.0691540 secs]
      16:36:59,765 INFO [NativeServerConfig] JBoss Web Services - Native
      16:36:59,765 INFO [NativeServerConfig] jbossws-native-2.0.1.SP2 (build=200710210837)
      16:37:02,629 INFO [SnmpAgentService] SNMP agent going active
      16:37:03,981 INFO [AtheebWeb] Initializing
      16:37:04,158 INFO [STDOUT]
      -------------------------------------------------------
      GMS: address is 172.16.64.21:7800
      -------------------------------------------------------
      16:37:07,259 INFO [AtheebWeb] Number of cluster members: 3
      16:37:07,260 INFO [AtheebWeb] Other members: 2
      16:37:07,260 INFO [AtheebWeb] Fetching state (will wait for 30000 milliseconds):
      16:37:07,420 INFO [AtheebWeb] state was retrieved successfully (in 159 milliseconds)
      16:37:07,580 INFO [HANamingService] Started ha-jndi bootstrap jnpPort=1100, backlog=50, bindAddress=/0.0.0.0
      16:37:07,600 INFO [DetachedHANamingService$AutomaticDiscovery] Listening on /0.0.0.0:1102, group=230.0.0.4, HA-JNDI address=
      172.16.64.21:1100
      16:37:08,159 INFO [TreeCache] No transaction manager lookup class has been defined. Transactions cannot be used
      16:37:08,639 INFO [STDOUT]

      Second node:

      GMS: address is 172.16.64.21:32819
      -------------------------------------------------------
      16:44:13,309 INFO [TreeCache] viewAccepted(): [172.16.64.21:32819|0] [172.16.64.21:32819]
      16:44:13,392 INFO [TreeCache] TreeCache local address is 172.16.64.21:32819
      16:44:13,392 INFO [TreeCache] State could not be retrieved (we are the first member in group)
      16:44:13,393 INFO [TreeCache] parseConfig(): PojoCacheConfig is empty
      16:44:13,729 INFO [STDOUT] no object for null
      16:44:13,741 INFO [STDOUT] no object for null
      16:44:13,830 INFO [STDOUT] no object for null
      16:44:13,924 INFO [STDOUT] no object for {urn:jboss:bean-deployer}supplyType
      16:44:13,970 INFO [STDOUT] no object for {urn:jboss:bean-deployer}dependsType
      16:44:16,432 INFO [STDOUT] 498761 [ContainerBackgroundProcessor[StandardEngine[jboss.web]]] DEBUG org.jboss.web.tomcat.servi
      ce.session.JBossCacheManager - Looking for sessions that have expired ...
      60.425: [GC [PSYoungGen: 196608K->8144K(229376K)] 201156K->12693K(1015808K), 0.0591634 secs]
      16:44:22,221 INFO [NativeServerConfig] JBoss Web Services - Native
      16:44:22,221 INFO [NativeServerConfig] jbossws-native-2.0.1.SP2 (build=200710210837)
      16:44:25,056 INFO [SnmpAgentService] SNMP agent going active
      16:44:25,130 INFO [STDOUT] 507459 [SnmpPortal--1] DEBUG org.jboss.jmx.adaptor.snmp.trapd.TrapdService - V1 Trap from agent
      /172.16.64.21 on port 32821
      IP Address......... 172.16.64.21
      Enterprise Id...... .1.2.3.4.5.6.7
      Generic ........... 0
      Specific .......... 0
      TimeStamp ......... 376
      Length............. 2
      Community string... public
      Varbind[0] := .1.2.3.4.5.6.7.9 --> Thu Dec 18 16:44:24 AST 2008
      Varbind[1] := .1.2.3.4.5.6.7.6 --> 0
      16:44:26,359 INFO [AtheebWeb] Initializing
      16:44:26,442 INFO [STDOUT] 508772 [ContainerBackgroundProcessor[StandardEngine[jboss.web]]] DEBUG org.jboss.web.tomcat.servi
      ce.session.JBossCacheManager - Looking for sessions that have expired ...
      16:44:26,528 INFO [STDOUT]

      GMS: address is 172.16.64.21:7801
      -------------------------------------------------------
      16:44:26,636 INFO [AtheebWeb] New cluster view for partition AtheebWeb: 7 ([172.16.64.24:1099, 172.16.64.24:1199, 172.16.64.
      21:1099, 172.16.64.21:1199] delta: 1)
      16:44:26,636 INFO [STDOUT] 508966 [IncomingMessageHandler (channel=AtheebWeb)] INFO org.jboss.ha.framework.interfaces.HAPartition.AtheebWeb - New cluster view for partition AtheebWeb: 7 ([172.16.64.24:1099, 172.16.64.24:1199, 172.16.64.21:1099, 172.16.64.21:1199] delta: 1)
      16:44:26,641 INFO [STDOUT] 508971 [IncomingMessageHandler (channel=AtheebWeb)] DEBUG org.jboss.ha.framework.interfaces.HAPartition.AtheebWeb - membership changed from 3 to 4
      16:44:26,642 INFO [STDOUT] 508971 [AsynchViewChangeHandler Thread] DEBUG org.jboss.ha.framework.interfaces.HAPartition.AtheebWeb - Begin notifyListeners, viewID: 7
      16:44:26,642 INFO [AtheebWeb] I am (172.16.64.21:1099) received membershipChanged event:
      16:44:26,643 INFO [STDOUT] 508972 [AsynchViewChangeHandler Thread] INFO org.jboss.ha.framework.server.DistributedReplicantManagerImpl.AtheebWeb - I am (172.16.64.21:1099) received membershipChanged event:
      16:44:26,643 INFO [AtheebWeb] Dead members: 0 ([])
      16:44:26,643 INFO [STDOUT] 508973 [AsynchViewChangeHandler Thread] INFO org.jboss.ha.framework.server.DistributedReplicantManagerImpl.AtheebWeb - Dead members: 0 ([])
      16:44:26,644 INFO [AtheebWeb] New Members : 1 ([172.16.64.21:1199])
      16:44:26,644 INFO [STDOUT] 508974 [AsynchViewChangeHandler Thread] INFO org.jboss.ha.framework.server.DistributedReplicantManagerImpl.AtheebWeb - New Members : 1 ([172.16.64.21:1199])
      16:44:26,644 INFO [AtheebWeb] All Members : 4 ([172.16.64.24:1099, 172.16.64.24:1199, 172.16.64.21:1099, 172.16.64.21:1199])
      16:44:26,644 INFO [STDOUT] 508974 [AsynchViewChangeHandler Thread] INFO org.jboss.ha.framework.server.DistributedReplicantManagerImpl.AtheebWeb - All Members : 4 ([172.16.64.24:1099, 172.16.64.24:1199, 172.16.64.21:1099, 172.16.64.21:1199])
      16:44:26,645 INFO [STDOUT] 508975 [AsynchViewChangeHandler Thread] DEBUG org.jboss.ha.framework.interfaces.HAPartition.AtheebWeb - End notifyListeners, viewID: 7
      16:44:26,667 INFO [AtheebWeb] Number of cluster members: 4
      16:44:26,668 INFO [AtheebWeb] Other members: 3
      16:44:26,668 INFO [AtheebWeb] Fetching state (will wait for 30000 milliseconds):
      16:44:26,925 INFO [AtheebWeb] state was retrieved successfully (in 257 milliseconds)
      16:44:27,039 INFO [STDOUT] 509369 [AsynchKeyChangeHandler Thread] DEBUG org.jboss.ha.framework.server.HARMIServerImpl$RefreshProxiesHATarget - replicantsChanged 'HAJNDI' to 4 (intra-view id: 1039009024)
      16:44:27,094 INFO [HANamingService] Started ha-jndi bootstrap jnpPort=1200, backlog=50, bindAddress=/0.0.0.0
      16:44:27,113 INFO [DetachedHANamingService$AutomaticDiscovery] Listening on /0.0.0.0:1102, group=230.0.0.4, HA-JNDI address=172.16.64.21:1200
      16:44:27,138 INFO [STDOUT] 509468 [IncomingMessageHandler (channel=AtheebWeb)] DEBUG org.jboss.cache.invalidation.bridges.JGCacheInvalidationBridge - Updating list of invalidation groups that are bridged...
      16:44:27,139 INFO [STDOUT] 509469 [IncomingMessageHandler (channel=AtheebWeb)] DEBUG org.jboss.cache.invalidation.bridges.JGCacheInvalidationBridge - ... nothing needs to be bridged.
      16:44:27,488 INFO [STDOUT] 509818 [AsynchKeyChangeHandler Thread] DEBUG org.jboss.ha.singleton.HASingletonController - partitionTopologyChanged, isElectedNewMaster=false, isMasterNode=false, viewID=1039009024
      16:44:27,705 INFO [TreeCache] No transaction manager lookup class has been defined. Transactions cannot be used
      16:44:28,206 INFO [STDOUT]
      -------------------------------------------------------


      For the second machine where two jboss nodes join each other and form cluster, and session replication is happening perfectly.

      Nodes at my first servers are exact replica of the second server, where everything is working, I am not able to trouble shoot, what is the problem.

      Please help me, how can I debug the whole setup on the machine where the session replication is not happening.

        • 1. Re: Session replication, Please help
          brian.stansberry

          My guess is your session replication channel is *not* using TCP.

          GMS: address is 172.16.64.21:32804

          32804 seems more like a port the UDP configs would use.

          The session replication channel isn't configured via cluster-service.xml. It's via tc5-cluster-service.xml or jboss-web-cluster.sar/META-INF/jboss-service.xml. Which depends on what AS release you are using (which you should always include in your post!)

          • 2. Re: Session replication, Please help

            Thanks very much sir,

            I am using Jboss4.2.2 application server, Apologies for not mentioneing the same.

            My system started working with your suggestions, thanks a lot.