How to configure the JGroups Subsystem to Use TCP
jjakub Jul 27, 2014 6:39 PMEAP 6.1-alpha.
Domain on amazon ec2, 2 physical hosts, each has two nodes.
With jgroups configuration as below, I cannot create a tcp cluster on all 4 nodes,
I get 2 clusters - on each host there is a cluster from nodes running on this host.
Session replication correctly works on this two separate clusters.
But why there is no one big cluster spanning across nodes ?
I turned off centos firewall, and enabled 7850 7950 57850 57950 ports in amazon firewall
I try to do as in https://access.redhat.com/documentation/en-US/JBoss_Enterprise_Application_Platform/6/html/Administration_and_Configurat… and my conf is as below:
<stack name="tcpping"> | |
<transport type="TCP" socket-binding="jgroups-tcp"/> | |
<protocol type="TCPPING"> | |
<property name="initial_hosts"> | |
10.0.0.11[7850],10.0.0.11[7950],10.0.0.12[7850],10.0.0.12[7950] | |
</property> | |
<property name="port_range"> | |
0 | |
</property> | |
<property name="timeout"> | |
3000 | |
</property> | |
<property name="num_initial_members"> | |
2 | |
</property> | |
</protocol> | |
<protocol type="MERGE2"/> | |
<protocol type="FD_SOCK" socket-binding="jgroups-tcp-fd"/> | |
<protocol type="FD"/> | |
<protocol type="VERIFY_SUSPECT"/> | |
<protocol type="BARRIER"/> | |
<protocol type="pbcast.NAKACK"/> | |
<protocol type="UNICAST2"/> | |
<protocol type="pbcast.STABLE"/> | |
<protocol type="pbcast.GMS"/> | |
<protocol type="UFC"/> | |
<protocol type="MFC"/> | |
<protocol type="FRAG2"/> | |
<protocol type="RSVP"/> | |
</stack> |
In console on master I get messages like:
[Server:srv-four] 22:32:31,069 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-16,shared=tcpping) dropping unicast message to wrong destination slave:server-three/web; my local_addr is master:srv-four/web
[Server:srv-four] 22:32:31,548 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-16,shared=tcpping) dropping unicast message to wrong destination slave:server-three/web; my local_addr is master:srv-four/web
[Server:server-three] 22:32:58,319 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-20,shared=tcpping) dropping unicast message to wrong destination slave:server-two/web; my local_addr is master:server-three/web
[Server:server-three] 22:32:59,297 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-17,shared=tcpping) dropping unicast message to wrong destination slave:server-two/web; my local_addr is master:server-three/web
[Server:server-three] 22:33:00,298 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-17,shared=tcpping) dropping unicast message to wrong destination slave:server-two/web; my local_addr is master:server-three/web
[Server:server-three] 22:33:01,298 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-17,shared=tcpping) dropping unicast message to wrong destination slave:server-two/web; my local_addr is master:server-three/web
[Server:server-three] 22:33:02,299 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-17,shared=tcpping) dropping unicast message to wrong destination slave:server-two/web; my local_addr is master:server-three/web
[Server:server-three] 22:33:03,299 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-17,shared=tcpping) dropping unicast message to wrong destination slave:server-two/web; my local_addr is master:server-three/web
[Server:server-three] 22:33:03,322 INFO [org.jboss.as.clustering] (Incoming-1,shared=tcpping) JBAS010226: New cluster view for partition web: 10 (org.jboss.as.clustering.impl.CoreGroupCommunicationService$GroupView@6c17a6fd delta: 0, merge: true)
[Server:server-three] 22:33:03,322 INFO [org.infinispan.remoting.transport.jgroups.JGroupsTransport] (Incoming-1,shared=tcpping) ISPN000093: Received new, MERGED cluster view: MergeView::[master:srv-four/web|10] [master:srv-four/web, master:server-three/web], subgroups=[master:srv-four/web|9] [master:srv-four/web, master:server-three/web]
[Server:srv-four] 22:33:03,324 INFO [org.jboss.as.clustering] (Incoming-7,shared=tcpping) JBAS010225: New cluster view for partition web (id: 10, delta: 0, merge: true) : [master:srv-four/web, master:server-three/web]
[Server:srv-four] 22:33:03,325 INFO [org.infinispan.remoting.transport.jgroups.JGroupsTransport] (Incoming-7,shared=tcpping) ISPN000093: Received new, MERGED cluster view: MergeView::[master:srv-four/web|10] [master:srv-four/web, master:server-three/web], subgroups=[master:srv-four/web|9] [master:srv-four/web, master:server-three/web]
[Server:srv-four] 22:33:17,083 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-20,shared=tcpping) dropping unicast message to wrong destination slave:server-three/web; my local_addr is master:srv-four/web
[Server:srv-four] 22:33:49,515 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-16,shared=tcpping) dropping unicast message to wrong destination slave:server-three/web; my local_addr is master:srv-four/web
[Server:server-three] 22:34:35,716 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-17,shared=tcpping) dropping unicast message to wrong destination slave:server-two/web; my local_addr is master:server-three/web
On slave :
[Server:server-two] 22:35:14,476 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-20,shared=tcpping) dropping unicast message to wrong destination 33ca5346-71a6-045d-fef8-48f1415d48a0; my local_addr is slave:server-two/web
[Server:server-two] 22:35:15,476 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-20,shared=tcpping) dropping unicast message to wrong destination 33ca5346-71a6-045d-fef8-48f1415d48a0; my local_addr is slave:server-two/web
[Server:server-three] 22:35:15,830 INFO [org.jboss.as.clustering] (Incoming-17,shared=tcpping) JBAS010225: New cluster view for partition web (id: 11, delta: 0, merge: true) : [slave:server-three/web, slave:server-two/web]
[Server:server-two] 22:35:15,833 INFO [org.jboss.as.clustering] (Incoming-14,shared=tcpping) JBAS010226: New cluster view for partition web: 11 (org.jboss.as.clustering.impl.CoreGroupCommunicationService$GroupView@5849dd13 delta: 0, merge: true)
[Server:server-three] 22:35:15,831 INFO [org.infinispan.remoting.transport.jgroups.JGroupsTransport] (Incoming-17,shared=tcpping) ISPN000093: Received new, MERGED cluster view: MergeView::[slave:server-three/web|11] [slave:server-three/web, slave:server-two/web], subgroups=[slave:server-three/web|10] [slave:server-three/web, slave:server-two/web]
[Server:server-two] 22:35:15,836 INFO [org.infinispan.remoting.transport.jgroups.JGroupsTransport] (Incoming-14,shared=tcpping) ISPN000093: Received new, MERGED cluster view: MergeView::[slave:server-three/web|11] [slave:server-three/web, slave:server-two/web], subgroups=[slave:server-three/web|10] [slave:server-three/web, slave:server-two/web]
[Server:server-three] 22:35:49,694 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-18,shared=tcpping) dropping unicast message to wrong destination c0e63db7-7501-24cd-b3a9-9b7bb0635821; my local_addr is slave:server-three/web
[Server:server-three] 22:35:54,418 WARN [org.jgroups.protocols.TCP] (OOB-17,shared=tcpping) null: no physical address for c0e63db7-7501-24cd-b3a9-9b7bb0635821, dropping message
[Server:server-three] 22:36:19,697 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-17,shared=tcpping) dropping unicast message to wrong destination c0e63db7-7501-24cd-b3a9-9b7bb0635821; my local_addr is slave:server-three/web
[Server:server-two] 22:36:19,886 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-18,shared=tcpping) dropping unicast message to wrong destination 33ca5346-71a6-045d-fef8-48f1415d48a0; my local_addr is slave:server-two/web
[Server:server-two] 22:36:20,488 WARN [org.jgroups.protocols.TP$ProtocolAdapter] (OOB-18,shared=tcpping) dropping unicast message to wrong destination 33ca5346-71a6-045d-fef8-48f1415d48a0; my local_addr is slave:server-two/web
Any idea how can I test it, and what can I fix ?