3 Replies Latest reply on Dec 1, 2006 2:03 PM by brian.stansberry

configuration question: how to limit size of NAKACK structur

bruyeron Dec 1, 2006 6:32 AM

I am running into an issue in the case when something is wrong with several nodes in the cluster, and the surviving node somehow does not evict the troublesome nodes and starts accumulating messages.

The current config looks like this:

 <property name="isolationLevel" value="REPEATABLE_READ" />
 <property name="cacheMode" value="REPL_ASYNC" />
 <property name="clusterName" value="${treeCache.clusterName}" />
 <property name="useReplQueue" value="false" />
 <property name="replQueueInterval" value="0" />
 <property name="replQueueMaxElements" value="0" />
 <property name="fetchInMemoryState" value="true" />
 <property name="initialStateRetrievalTimeout" value="20000" />
 <property name="syncReplTimeout" value="20000" />
 <property name="lockAcquisitionTimeout" value="5000" />
 <property name="useRegionBasedMarshalling" value="false" />
 <property name="clusterProperties"
 value="${treeCache.clusterProperties}" />
 <property name="serviceName">
 <bean class="javax.management.ObjectName">
 <constructor-arg value="jboss.cache:service=${treeCache.clusterName},name=${treeCache.instanceName}"/>
 </bean>
 </property>
 <property name="evictionPolicyClass" value="org.jboss.cache.eviction.LRUPolicy"/>
 <property name="maxAgeSeconds" value="${treeCache.eviction.maxAgeSeconds}"/>
 <property name="maxNodes" value="${treeCache.eviction.maxNodes}"/>
 <property name="timeToLiveSeconds" value="${treeCache.eviction.timeToLiveSeconds}"/>

The jgroups stack is this:

treeCache.clusterProperties=UDP(ip_mcast=true;ip_ttl=64;loopback=false;mcast_addr=${treeCache.mcastAddress};mcast_port=${treeCache.mcastPort};mcast_recv_buf_
size=80000;mcast_send_buf_size=150000;ucast_recv_buf_size=80000;ucast_send_buf_size=150000;bind_addr=${treeCache.bind_addr}):\
PING(down_thread=false;num_initial_members=3;timeout=2000;up_thread=false):\
MERGE2(max_interval=20000;min_interval=10000):\
FD_SOCK(down_thread=false;up_thread=false):\
VERIFY_SUSPECT(down_thread=false;timeout=1500;up_thread=false):\
pbcast.NAKACK(down_thread=false;gc_lag=50;retransmit_timeout=600,1200,2400,4800;up_thread=false):\
pbcast.STABLE(desired_avg_gossip=20000;down_thread=false;up_thread=false):\
UNICAST(down_thread=false;;timeout=600,1200,2400):\
FRAG(down_thread=false;frag_size=8192;up_thread=false):\
pbcast.GMS(join_retry_timeout=2000;join_timeout=5000;print_local_addr=true;shun=true):\
pbcast.STATE_TRANSFER(down_thread=true;up_thread=true)

The cluster has 12 nodes, and I had this situation occur when 3 of the nodes failed, which provoked the ops team into restarting 9 of them. The remaning 3 all went OOM quickly. Analysing the heap dump post-mortem, I see this:

org.jgroups.protocols.pbcast.NAKACK retained size=245MB

My first step is to add FD into the stack to adress the issue of failure detection not working properly in some cases. Then I would like to limit the size of the NAKACK structure (even if this means losing consistency accross the cluster): is this possible at all? What are your suggestions?

1. Re: configuration question: how to limit size of NAKACK stru

belaban Dec 1, 2006 7:14 AM (in response to bruyeron)

Yes, adding FD is the first step (see the discussion at http://wiki.jboss.org/wiki/Wiki.jsp?page=FDVersusFD_SOCK).
However, you probably also need flow control (FC) if you're sending loads of messages all the time, see
http://wiki.jboss.org/wiki/Wiki.jsp?page=JGroupsFC
Actions
2. Re: configuration question: how to limit size of NAKACK stru

bruyeron Dec 1, 2006 8:52 AM (in response to bruyeron)

ok for FD. However is FC appropriate for REPL_ASYNC (the doc suggests it's not ideal) ?
I think the #1 priority is to detect failure accurately - I am just wondering what options are left after adding FD if the problem persists.

Last question: is upgrading jgroups to 2.4 (instead of whatever was shipped with 1.4.0SP1) safe? is it recommended?

PS: it would be awesome if you could add the version to the jars names in the JC distribution :)
Actions
3. Re: configuration question: how to limit size of NAKACK stru

brian.stansberry Dec 1, 2006 2:03 PM (in response to bruyeron)

Re: FC where's the doc that says it's not recommended? It needs to be fixed.

FC is must with REPL_ASYNC, otherwise you run the risk of OOMs. It's not recommended with REPL_SYNC, where the synchronous replication adds a form of flow control. Particularly not recommended if up_thread="false" in all protocols in the stack, as that combination can lead to deadlock.

Upgrading 1.4.0.SP1 to 2.4 is fine and is recommended in the sense that 2.4 is newer and better, has issues fixed, etc.

There was a discussion on the forums or the jbosscache-dev or jboss-dev list a while back about adding the version number to jar names, and it got rejected. I don't recall why.
Actions

Go to original post