6 Replies Latest reply on May 9, 2008 7:16 AM by fungrim

A few thoughts; Region based buddy backup?

fungrim May 6, 2008 5:50 AM

Hi, I thought I'd offer an idea that struck me this morning regarding buddy backup. Consider the following start of a tree:

/root/
 /A
 /B
 /C

In our scenario, changes to nodes A-C (including optional sub-trees) may be performed concurrently, one thread per node: the system guarantees that although one thread may change A while another changes B, no two threads will ever try to access the same node at the same time.

However, given a number of "node executing" threads for concurrency, buddy replication becomes somewhat of a bottleneck, as A-C will all be backed up on the same remote node. For example, using TCP/UNICAST in the configuration will force all threads involved to lock on the same monitors during replication, as all backups are located on the same remote instance.

This led me to suspect (and this is where you can tell me why I'm dead wrong, of course) that some sort of "region based buddy backup" would yield drastically improved performance for this kind of scenario, as it would enable concurrent transport for all three nodes. In other words, given remote nodes N1, N2, and N3, you could configure the cache as follows...

/root/A (with subtree) - to be buddy backed up at N1
/root/B (with subtree) - to be buddy backed up at N2
/root/C (with subtree) - to be buddy backed up at N3

... thread T1 could concurrently access and replicate A without interfering with thread T2 which is accessing B as 1) they're on different sub-trees in the cache (thus in-VM locking shouldn't be a problem); and 2) replication can be done concurrently (FIFO ordering etc).
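To make the idea concrete, here is a minimal sketch in plain Java of what such a region-to-buddy routing could look like. It is not JBoss Cache API: the class `RegionBuddyRouter`, its methods, and the use of one single-threaded executor per buddy are all assumptions made for illustration. Each buddy gets its own send lane, so replication for /root/A and /root/B can proceed in parallel while FIFO order is kept per buddy.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/** Hypothetical sketch: routes each cache region to its own buddy and send lane. */
public class RegionBuddyRouter {
    // Region prefix -> buddy name, e.g. "/root/A" -> "N1" (illustrative values only).
    private final Map<String, String> regionToBuddy = new LinkedHashMap<>();
    // One single-threaded sender per buddy: FIFO per buddy, parallel across buddies.
    private final Map<String, ExecutorService> senders = new ConcurrentHashMap<>();

    /** Assigns a region to a buddy. Call this during setup, before replicating. */
    public void assign(String regionPrefix, String buddy) {
        regionToBuddy.put(regionPrefix, buddy);
        senders.computeIfAbsent(buddy, b -> Executors.newSingleThreadExecutor());
    }

    /** Returns the buddy of the first assigned region containing the path, or null. */
    public String buddyFor(String path) {
        for (Map.Entry<String, String> e : regionToBuddy.entrySet()) {
            String prefix = e.getKey();
            if (path.equals(prefix) || path.startsWith(prefix + "/")) {
                return e.getValue();
            }
        }
        return null;
    }

    /** Queues a replication task on the lane of the buddy that owns the path. */
    public Future<?> replicate(String path, Runnable send) {
        String buddy = buddyFor(path);
        if (buddy == null) {
            throw new IllegalArgumentException("No buddy assigned for " + path);
        }
        return senders.get(buddy).submit(send);
    }

    /** Lets queued tasks finish, then stops the sender threads. */
    public void shutdown() {
        senders.values().forEach(ExecutorService::shutdown);
    }
}
```

With `assign("/root/A", "N1")` and `assign("/root/B", "N2")`, a call to `replicate("/root/A/x", task)` only ever queues behind other work bound for N1. Nested regions would need longest-prefix matching, which this sketch leaves out.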

Am I making any sense here?

Cheers
/Lars J. Nilsson
www.cubeia.com

1. Re: A few thoughts; Region based buddy backup?

manik May 9, 2008 6:24 AM (in response to fungrim)

The only bottleneck I can think of is on the transport layer (JGroups) when replicating to the same buddy node. Once on the buddy node, since you are talking about disjoint subtrees, there won't be any contention. Even this replication contention can be minimised if you are using async replication.

At the moment BR backs up the entire state of one node onto another node. In future we will (see Partitioning) allow different regions to be backed up on different nodes.
2. Re: A few thoughts; Region based buddy backup?

fungrim May 9, 2008 6:38 AM (in response to fungrim)

"manik.surtani@jboss.com" wrote:
The only bottleneck I can think of is on the transport layer (JGroups) when replicating to the same buddy node.

That's correct, and we're indeed seeing contention in the JGroups layer. As we're load testing at a couple of thousand accesses/replications per second, it becomes quite visible (and understandably so).

"manik.surtani@jboss.com" wrote:
In future we will (see Partitioning, http://wiki.jboss.org/wiki/JBossCachePartitioning) allow different regions to be backed up on different nodes.

Ah, I didn't know. Thanks, I'll have a look at it over the weekend.
3. Re: A few thoughts; Region based buddy backup?

manik May 9, 2008 6:41 AM (in response to fungrim)

What version of JGroups are you using?
4. Re: A few thoughts; Region based buddy backup?

fungrim May 9, 2008 6:57 AM (in response to fungrim)

JGroups 2.6.2 with TCP transport (we had NAK/ACK issues with UDP).
5. Re: A few thoughts; Region based buddy backup?

manik May 9, 2008 7:03 AM (in response to fungrim)

I'm guessing then that you are using JGroups' concurrent stack. Not that that will help you very much in the scenario you painted though, since the concurrent stack only parallelizes messages from different senders, not from the same one.
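The per-sender behaviour described above can be illustrated with a small plain-Java sketch. This is not JGroups code: `PerSenderDispatcher` and its methods are hypothetical names, and the use of one single-threaded executor per sender is an assumption chosen only to show the ordering property. Messages from the same sender are handled one after another, while messages from different senders may be handled in parallel.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/** Hypothetical sketch of "parallel across senders, serial per sender" delivery. */
public class PerSenderDispatcher {
    // One lane (single thread) per sender; the sender name is the key.
    private final Map<String, ExecutorService> lanes = new ConcurrentHashMap<>();

    /** Queues a message handler on the lane belonging to its sender. */
    public Future<?> deliver(String sender, Runnable handler) {
        return lanes
                .computeIfAbsent(sender, s -> Executors.newSingleThreadExecutor())
                .submit(handler);
    }

    /** Number of senders seen so far, i.e. the maximum possible parallelism. */
    public int laneCount() {
        return lanes.size();
    }

    /** Lets queued handlers finish, then stops the lane threads. */
    public void shutdown() {
        lanes.values().forEach(ExecutorService::shutdown);
    }
}
```

In the scenario from this thread all replication traffic arrives at the buddy from a single sender, so a model like this ends up with exactly one lane no matter how many threads produced the changes.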

I've actually started a discussion with the JGroups devs on how we can use the concurrent stack to parallelize work from the same sender.

See: http://lists.jboss.org/pipermail/jbosscache-dev/2008-May/002239.html
6. Re: A few thoughts; Region based buddy backup?

fungrim May 9, 2008 7:16 AM (in response to fungrim)

That's correct. Also, the lock in question, which prompted this post, was on sending (as we're using REPL_ASYNC), and we're going down the stack using one thread.
