12 Replies Latest reply on Jun 22, 2004 10:29 AM by idumali

Is anyone using TreeCache in production?

geoharp Mar 3, 2004 3:42 PM

My basic question is how ready is tree cache?
I would like to use it with hibernate (deployed as Mbean), but I am wondering is it to optimistic to try this (beta-2)?

1. Re: Is anyone using TreeCache in production?

ben.wang Mar 3, 2004 6:32 PM (in response to geoharp)

To answer part of your question. We are scheduled to have a 1.0 release in middle of this month. Bugs fixes and new features such as eviction policy and aop object graphs handling. On the aop side, it will also use the latest JBossAop that Bill has been working on.

I will be interested to hear some user experiences in this forum as well. :-)

-Ben
Actions
2. Re: Is anyone using TreeCache in production?

geoharp Mar 4, 2004 9:06 AM (in response to geoharp)

Thanks for the info on releases ill wait until then to implement and will post experience.

Thanks again,
gharp
Actions
3. Re: Is anyone using TreeCache in production?

jason.greene Mar 4, 2004 2:59 PM (in response to geoharp)

Excellent, we are just starting to use TreeCache this week (as a second level cache for hibernate). We are still in early development phases, and we were hoping it would be production ready by the time we hit rollout (several months from now).

Keep up the good work!

-Jason
Actions
4. Re: Is anyone using TreeCache in production?

idumali Jun 16, 2004 1:55 PM (in response to geoharp)

We (www.digijava.org) are trying to use JbossCache under Hibernate on production. Only ASYNC replication gave reasonable performance.

Number of caches objects that we are testing on is about hundred thousand objects

We are having real serious problems with the existing LRU implementation. LRU parameter wakeUpIntervalSeconds set to anything higher than 5secs gives inaccaptable performance.

We get nice results with wakeUpIntervalSeconds=1sec but I am not, really, very happy with some process starting up every second.

Profiler shows that the bottleneck is put() in edu.oswego.cs.dl.util.concurrent.BoundedLinkedQuee

"Something is rotten in the state of Denmark..."
Actions
5. Re: Is anyone using TreeCache in production?

ben.wang Jun 17, 2004 11:28 AM (in response to geoharp)

Regarding to performance, SYNC mode has to be slower than ASYNC since it is blocking. But if you have read-mostly data, then it can be acceptable. Otherwise, use ASYNC if you can.

For LRU, basically there is a TimerTask thread that wake up every x seconds to check for the node event queue. Do you evict the nodes often? Otherwise, I am a bit puzzled why it is slow.

If you can provide more information, I will be glad to look into that.

Thanks,

-Ben
Actions
6. Re: Is anyone using TreeCache in production?

idumali Jun 17, 2004 12:00 PM (in response to geoharp)

hi Ben,

thanks for replying.

here's a graph of LRU-enabled test benchmark. The only thing it does is makes put()s in JBossCache. Test is a standalone app, no interference with anything, no Hibernate or whatever. This snapshot is with wakeUpTime=5sec

http://www.powerdot.org/jbosscache-benchmark.gif

We are doing the same with different wakeUpTimes and with more fine-grained dots (on this picture too many dots were taken in the begining).

I will post results (graphs, code, profiler results) as soon as we have them (in several more hours). From the first glance - LRU queue usage has some serious performance/scalability problems

thanks
Actions
7. JBossCache with LRU benchmark

idumali Jun 18, 2004 3:17 PM (in response to geoharp)

OK,

here are the final results.

We did series of tests on the following wakeUpIntervalSeconds: 1,3,5,10,15. The default one indicated in the sample code on your site was 5sec. In each series, several series of puts() were performed: from 500 to 50,000 with step 1,000 (10 points for each series of wakeUpInterval).

Following is the source that performs the whole cycle:
http://www.powerdot.org/jbosscache-benchmark/JBossLRUTest.java
And sample configuration for 5 seconds:
http://www.powerdot.org/jbosscache-benchmark/treecache-5.xml

You can, also, download the complete package with all the config files, JARs, Ant build script and run.bat from here:
http://www.powerdot.org/jbosscache-benchmark/LRUTest.zip

The memory settings used in the test were the same as in run.bat:
-Xms 512M -Xmx 800M

The machine used was single-processor Intel Pentium 4 2.6GHz with Hyperthreading enabled.

Now the results.

Following is all the series together:
http://www.powerdot.org/jbosscache-benchmark/images/benchmark-all.gif

As you can see not only time of put() is proportional of the number of objects (inneficient algorithm) but the coefficient of linear dependence increases drastically with the increase of wakeUpInterval. If you want to get a feeling of the "speed" you can, also, see the proportionally scaled graph of the same results:
http://www.powerdot.org/jbosscache-benchmark/images/benchmark-all-propscale.gif

As you can see - at 15 secs it is almost vertical!

Please, also note the brown line representing the same test with LRU turned off - it is almost horizontal. Without LRU the cache performance, per se, is not bad - that if you can imagine a cache running without LRU in production, when JVM has hard limit of ~2GB on available memory :)

Following is just the LRU-turned-off test:
http://www.powerdot.org/jbosscache-benchmark/images/benchmark-lruOff.gif

Then we did profiling to get some feeling of what is going wrong.

Following are overall and zoomed-in snapshots:
http://www.powerdot.org/jbosscache-benchmark/images/profiling.gif
Diving into one of the slow branches:
http://www.powerdot.org/jbosscache-benchmark/images/profiling-detail.gif

These are just first-glance conclusions so may not be 100% the reson but this is what we think is wrong:

1. org.jboss.cache.Fqn class uses non-static logger, which means that its instance is created every time someone calls new Fqn() or clone().

2. The toString() method in org.jboss.cache.Fqn is VERY slow (no surprise there: StringBuffer and other slow stuff is used). Odd, for us, is that org.jboss.cache.eviction.LRUPolicy uses code like Region region = regionManager_.getRegion(fqn.toString()); while Fqn can be used as a key in java.util.Map (?)
Actions
8. Re: Is anyone using TreeCache in production?

ben.wang Jun 18, 2004 4:38 PM (in response to geoharp)

Hi,

Thank you very much for all efforts. This is valuable information for me to troubleshoot. Just one quick question. What is the log level for org.jboss.cache? "DEBUG" or "INFO"? Default shipped with the package is "DEBUG" and that may have signicant impact on the overall performance.

Cheers,

-Ben
Actions
9. Re: Is anyone using TreeCache in production?

mikheil Jun 18, 2004 4:58 PM (in response to geoharp)

Hi Ben,

In the tests above the log level was INFO. You can find the log4j.properties in the zip file that irakli has posted, under src directory.

Mikheil
Actions
10. Re: Is anyone using TreeCache in production?

ben.wang Jun 18, 2004 6:16 PM (in response to geoharp)

Mikheil,

Thanks for the info. I will take a look.

-Ben
Actions
11. Re: Is anyone using TreeCache in production?

ben.wang Jun 21, 2004 2:07 AM (in response to geoharp)

OK. I have fixed the performance bottleneck problem in LRUPolicy. I replaced the eviction queue from BoundLinkedQueue to BoundedBuffer. I have also increased the initial capacity of the queue. Eventually, we will externalize the initial capacity in the next full release.

I have run the example that you guys setup, now the worst case (25 seconds) is about twice as slow as the one without eviction policy turned on. So that's more reasonable. :-)

I will upload the patch JBoss1_02 to the jboss site tomorrow and then announce it here.

Thanks everyone for the help.

-Ben
Actions
12. Re: Is anyone using TreeCache in production?

idumali Jun 22, 2004 10:29 AM (in response to geoharp)

Wooow. Very impressive performance improvement.

Thanks a lot, Ben!
Actions

Go to original post