Hi Nikos, so are you saying that restarting Apache HTTPd was not enough and you had to restart all JBoss instances as well?
The operators restarted the failed Apache HTTPd several times during the day. What was strange was that, although one of the HTTPd servers was stuck, the other Apache was distributing requests to all 4 JBoss nodes without any problems.
In the end we had to restart the JBoss instances as well. We had to restart all the servers in the cluster: 4 JBoss instances and 2 Apaches. Just restarting the servers on the one physical server was not enough.
Hi Nikos, that's unfortunate. I haven't seen that myself yet. Restarting should usually help as long as the mod_cluster.sar part sends information to the Apache. Probably the best idea would be to switch to 1.1.1. I haven't checked the log closely, but some related critical issues have been fixed there, so please have a look: http://docs.jboss.org/mod_cluster/1.1.0/html/changelog.html#d0e3234