This content has been marked as final.
Show 4 replies
-
1. Re: Jopr Server takes more than 14 minutes to detect that th
pilhuhn Dec 4, 2009 9:05 AM (in response to carcara)14 minutes sound much too long. When you shut down the agent, the server should directly see this. Otherwise have a look at http://javablogs.com/Jump.action?id=534624 on how to decrease the availability check interval.
-
2. Re: Jopr Server takes more than 14 minutes to detect that th
mazz Dec 4, 2009 9:59 AM (in response to carcara)when I shutdown the agent the Jopr Server takes more than 14 minutes to detect the agent was down.
The issue you are seeing is not related to the availability checking performed on the agent - you are killing the agent entirely, so the agent is never reporting any availability data at all to the server.
To support cases like this (where the agent is completely down or unresponsive), periodically, the server needs to check to see what agents it hasn't heard from in a long time and then determine which of these "suspect" agents are really down.
Read this for background on this issue:
https://bugzilla.redhat.com/show_bug.cgi?id=RHQ-1098
That tells you why we increased the default time.
Read this for more and it talks about the new default time:
https://bugzilla.redhat.com/show_bug.cgi?id=RHQ-2349
"We have a quiet time of 15m right now (recently changed to that)."
What does this mean? It means, by default, if we have heard from an agent in 15 minutes (what we call the agent's "quiet time"), only then do we mark that agent and all of its resources down. This is why it takes more than 14 minutes to detect your agent was down.
If you do not like that, and you want it to report down faster, then, yes, you can change this - its configurable in the GUI... go to the main menu "Administration>SystemConfiguration>Settings" and change the setting "Agent Max Quiet Time Allowed" to something shorter. Note: the shorter your allowed quiet time interval is, the greater the possibility of a "false negative" - for example, if you set quiet time to 5 minutes and if your server can't process all your agent's availability reports fast enough, it may think it hasn't heard from an agent when in fact it just hasn't had time to process the latest avail report. When an agent is determined to be down, the server has to "backfill it" - marking all of its resources down - and this is expensive. So you don't want to do this often. -
3. Re: Jopr Server takes more than 14 minutes to detect that th
mazz Dec 4, 2009 10:09 AM (in response to carcara)This is a good question, I added it to the FAQ:
http://rhq-project.org/display/JOPR2/FAQ#FAQ-WhenIshutdowntheagent%2CtheRHQServertakesmorethan14minutestodetecttheagentwasdown.CanIconfigureittonottakesolong%3F -
4. Re: Jopr Server takes more than 14 minutes to detect that th
carcara Dec 7, 2009 6:41 AM (in response to carcara)Hi,
I am very grateful for the support, it has a good time he was having difficulty in solving this problem.
Thank you again!
Claudemir.