RHQ 4.9 bugs and quirks
genman Sep 13, 2013 4:36 PM20:13:09,518 WARN [com.arjuna.ats.arjuna] (Transaction Reaper Worker 28) ARJUNA012113: TransactionReaper::doCancellations worker Thread[Transaction Reaper Worker 28,5,main] miss ed interrupt when cancelling TX 0:ffff11b0d33f:-3da8ead:52324661:11297e7 -- exiting as zombie (zombie count decremented to 4)20:13:09,519 WARN [com.arjuna.ats.arjuna] (http-/0.0.0.0:7080-8) ARJUNA012077: Abort called on already aborted atomic action 0:ffff11b0d33f:-3da8ead:52324661:11297e7 20:13:09,519 ERROR [org.jboss.as.ejb3.invocation] (http-/0.0.0.0:7080-8) JBAS014134: EJB Invocation failed on component MeasurementOOBManagerBean for method public abstract org.r hq.core.domain.util.PageList org.rhq.enterprise.server.measurement.MeasurementOOBManagerLocal.getHighestNOOBsForResource(org.rhq.core.domain.auth.Subject,int,int): javax.ejb.EJBTransactionRolledbackException: Transaction rolled back
I'm seeing the above when clicking on the metric tab, and change the time (1h to 1d) for example, then the timeline renders wrong and the above shows up.
This is on Chrome, and does seem to happen on Safari.
Also, I also don't know if "Get Live" value is working or what it does anymore. It is useful so I hope it stays.
I also have gotten into a state where the storage node doesn't return any metrics at all. The components are up but no metrics are being returned.
Also, this hangs:
$ ./rhqctl stop --agent 20:21:45,447 INFO [org.jboss.modules] JBoss Modules version 1.2.0.CR1 Stopping RHQ Agent... RHQ Agent (pid=352) is stopping...
The agent seems to wait for a thread that's a scheduled executor:
"pool-3-thread-1" prio=10 tid=0x00007fe78c4c0800 nid=0x192 waiting on condition [0x00007fe788126000] java.lang.Thread.State: TIMED_WAITING (parking) at sun.misc.Unsafe.park(Native Method) - parking to wait for <0x00000000e1309b98> (a java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject) at java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:226) at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:2082) at java.util.concurrent.ScheduledThreadPoolExecutor$DelayedWorkQueue.take(ScheduledThreadPoolExecutor.java:1090) at java.util.concurrent.ScheduledThreadPoolExecutor$DelayedWorkQueue.take(ScheduledThreadPoolExecutor.java:807) at java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:1068) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1130) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:724)
Not sure where this is coming from. Maybe this?
modules/core/native-system/src/main/java/org/rhq/core/system/SigarAccessHandler.java SigarAccessHandler(SigarFactory sigarFactory) { this.sigarFactory = sigarFactory; sharedSigarLock = new ReentrantLock(); localSigarLock = new ReentrantLock(); scheduledExecutorService = Executors.newSingleThreadScheduledExecutor(); scheduledExecutorService.scheduleWithFixedDelay(new ThresholdChecker(), 1, 5, MINUTES); localSigarInstancesCount = 0; }