We are trying to swap out JBM for HornetQ, and have it running in our system test
environment at the moment. HornetQ is in standalone, non-clustered mode with most
default configuration settings still in place.
The system is set up as a number of services, each with its own queue and a
consumer pool. Consumers process messages in JMS transactions
(session transacted). When a problem occurs during processing, the JMS
transaction still completes normally, but a copy of the message is sent back to
the same queue with the _HQ_SCHED_DELIVERY property set to a time in the
future, so that the same request is retried at a later stage. We control the
maximum number of scheduled redelivery attempts ourselves, and manually back
off the redelivery time.
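Each consumer does roughly the following (a simplified sketch: the retryCount
property, the backoff numbers, and process() are placeholders for our actual
logic; _HQ_SCHED_DELIVERY is HornetQ's scheduled-delivery header):

    import javax.jms.*;

    public class RetryingConsumer {
        private static final String SCHED_DELIVERY = "_HQ_SCHED_DELIVERY";
        private static final String RETRY_COUNT = "retryCount"; // our own application property
        private static final int MAX_ATTEMPTS = 5;              // illustrative limit

        // session is transacted: connection.createSession(true, Session.SESSION_TRANSACTED)
        public void handle(Session session, Queue queue, TextMessage msg) throws JMSException {
            try {
                process(msg); // application logic; may fail on the external resource
            } catch (Exception e) {
                int attempts = msg.propertyExists(RETRY_COUNT) ? msg.getIntProperty(RETRY_COUNT) : 0;
                if (attempts < MAX_ATTEMPTS) {
                    // Send a copy back to the same queue, scheduled for later delivery.
                    TextMessage copy = session.createTextMessage(msg.getText());
                    copy.setIntProperty(RETRY_COUNT, attempts + 1);
                    long backoffMs = (1L << attempts) * 10000L; // manual exponential backoff
                    copy.setLongProperty(SCHED_DELIVERY, System.currentTimeMillis() + backoffMs);
                    session.createProducer(queue).send(copy);
                } // else: we manually send the message to the DLQ (not shown)
            }
            session.commit(); // the JMS transaction completes normally either way
        }

        private void process(Message msg) throws Exception { /* ... */ }
    }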
We had a lot of these failures today due to an external resource that was
failing. This caused many such scheduled messages to pile up in the queue.
When the resource was restored, the scheduled messages started getting
processed as their scheduled times arrived. However, at the end there were
still 8 messages left in the "DeliveringCount". This does not change even if I
restart all the consumers. The state is currently:
What could cause the messages to stay in the "delivering" state, even if all
consumers are restarted?
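For completeness, this is roughly how we read those counters over JMX,
following the HornetQ management example (the JMX URL and queue name below are
placeholders for our environment):

    import javax.management.MBeanServerConnection;
    import javax.management.MBeanServerInvocationHandler;
    import javax.management.ObjectName;
    import javax.management.remote.JMXConnector;
    import javax.management.remote.JMXConnectorFactory;
    import javax.management.remote.JMXServiceURL;
    import org.hornetq.api.core.management.ObjectNameBuilder;
    import org.hornetq.api.jms.management.JMSQueueControl;

    public class QueueState {
        public static void main(String[] args) throws Exception {
            JMXConnector c = JMXConnectorFactory.connect(
                    new JMXServiceURL("service:jmx:rmi:///jndi/rmi://localhost:3000/jmxrmi"));
            MBeanServerConnection mbsc = c.getMBeanServerConnection();
            ObjectName name = ObjectNameBuilder.DEFAULT.getJMSQueueObjectName("ServiceQueue");
            JMSQueueControl queue = MBeanServerInvocationHandler.newProxyInstance(
                    mbsc, name, JMSQueueControl.class, false);
            System.out.println("MessageCount    = " + queue.getMessageCount());
            System.out.println("ScheduledCount  = " + queue.getScheduledCount());
            System.out.println("DeliveringCount = " + queue.getDeliveringCount());
            c.close();
        }
    }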
Some more background:
- Initially HornetQ was configured to BLOCK instead of PAGE when an address was
full. This caused the system to hang once the dead letter queue filled up, as
well as the service queue that had problems (because we manually send messages
to the DLQ once the redelivery attempts run out). In this blocked state, many
queues had high "DeliveringCount" values.
- I then changed the policy to PAGE, which loosened up the system, let the
messages move to the DLQ, and allowed new messages to be accepted into the
service queues (the relevant address-settings now look roughly like the
snippet after this list).
- While the system was processing these messages, I stopped HornetQ, increased
the max size (max-size-bytes) and page size (page-size-bytes), and started
HornetQ again. On startup there were some warnings like this:
WARNING [org.hornetq.core.paging.cursor.impl.PageSubscriptionImpl] Couldn't locate page transaction 42734785, ignoring message on position PagePositionImpl [pageNr=1, messageNr=14, recordID=0]
Maybe this page size increase could have caused some messages to be left stuck
in the "delivering" state?
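For reference, the paging-related address-settings in
hornetq-configuration.xml now look roughly like this (the match pattern and
sizes below are placeholders, not our exact values):

    <address-settings>
       <address-setting match="jms.queue.#">
          <address-full-policy>PAGE</address-full-policy>
          <max-size-bytes>104857600</max-size-bytes>   <!-- increased during the restart -->
          <page-size-bytes>10485760</page-size-bytes>  <!-- increased during the restart -->
       </address-setting>
    </address-settings>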