12 Replies Latest reply on Jul 25, 2006 9:06 AM by juan_pane

    Searching CMS stored content...

    billgloff

      Currently evaluating JBoss Portal 2.2. If we had JBoss Portal running with several CMS-based portlets, would it be possible to offer some sort of search functionality, perhaps using something like Lucene as the indexer? Maybe a CMS event listener that indexes the content as it is being uploaded? Any thoughts on this, or best practices for offering search in a portal environment?

      Thanks in advance,
      Bill

        • 1. Re: Searching CMS stored content...

          The Lucene index is already built, but there is currently no search capability.

          We are exploring adding global search functionality for 2.6.

          • 2. Re: Searching CMS stored content...
            billgloff

            Would it be terribly hard to get something working ourselves in 2.2/2.4, since we probably cannot wait for the 2.6 timeline? Or is it way too much integration to even consider?

            I'm guessing we would then have to live with indexing everything but the CMS-controlled content, right?

            Thanks,
            Bill

            • 3. Re: Searching CMS stored content...
              suniil

              I'm interested in this as well. If you can give some hints on where to start, I'll give it a try myself.

              • 4. Re: Searching CMS stored content...
                theute

                Guys, remember that the CMS is based on Jackrabbit, so you can use Jackrabbit's indexing system (which uses Lucene).

                Then you need to create a portlet that retrieves the content using Lucene, and you have a "Search My CMS" portlet.

                • 5. Re: Searching CMS stored content...

                  The index is stored under:

                  C:\jboss-4.0.3SP1\server\default\data\portal\cms\conf\workspaces\default\index


                  The configuration for the index is under:
                  C:\projects\jboss-portal-2.4\core\src\resources\portal-cms-sar\META-INF\jboss-service.xml


                  It may be commented out - not sure why yet.

                  You may want to read this article, as it talks a bit about XPath and search: http://www.artima.com/lejava/articles/contentrepository3.html

                  I have not looked too deeply into the search functionality in Jackrabbit, so this is all I have to go on right now.
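
To make the XPath search idea a little more concrete, here is a minimal sketch that only builds the query string for a full-text search over the repository. The class and method names (CmsFullTextQuery, anywhere, quote) are made up for illustration and are not part of JBoss Portal or Jackrabbit. The sketch assumes the standard jcr:contains function from the JCR 1.0 XPath syntax, and it only escapes the surrounding string literal; the search expression inside jcr:contains has its own syntax that user input may also need to respect.

```java
import java.util.Objects;

/**
 * Sketch only: builds a JCR 1.0 XPath full-text query string.
 * The class and method names are hypothetical.
 */
final class CmsFullTextQuery {

    private CmsFullTextQuery() {}

    /** Wraps text in a single-quoted XPath string literal. */
    static String quote(String text) {
        // XPath string literals escape an apostrophe by doubling it.
        return "'" + text.replace("'", "''") + "'";
    }

    /** Query matching any node with a full-text hit on the given terms. */
    static String anywhere(String terms) {
        Objects.requireNonNull(terms, "terms");
        return "//*[jcr:contains(., " + quote(terms.trim()) + ")]";
    }

    public static void main(String[] args) {
        // prints: //*[jcr:contains(., 'portal search')]
        System.out.println(anywhere("portal search"));

        // With the JCR API on the classpath and an open Session, the
        // statement would be executed roughly like this:
        //   Query q = session.getWorkspace().getQueryManager()
        //                    .createQuery(statement, Query.XPATH);
        //   NodeIterator hits = q.execute().getNodes();
    }
}
```

How a search portlet would obtain a JCR Session from the portal's CMS service is not covered here and would need to be checked against the portal sources.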

                  • 6. Re: Searching CMS stored content...
                    glodo

                    Hi everyone! I'm interested in search for JBoss Portal too, and I'm about to start developing it.
                    It would be great if you provided this functionality.
                    Can you point me to more guides or other articles to help me understand it, or give any advice for the development?
                    Thanks so much. Any advice would be welcome.

                    • 7. Re: Searching CMS stored content...
                      glodo

                      I'll try to keep it as simple as possible: search only by the title and description of a particular file. When an HTML file is created or a file (for example a .pdf) is uploaded to the CMS, a title and description are assigned to it. I need to get the title and description of the files stored in the CMS. How do I do that? Can I get the attributes from a BLOB stored in the DB?
                      Thanks for the advice!

                        • 9. Re: Searching CMS stored content...
                          glodo

                          Hi! I am trying to develop my own search for the portal. One question: where are the title, description and language attributes stored for a file created in the CMS Manager (such as an HTML file) or uploaded (a PDF, for example)?
                          Thanks for the information,
                          glodo

                          • 10. Re: Searching CMS stored content...
                            theute

                            Hi glodo, you shouldn't look directly in the storage location; you should use the indexing mechanism that is already in place.
                            Look at:
                            http://jackrabbit.apache.org/doc/config.html
                            (The SearchIndex configuration)

                            You will probably need to use Lucene to retrieve content: http://lucene.apache.org/

                            You can also look at the very last chapter of the spec, but 1) I am not sure it does what you want, and 2) I am not sure that Jackrabbit implements it.
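
For the narrower case of searching only by title and description through the index rather than the raw store, a query can be scoped to individual properties. The sketch below builds such a JCR 1.0 XPath statement. The property names portalcms:title and portalcms:description are assumptions and should be checked against the node type definitions the portal CMS actually registers; the class and method names are hypothetical as well.

```java
import java.util.ArrayList;
import java.util.List;

/**
 * Sketch only: builds an XPath query restricted to named properties.
 * All names here are illustrative, not taken from JBoss Portal.
 */
final class CmsPropertyQuery {

    // ASSUMPTION: placeholder property names; verify against the CMS node types.
    static final String TITLE = "portalcms:title";
    static final String DESCRIPTION = "portalcms:description";

    private CmsPropertyQuery() {}

    /** Wraps text in a single-quoted XPath literal, doubling apostrophes. */
    static String quote(String text) {
        return "'" + text.replace("'", "''") + "'";
    }

    /** Matches nodes where at least one listed property contains the terms. */
    static String containsInAny(String terms, String... properties) {
        if (properties.length == 0) {
            throw new IllegalArgumentException("at least one property is required");
        }
        List<String> clauses = new ArrayList<>();
        for (String property : properties) {
            // jcr:contains(@prop, 'terms') limits the full-text match to one property.
            clauses.add("jcr:contains(@" + property + ", " + quote(terms) + ")");
        }
        return "//*[" + String.join(" or ", clauses) + "]";
    }

    public static void main(String[] args) {
        System.out.println(containsInAny("annual report", TITLE, DESCRIPTION));
    }
}
```

The resulting string would be passed to QueryManager.createQuery with Query.XPATH as the language. Whether the properties are included in the full-text index depends on the SearchIndex configuration mentioned above.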

                            • 11. Re: Searching CMS stored content...
                              glodo

                              Thanks, Thomas, for the information and advice. I am now learning about Lucene and trying to develop my own search. I hope that it works... :o)

                              • 12. Re: Searching CMS stored content...
                                juan_pane

                                Has anyone resolved this issue? I'm new to all this technology and I need some advice here...

                                Many Thanks