-
1. Re: Design discussion: Free Text Search
eric.wittmann Mar 12, 2015 2:59 PM (in response to brmeyer)When searching against content you'll do a full text search of some kind? Lucene or something?
Wondering how the wildcards translate to a full text search ending.
-
2. Re: Design discussion: Free Text Search
brmeyer Mar 12, 2015 3:15 PM (in response to eric.wittmann)When searching against content you'll do a full text search of some kind? Lucene or something?
We'd currently rely on ModeShape 4.0's full-text support (they provide a 'CONTAINS' function). From what I understand, that relies on internal indexing (MS 4 indexing is a lot different than 3) and Tika text extractors. However, they're currently working on additional index providers, Lucene being the first. An alternative could be using MS 'CONTAINS' for the metadata, but an external index provider the the content (which would be possible, since we use filesystem file storage). But, I'll cross that bridge when MS performance becomes an issue.
Wondering how the wildcards translate to a full text search ending.
https://docs.jboss.org/author/display/MODE/Full+text+search
JCR-SQL2 - ModeShape 3 - Project Documentation Editor
The full-text search 'CONTAINS' method supports wildcards, etc.
-
3. Re: Design discussion: Free Text Search
eric.wittmann Mar 13, 2015 7:50 AM (in response to brmeyer)Ah perfect - the state of the art for full text indexing has apparently progressed since the last time I really dove into it (*cough* 2001 *cough*).
-
4. Re: Design discussion: Free Text Search
brmeyer Mar 13, 2015 12:28 PM (in response to brmeyer)Actually, I might suggest simplifying this. ModeShape only supports full-text searching in general -- it's an all or nothing approach. IE, there's not a way to search metadata-only or content-only. Further, I may be overthinking it. Most users would probably need the original queries + full text search on the whole shebang. So, reduce the requirements to:
- xp2:matches(@FooProp, '.*foo.*')
- xp2:matches(*, '.*foo.*')
Thoughts?
-
5. Re: Design discussion: Free Text Search
eric.wittmann Mar 13, 2015 1:08 PM (in response to brmeyer)This makes sense to me. Keep it simple, stupid.
However if you're going with just these two options, then my suggestion would be to use . instead of * for #2:
2. xp2:matches(., 'foo.*')
That's more consistent with xpath semantics I think.
-
6. Re: Design discussion: Free Text Search
brmeyer Mar 13, 2015 3:29 PM (in response to eric.wittmann)That's more consistent with xpath semantics I think.
Fair point, will do. Thanks!