Mikemccandless - blog.mikemccandless.com - Changing Bits
Latest News:
SuggestStopFilter carefully removes stop words for suggesters 14 Aug 2013 | 08:52 pm
Lucene now has a nice set of suggesters that use an analyzer to tokenize the suggestions: AnalyzingSuggester, FuzzySuggester and AnalyzingInfixSuggester. Using an analyzer is powerful because it lets ...
A new version of the Compact Language Detector 2 Aug 2013 | 10:27 pm
It's been almost two years since I originally factored out the fast and accurate Compact Language Detector from the Chromium project, and the effort was clearly worthwhile: the project is popular and o...
2X faster PhraseQuery with Lucene using C++ via JNI 23 Jun 2013 | 02:15 am
I recently described the new lucene-c-boost GitHub project, which provides amazing speedups (up to 7.8X faster) for common Lucene query types using specialized C++ implementations via JNI. The code w...
A new Lucene suggester based on infix matches 23 Jun 2013 | 01:37 am
Suggest, sometimes called auto-suggest, type-ahead search or auto-complete, has been an essential search feature ever since Google added it almost 5 years ago. Lucene has a number of implementations; I...
Screaming fast Lucene searches using C++ via JNI 19 Jun 2013 | 11:03 pm
At the end of the day, when Lucene executes a query, after the initial setup the true hot-spot is usually rather basic code that decodes sequential blocks of integer docIDs, term frequencies and posit...
Build your own finite state transducer 9 Jun 2013 | 08:01 pm
Have you always wanted your very own Lucene finite state transducer (FST) but you couldn't figure out how to use Lucene's crazy APIs? Then today is your lucky day! I just built a simple web applicati...
Dynamic faceting with Lucene 22 May 2013 | 03:23 am
Lucene's facet module has seen some great improvements recently: sizable (nearly 4X) speedups and new features like DrillSideways. The Jira issues search example showcases a number of facet features. ...
Eating dog food with Lucene 14 May 2013 | 01:35 am
Eating your own dog food is important in all walks of life: if you are a chef you should taste your own food; if you are a doctor you should treat yourself when you are sick; if you build houses for a...
Drill Sideways faceting with Lucene 24 Feb 2013 | 11:11 pm
Lucene's facet module, as I described previously, provides a powerful implementation of faceted search for Lucene. There's been a lot of progress recently, including awesome performance gains as measu...
Getting real-time field values in Lucene 26 Jan 2013 | 04:55 pm
We know Lucene's near-real-time search is very fast: you can easily refresh your searcher once per second, even at high indexing rates, so that any change to the index is available for searching or fa...