The list of stopwords is essential if you’re thinking of using MySQL’s FULLTEXT index type, and it hasn’t been mentioned in other places I’ve seen the fulltext index discussed.
I would heavily dissuade you from letting MySQL do full-text searching. It is currently very slow and you will take a big hit in system resources. Instead, I would suggest you consider some other alternatives, like Sphinx or Lucene. I can’t say much for Lucene, as I haven’t used it personally, but we used Sphinx originally for full-text searching, which it can do within a fraction of a second for even complex searches. However, since the most recent version, it also supports adding filters on other fields, which means that you can with a bit of effort effectively discard sending complex queries to MySQL on large datasets, instead letting Sphinx do the heavy work and passing the id’s to MySQL to get the resulting records… and all very very very fast.
Written by
Hans
on
November 07, 2008 at
4:04pm
If you enjoy the content, consider subscribing to the feed(s).
Jump to comments