Knowledgebase & FAQ
RecFind 6 Noise Word configuration
Full text indexing systems generally have the ability to configure "noise words" (also known as "stop words"). This gives you the ability to exclude commonly used terms from your index and therefore avoid your index from becoming bloated. For example, in the English language, words such as "a," "and," "is," and "the" are in the English noise word file and are left out of the full-text index since they are considered useless to a search.
Default Noise Words
Reconfiguring your RecFind 6 Noise Words (for SQL Server 2005)
RecFind 6 utilizes the Microsoft SQL Server full text indexing capabilities and therefore the configuration of Noise Words is predominantly a SQL Server configuration task.
For SQL Server 2005, the noise word files are located in the
This directory is created, and the noise-word files are installed when you set up SQL Server with the Full-Text Search support. Noise-word files are simple text files that can be edited using a text editor (eg. Notepad).
To alter your noise word configuration:
Reconfiguring your RecFind 6 Noise Words (for SQL Server 2008)
RecFind 6 utilizes the Microsoft SQL Server full text indexing capabilities and therefore the configuration of Noise Words is predominantly a SQL Server configuration task. In SQL Server 2008 noise words are now called stop words and are maintained via a new feature called Stoplists.
Unlike SQL Server 2005 the stop words are stored within the database, and by default text indexes (including RecFind 6's) use the System Stoplist. The following procedures explains how to create your own Stoplist for RecFind 6, update the applicable RecFind 6 indexes to use this Stoplist and then how to reconfigure your Stoplist.
Note: Before commencing this process you will need to ensure that your RecFind 6 database is in SQL 2008 compatibility mode. To check your database, from SQL Server Management Studio view the Properties of your RecFind 6 database and on the Options page ensure that the compatibility level is set to "SQL Server 2008 (100)".
To create your own Stoplist:
To link the full text indexes to your new Stoplist:
Now you have your own custom Stoplist, to alter your noise words perform the following:
Special note about noise words
Please note that SQL Server still takes in consideration the position of the noise word in phrases, and therefore you can receive unexpected results. For example searching for "walk the dog" ("the" is a noise word) will locate records with the phrase "walk a dog", "walk 1 dog", "walk over dog", "walk their dog", etc because a noise word is located in the same position in each instance and therefore could be a result.
Noise Words @ Microsoft MSDN : http://msdn.microsoft.com/en-us/library/ms142551(SQL.90).aspx (SQL 2005) or http://msdn.microsoft.com/en-us/library/ms142551.aspx?ppud=4 (SQL 2008)