Table of Contents
Table of Contents
When mnoGoSearch is used with ispell support all words are normalized. It allows to find different grammatic forms of the same words. During indexing all words are stored as is is the database. During the search all forms of the given keyword are selected and are taken into account. E.g. search frontend will try to find the word "test" if "testing" or "tests" is given in search query.
MnoGoSearch understands two types of ispell files: affixes and dictionaries. Ispell affixes file contains rules for words and has approximately the following format:
E | > -E, ION | As in create > creation |
Y | > -Y, ICATION | As in multiply > multiplication |
[^EY] | > EN | As in fall > fallen |
Ispell dictionary file contains words themselves and has the following format:
To make mnoGoSearch support ispell you must specify Affix and Spell commands in search.htm file. The format of commands:
The first parameter of both commands is two letters language abbrevation. The second is ispell files charset. The third one is filename. File names are relative to mnoGoSearch /etc directory. Absolute paths can be also specified.
Simultaneous loading of several languages is supported, e.g.:
It is possible that several rare words are found in your site which are not in ispell dictionaries. You may create the list of such words in plain text file with the following format (one word per line):
You may also use ispell flags in this file (for ispell flags refer to ISpell documentation). This will allow not to write the same word with different endings to the rare words file, for example "webmaster" and "webmasters". You may choose the word which have the same changing rules from existing ispell dictionary and just to copy flags from it. For example, English dictionary has this line:
So, webmaster with MS flags will be probably OK:
Then copy this file to /etc directory of mnoGoSearch and add this file by Spell command in ISpell tab of mnoGoSearch:
During next reindexing using of all documents new words will be considered as words with correct spelling. The only really incorrect words will remain.