Thank you so much for providing the webinars on findability.
My question relates to searchability when the index contains misspellings. In the real world, documents, indexed fields, and metadata often contain mistakes, and some correctly spelled words are OCR'd incorrectly (for example, a missing space that runs two words together). After crawling, the index accurately reflects what the crawler saw, but the user cannot find the content by searching on the correct spelling.
Even so, these documents need to be discoverable, and I was wondering whether SharePoint has any "smarts" built in to handle this. For example, if a handwritten document is hard to read, the indexer may well misread it and enter 'Mixan' instead of 'Mixon' when assigning a keyword to the document. The user, however, has no idea that the stored name is wrong and will search on 'Mixon'.
I have worked with a search provider whose algorithm measures the closeness of two words by counting how many pairs of consecutive letters they have in common (along with other factors). A sliding scale lets you say whether you want only exact matches or, say, 80% confidence. I believe this is called 'fuzzy searching'.
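To make the pair-counting idea concrete, here is a minimal Python sketch. It assumes a Dice-style score over consecutive letter pairs; the function names, the scoring formula, and the threshold handling are illustrative assumptions, not the provider's actual algorithm and not anything built into SharePoint.

```python
from collections import Counter


def letter_pairs(word):
    """Count each pair of consecutive letters in the word (case-insensitive)."""
    w = word.lower()
    return Counter(w[i:i + 2] for i in range(len(w) - 1))


def pair_similarity(a, b):
    """Score from 0.0 to 1.0: shared letter pairs relative to all pairs.

    Assumed formula (Dice coefficient): 2 * shared / (pairs in a + pairs in b).
    """
    pairs_a, pairs_b = letter_pairs(a), letter_pairs(b)
    total = sum(pairs_a.values()) + sum(pairs_b.values())
    if total == 0:
        # Neither word has a pair (both shorter than two letters),
        # so fall back to a plain equality check.
        return 1.0 if a.lower() == b.lower() else 0.0
    shared = sum((pairs_a & pairs_b).values())
    return 2.0 * shared / total


def fuzzy_match(query, keywords, threshold=0.8):
    """Return the keywords whose score against the query meets the threshold."""
    return [k for k in keywords if pair_similarity(query, k) >= threshold]


if __name__ == "__main__":
    indexed = ["Mixan", "Mixon", "Smith"]
    print(pair_similarity("Mixon", "Mixan"))
    print(fuzzy_match("Mixon", indexed, threshold=0.5))
```

Under this particular scoring, 'Mixon' and 'Mixan' share two of their four letter pairs and score 0.5, so an 80% setting would still miss the misspelled keyword. Short names would need a lower threshold or the additional factors mentioned above.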
I am a developer, with a background in imaging, starting to dig into SharePoint and I'm very interested in its capabilities. I know it's not supposed to replace a document management system, but in fact many customers expect it to work like one.
Thanks,
Maggie