I have a solution with Search server 2008 Express up and running on a website. The content that are crawled/searced is a website with a menu to the left.
The problem is the search results. - Some results are irrelevant since the searched word is only present in the menu part of the result page, and not in the "content" part of the result page. - Is there any way one can tell the crawler that this type of content (the menu) should not get indexed, but off course crawled (to follow the links and crawl them.).
I have a solution myself, but if there was an easier way to set this up in the administration tool, I would prefer that!
My own solution to the problem would be:
1) When the user agent is the sharepoint crawler, I will replace the menuitems with new menuitems where the text of the menuitem is allways written as a dummy text like "MyDummyMenuItemForSharepoint".
2) When the crawler crawls through the menu, it will find the "MyDummyMenuItemForSharepoint" word very often, but this is irrelevant since you will never search for MyDummyMenuItemForSharepoint.
3) Irrelevant results will not show up in the search results.
Example:
You search for the word "tax" and get a set of results where "tax" is part of the page. The lowest ranked results also contain the word "tax", but it is in the menu part of the page, thus making it an irrelevant page, since the page does not cantain any information about "tax", except for a link to a page called "tax". I would like to exclude these pages.
Thanks in advance for any help!