Microsoft Office Online

Scoping and Configuring the Content Index for the Site Directory in SharePoint Portal Server 2003
 

By Erik Heino, Microsoft Corporation

Introduction

In Microsoft Office SharePoint Portal Server 2003, content from sites other than the portal site is crawled and stored in the content index for non-portal site content. This content index, which also includes non-portal site content that is not in the site directory, is managed separately from the content index for portal site content.

All sites in the site directory are added to a content source for the site directory that is provided in SharePoint Portal Server 2003 by default. The content of sites in the site directory is crawled according to site inclusion and exclusion rules and is included in the content index for non-portal site content along with all other non-portal site content. Searches for content on sites other than the portal site are then performed using the information in the content index for non-portal site content.

By default, all sites in the site directory are included when the content source for the site directory is crawled, or when updates are performed on the entire content index for non-portal site content. If sites have not been added to the site directory, the content of those sites is not included when the content source for the site directory is crawled.

In large organizations with many sites in the site directory, both crawling and searching using the site directory content source can be complex and time-consuming. To simplify the management of sites in the site directory, you can create new content sources organized by content source groups. These content source groups can be used to crawl only some of the sites in the content index for non-portal site content, or to create search scopes so that users can search only sites in a particular group.

By carefully managing the content source groups that you use for your site directory, you can crawl thousands of sites using far fewer content sources. You can also aggregate content sources across servers in a shared services configuration to simplify crawling even further. This is a good practice for large organizations that manage search across many portal sites.
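As a rough illustration of why grouping helps, the sketch below collects a handful of sites into content source groups so that each group, rather than each site, becomes the unit that is crawled or scoped. It is a plain in-memory model, not SharePoint Portal Server code: the function `group_sites`, the `SITE_GROUPS` mapping, the URLs, and the group names are all hypothetical.

```python
from collections import defaultdict


def group_sites(site_groups):
    """Map each content source group name to the sorted site URLs assigned to it."""
    groups = defaultdict(list)
    for url, group in site_groups.items():
        groups[group].append(url)
    return {name: sorted(urls) for name, urls in groups.items()}


# Hypothetical site directory: site URL -> assigned content source group.
SITE_GROUPS = {
    "http://example/sites/hr-benefits": "HR",
    "http://example/sites/hr-recruiting": "HR",
    "http://example/sites/eng-specs": "Engineering",
    "http://example/sites/eng-builds": "Engineering",
    "http://example/sites/eng-test": "Engineering",
}

groups = group_sites(SITE_GROUPS)
for name, urls in sorted(groups.items()):
    # Each group stands in for one unit to crawl or to expose as a search scope.
    print(f"{name}: {len(urls)} sites")
```

In this toy model, five sites collapse into two groups; the same idea applies at larger scale, where the number of groups stays small while the number of sites grows.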

The process for scoping and configuring the content index for the site directory is as follows:

  1. Add sites to the site directory.
  2. Approve or reject sites for crawling.
  3. Configure a content source group.
  4. Assign content source groups to sites.
  5. Create search scopes for sites in the site directory.
  6. Crawl the site directory.
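
The six steps above can be pictured with a small in-memory model. This is only a sketch: `Site`, `SiteDirectory`, and every method name are hypothetical and are not part of any SharePoint Portal Server API. It also assumes that only approved sites are crawled and that a search scope is simply a named set of content source groups.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Site:
    url: str
    approved: bool = False          # step 2: approved or rejected for crawling
    group: Optional[str] = None     # step 4: assigned content source group


@dataclass
class SiteDirectory:
    sites: Dict[str, Site] = field(default_factory=dict)
    scopes: Dict[str, List[str]] = field(default_factory=dict)  # scope name -> group names

    def add_site(self, url: str) -> None:
        self.sites[url] = Site(url)

    def approve(self, url: str) -> None:
        self.sites[url].approved = True

    def reject(self, url: str) -> None:
        self.sites[url].approved = False

    def assign_group(self, url: str, group: str) -> None:
        self.sites[url].group = group

    def create_scope(self, name: str, groups: List[str]) -> None:
        self.scopes[name] = list(groups)

    def crawl(self, group: Optional[str] = None) -> List[str]:
        """Return the approved site URLs to crawl, optionally limited to one group."""
        return sorted(
            s.url for s in self.sites.values()
            if s.approved and (group is None or s.group == group)
        )

    def sites_in_scope(self, scope: str) -> List[str]:
        """Return the approved site URLs that a search scope would cover."""
        wanted = set(self.scopes[scope])
        return sorted(
            s.url for s in self.sites.values() if s.approved and s.group in wanted
        )


directory = SiteDirectory()
for url in ("http://example/sites/hr", "http://example/sites/eng", "http://example/sites/old"):
    directory.add_site(url)                                   # step 1
directory.approve("http://example/sites/hr")                  # step 2
directory.approve("http://example/sites/eng")
directory.reject("http://example/sites/old")
directory.assign_group("http://example/sites/hr", "HR")       # steps 3 and 4
directory.assign_group("http://example/sites/eng", "Engineering")
directory.assign_group("http://example/sites/old", "Engineering")
directory.create_scope("Engineering sites", ["Engineering"])  # step 5
print(directory.crawl())                                      # step 6: full crawl
print(directory.crawl("HR"))                                  # crawl one group only
print(directory.sites_in_scope("Engineering sites"))
```

The model keeps approval, group assignment, and scopes as separate pieces of state, mirroring how the steps are listed as separate tasks.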

This topic is part of an eight-topic series.
