The search architecture has changed in many ways. There are new components, new topology and new features. The new architecture is to facilitate greater redundancy and to be more scalable.
Search is one of the big bets in SharePoint 2013. Search is new, search is different from all previous versions of SharePoint. The platform has been consolidated. It is a combination of FAST Search and SharePoint Search components. And also the good news is, it is the same from Foundation to Server. No more different flavors.
Note: be aware, that this article is mainly written based on SharePoint 2013 Technical Review and although the beta version has arrived, functionality can still change when the RTM is released.
The search architecture has changed in many ways. There are new components, new topology and new features. The new architecture is to facilitate greater redundancy and to be more scalable. The following picture displays the logical architecture and its components. In this post all components are explained briefly.
Crawl Component
The crawler is responsible for crawling the content. The crawler uses the connectors to retrieve data from the content sources, but is does not parse any text or documents. The result of crawling is both the actual content and the associated metadata. All crawled items are passed over to the next component, the Content Processing Component.Content Processing Component
This component processes crawled items and then feeds these items to the Index Component. So, this content processing component does actually parse the content by means of Format Handlers. It has automatic file format detection and it no longer relies on file extension. Out of the box there are high-performance format handlers for HTML, DOCX, PPTX, TXT, Image, XML and PDF formats. IFilters are still supported.Index Component
The Index Component is used in both feeding and query processes. On one hand it receives processed items from the Content Processing Component and writes those items to the index. On the other hand it receives queries from the Query Processing Component and provides results sets in return.The Index Component is also responsible for moving the indexed content when the topology changes by the Search Administration Component.
No comments:
Post a Comment