Summary
As the volume and variety of information sources, especially on the World Wide Web (WWW), continue to grow, the requirements imposed on search applications are steadily increasing. The amount of available data is growing and so do the user demands. Search application should provide the users with accurate, sensible responses to their requests. It is difficult to provide information that accurately matches user information needs. Search effectiveness can be seen as the accuracy of matching user information needs against the retrieved information. There are problems emerging: users often do not present search queries in the form that optimally represents their information need, the measure of a document's relevance is often highly subjective between different users, and information sources might contain heterogeneous documents, in multiple formats and the representation of documents is not unified. This contribution presents a proposal to improve web search effectiveness via evolutionary optimization of the Boolean and vector search queries based on individual user models.
See the full content of this document
Extract
Evolutionary Improving World Wide Web Queries
1. Introduction
The emergence of the World Wide Web has resulted in access to a vast and exponentially growing, unstructured and dynamic network of information sources. It is timely and necessary to build new tools aimed at helping users retrieve documents that satisfy their information needs accurately and efficiently In the heart of every Internet search engine, such as Google, Yahoo, Altavista or others is an information retrieval (IR) system deploying certain IR technology to search the web for desired information.Current search methods retrieve documents from the documentary database in response to a user's query, which is compared (typically at keyword level) against documents with the goal to find those most likely to be related to the query according to the user's assessment of the retrieved documents. Users then have to scan through hundreds of retrieved documents to identify information that is really relevant to their needs. This is a time-consuming task. Furthermore, the same process of identifying relevant documents is repeated each time the user searches the same or similar information sources.An advanced IR system should be able to obtain from an information source only those documents that are relevant to a user's information needs, while at the same time excluding documents that are non-relevant. Contemporary general search application...See the full content of this document
Sponsored links
