Summary
An information retrieval (IR) system (IRs) (search engine) is said to be efficient, to the degree that always evaluates each object in the information base (database, document base, web,...) like the expert. The ability of IRs's is to retrieve mostly all relevant objects (measured by the recall), and only the (most) relevant objects (measured by the precision) from the collection queried. Recall and precision measures provide the classical measure of the retrieval efficiency. They measure the degree to which the query answer (the set of documents that retrieved by IRs as response to the user query). Where, the query answer is the set of relevant documents in the information based queried. Retrieving most relevant documents to the user query in IRs was one of the most important methods of World Wide Web (WWW) search engines used in the world now. So the searchers aim to use genetic programming (GP) and fuzzy optimization to optimize the user search query in the Boolean IRs model and in the fuzzy IRs model; and to use more Boolean operators (AND, OR, XOR, OF, and NOT) instead of using the standard operators (AND, OR, and NOT), and to use weights for terms and for Boolean operators. Weights are used to give the users more relaxation in defining how much the importance of the terms and of the Boolean operators is. The terms and the Boolean operators' weights are used in fuzzy IRs model. In addition, it investigates extensions of the classical measurement of effectiveness in IRs, precision; recall and harmonic mean. The researchers use harmonic mean measure as an objective function which uses both measures precision and recall at once for evaluating the results of the two IRs models to grow up the precision-recall relationship curve.
See the full content of this document
Extract
Grow Up Precision Recall Relationship Curve in Ir System Using Gp and Fuzzy Optimization in Optimizing the User Query
1. Introduction
In the principle of the IRs, suppose there is a collection of documents stored and a user formulates a query "search query". The user needs to retrieve a set of documents that satisfy the information needed his/her query from a collection of documents. In such a huge and unstable information collection, today's difficult problem is to find the most relevant information to search queries. The growth in the number of documents in the WWW made it necessary to use the best knowledge and methods for retrieving documents.2. Information RetrievalIR can be performed as one of its definitions "the processes involved in representation, storage, searching and finding information which is relevant to a requirement for information desired by the user". The goal of IR is to lead the user to those documents that will best enable the user to satisfy his/her need for information [I].Most of the IRs are based on the Boolean queries and on the collection of documents. The documents are usually composed of ...See the full content of this document
Sponsored links
