Chapter Five:




This week we are going to explore Power Search Techniques. There are many excellent guides describing Boolean logic and charts showing the features available at the "major" search engines, which you should consult for more details (please see the section For More Information). Here, our purpose is a brief overview of possible techniques. Please note that while most of the major search engines we have been using allow some advanced search capabilities, they appear or are implemented differently. 

And of course, we'll be going on another Info Quest.

 


Why "Power Search?"

We've all had the experience where we query a search engine and come back with thousands, maybe hundreds of thousands of "hits" or "matches" for our search terms. Unfortunately, it is time-consuming to sift through the web sites, and usually not very profitable. When you obtain a large number of "hits" from a search, this is known as high recall. While this might be the goal in some cases (for example, if you are working on a topic that is relatively new and you want everything published on it -- and the number of "hits" is only a couple of hundred), for the most part when people are searching the web, they are interested in high precision. High precision means that the retrieved documents are highly relevant to your subject, and is achieved by fine tuning your search to accurately describe your topic with its unique aspects. 

Many advanced web searching techniques are old friends of folks used to searching more traditional databases, such as those containing bibliographic citations or references to journal articles. Some techniques are unique to the web because of its media and structure. 

Important Note:
Not all advanced techniques are enabled by all search engines! Consult the Help documentation of the search facility you are using.

For the most part, search engines at directory sites do not offer advanced searching features.

 


What is Boolean Searching?

Boolean searching is an implementation of Boolean logic and set theory. Boolean operators, such as AND, OR and NOT, are used to combine search sets in a variety of ways and appear within Internet search engines in a range of disguises. A very brief overview
Search phrase: cats and dogs 
means find web pages in which both terms occur
Search phrase: cats or dogs 
means find web pages in which either term occurs
Search phrase: cats not dogs 
means find web pages in which the term cat appears but not dog

 Most web search engines have the capability to implement these basic Boolean operators but may present them in a different way. You will almost always need to go to an "Advanced" search function to use true Boolean operators; however, you may be able to search using implied Boolean using the symbols + (must include) or -(exclude) from the "Basic" search interface. 

Examples of usage:

AND
Use this operator to search for documents where you'd like both terms to appear, narrowing a search.

Dalmatians AND feeding 

OR
Use this operator to include synonyms, particularly where there are several terms or names used for a topic, or you would like to broaden a search.

Dalmatians OR spotted dogs 

NOT
Use this operator to exclude terms, particularly when your search terms have more than one meaning
Blues NOT depression 

Special Note: these Boolean operators are often presented as options like "include all the words," (AND operator) "include any of the words," (OR operator) and "exclude" (NOT operator).
 

 Another special note:
While you might expect that search engines default to an implied AND (which means if you enter 2 search terms it returns documents in which they BOTH occur) in fact this is not always the case -- some search engines default to the initially unhelpful OR (it returns documents in which EITHER occur)

 


What are "Proximity Operators"?

Also Boolean Operators, proximity operators such as NEAR or ADJ are used to control how closely the terms occur in the web document that is retrieved. For example, NEAR/3 means that the terms must occur within 3 words of each other. Proximity operators ensure that your terms are more closely related to another. 

Examples of usage:

NEAR/x
Use this operator to search for documents where you'd like both terms to appear within a specified distance of each other, narrowing a search

Dalmatians NEAR/3 feeding (web documents will be returned that have the term Dalmatians occurring within 3 words of the term Feeding

ADJ
Use this operator to search for documents where you'd like both terms to appear next to each other, narrowing a search

Dalmatians ADJ feeding 

Special Note: the ADJ Boolean operator is often disguised as the option "exact phrase."

 


Using The + Symbol to Add

Sometimes, you want to make sure that a search engine finds pages that have all the words you enter, not just some of them. The + symbol lets you do this. For example, imagine you want to find pages that have references to both President Clinton and Kenneth Starr on the same page. You could search this way:

+clinton  +starr

Only pages that contain both words would appear in your results. Here are some other examples:

+windows  +98  +bugs

That would find pages that have all three of the words on them, helpful if you wanted to narrow down a search to Windows 98 bugs, rather than on Windows 98 in general.

+star  +trek  +insurrection

That would get you pages about Star Trek that also specifically mention "Insurrection," the title of a Star Trek film

 


What is Field Searching?

Remember that a web search engine is only as good as its database and indexes. Databases are collections of records organized in a similar manner; simply put, this means they are divided into fields that contain the same information in each record. If data is entered into a separate field you can retrieve it using its field label. This means that if you want to search by title, the search engine looks in a special title index (or searches notations that indicate that the term occurs in the title field) where it has collected data from the field with the label title

Field searching is so wonderful because you can specify where to look in the web document; for example, in the title only, or the url fields. Field searching allows you to be very specific about where you want you terms to occur and hence is a very powerful tool. 

 Using Search templates:

 Some people are  big fans of advanced search templates such as the ones used by Hotbot (http://www.hotbot.com/) and Snap (http://www.snap.com/. These templates use many Boolean and field searching techniques without having to learn the syntax of yet another search engine.