Doing Research on the Internet -- A Brief Guide

The Internet can provide researchers with tremendous amounts of information in a very short time. In many cases, a researcher can easily be overwhelmed by the sheer quantity of information accessible. Since millions of documents are available over the net, ignorance of effective search techniques is not bliss. A researcher literally could spend days searching for information and never find exactly what he or she needs. Compounding this difficulty is the very openness of the Internet. Almost anybody with a computer can post almost any kind of information on the net regardless of its value or authenticity or accuracy. This makes it even more challenging to the researcher to identify ONLY those sources of information that can be judged reliable and accurate. There are a number of tricks to effectively locate information on the net.

The quickest way to find information on the Internet is to already know the address of a reliable resource that provides information on the subject you wish to research. Unless you are an old hand at Internet searching, you most likely will not have addresses that you can whip up at whim. In this case, the best way to search the net is to use one of the many web catalogs that are widely available on the World Wide Web. The catalogs are created using Web crawlers, electronic "robots" that figuratively "crawl" the Web searching for connections to different data sources. The links discovered by crawlers are then indexed by keywords so that Web surfers can quickly input subjects and gain access to the wealth of information available on the Internet.

No one search engine provides the complete story, though. Currently, the most comprehensive index of the Internet is provided by Google. Developed by students at Stanford University, Google currently indexes over 4 billion Web pages, the largest collection of any of the search engines. What really makes Google special among Internet search services is its uncanny ability to return highly relevant Web pages within the first ten or so listings in its search retrieval. Using complex algorithms that rank Web pages according to word frequency and link frequency, Google does an amazing job of providing users with the best sites in the top of its listings. Google has been so good at retrieving relevant Web pages that a number of other search services now partner with Google to expand their own search capabilities. A full history of Google can be found at their Web site.

Once you locate information on the Internet, you should make every effort to identify the source of the information so that you can try to judge its accuracy and reliability. One way to begin evaluating is to look at the composition of the Internet address. The following address extensions can help you to identify the source of the information that you retrieve.

  • .com--These are commercial sites, which may include corporate homepages or individual users's homepages. Commercial sites provide users with reliable information about business enterprises, product information, online technical support for software and hardware, and, in many cases, online product ordering capabilities. Since commercial sites may also provide Internet access to individuals, be wary of sources that have no obvious connection to a business enterprise. Example: Microsoft Corporation
  • .edu--Addresses ending in this extension indicate that you are connecting to a university, college, or other school computer system. These sites can contain both authoritative and frivolous information, depending on the institution's user policies. If a university grants students, faculty, and staff open access to its Internet system, you can expect to find documents containing anything from research to jokes and humor. Examine documents obtained from educational institutions carefully. Look for department affiliations, author credentials, and any other identifying criteria that would support a document's seriousness and reliability. Example: Florida State University
  • .gov--This extension identifies the information server as a government entity. These sources can be deemed reliable since government bodies (at least in the United States) are charged with the duty of providing their constituents with accurate information on laws, regulations, finances, almost any aspect of government. Example: The United States Government's Thomas Information System
  • .mil--This designates a military body, such as the Pentagon.
  • .org--These are associations or other non-commercial organizations that maintain Internet sites. Much research can be obtained from professional and research organizations. These sites typically will be reliable. Example: The United Nations

As the Web has continued its exponential growth, additional top level domains (TLDs) have been added. Following are TLDs that have been added. Further information on these domains can be found at the IANA (Internet Assigned Numbers Authority) Web site

.aero          reserved for members of the air-transport industry
.biz   restricted to businesses
.coop   reserved for cooperative associations
.info   reserved for entitites providing information services
.museum   reserved for museums on the Web
.name   reserved for individuals
.pro   still being developed, this will be reserved for credentialed professionals
.int   reserved for organizations established by international treaties between governments

Last updated 01/05.

counter        | About LIS1001 | Site Map | Contact Jim | ©2007 Jim Alderman |