Skip to main content

Currently Skimming:

Experience in Developing Information Retrieval Systems
Pages 699-710

The Chapter Skim interface presents what we've algorithmically identified as the most significant single chunk of text within every page in the chapter.
Select key terms on the right to highlight them within pages of the chapter.


From page 699...
... The operation consists of selecting those items which satisfy a given set of search criteria and then presenting the individual references to the searcher. In the electronic computer systems that we have been developing, the file has consisted of magnetic computer tape on which are stored coded individual items.
From page 700...
... Most of these machines use perforated paper tape or punched cards which have inherent reading time slower than most sorting machines. Assuming that the information content of a document can be stored in coded form on a single punched card (a serious limitation)
From page 701...
... If the organization does not~operate a computer, he will have to enter into contractual negotiations to rent computer time elsewhere. More serious problems arise when an organization currently operates a data processing machine.
From page 702...
... Heitz of the Dow Chemical Company's Western Division Research Laboratory proposed an alternative scheme consisting of a large array of small, specialized individual classifications where no pattern of conformity need be established. A proposal was later macle to provide for automatic interconversion of classification schemes in areas of overlapping interest.
From page 703...
... A single program was devised for the IBM 702 Electronic Data Processing Machine to search both files, retrieving according to the corresponding request form and delivering a printed, edited output for each requester. Some of the details of the computer programs might be of general interest and are outlined below.
From page 704...
... We wish to refer to this project in this context to the extent that it may be considered a good example of an information retrieval project carried out on a modern computing machine. In the system as presently constituted for the IBM 704 Electronic Data Processing Machine, a large fee of coded chemical structures stored on magnetic tape is searched at high speed, and the names of compounds complying with the search request are retrieved and printed.
From page 705...
... The chemical search problem illustrates the freest combination of retrieval methods, including simple correspondence, the use of numerical ranges, logical manipulations, and the difficult problem of topological connectivity. Our complex logic program demonstrates how the most realistic logical combinations can be built up to obtain as fine a discrimination as desired.
From page 706...
... Proper scheduling to minimize delays is the function ofthe computer administrative group and involves techniques beyond the scope of this paper. To summarize, the total elapsed time, at best, should be less than one hour for a moderately complex system involving up to 100,000 records.
From page 707...
... The material must first be read, digested, indexed, and entered onto magnetic tape directly or via intermediate punched cards. Work is now under development to make some of these processes automatic.
From page 708...
... While most of our experience has been gained on data processing machines assigned solely to rental users, we have also made considerable use of a machine engaged in daily processing using long runs. In this case, we were particularly
From page 709...
... Such data processors might be available at computing centers specially established for the rental business or at other organizations willing to sell machine time. Remote operation is widely used and the communication problem has been generally solved.
From page 710...
... Thus, it appears that the operating costs of machine retrieval wid depend on the length of the file, its rate of growth, its complexity, and the frequency of searching. Con cession The commercially produced electronic computer in the information retrieve field has yet to be accepted.


This material may be derived from roughly machine-read images, and so is provided only to facilitate research.
More information on Chapter Skim is available.