DSpace at the University of Texas at Austin School of Information >
School of Information Faculty >
Julie Hallmark >
Julie Hallmark Publications >
Please use this identifier to cite or link to this item:
|Title: ||Document Retrieval by Means of an Automatic Classification Algorithm for Citations / Julie Hallmark Bichteler|
|Authors: ||Hallmark, Julie|
|Keywords: ||Information storage and retrieval systems|
|Issue Date: ||1974|
|Publisher: ||Elsevier Inc.
|Citation: ||Document Retrieval by Means of an Automatic Classification Algorithm for Citations (with Ronald G. Parsons). 1974. Information Storage and Retrieval 10:267-78.|
|Series/Report no.: ||Published Papers;A-1974-041|
|Abstract: ||The applicability of an automatic classification technique to information retrieval was investigated. A modified version of the Schiminovich algorithm was used to classify articles in the data base, utilizing citations found in their bibliographies and a "triggering file" of bibliographically related papers. Classes of articles were formed by comparing the citations in the articles with members of the triggering file. Those papers with similar citing patterns formed a group; citations occurring sufficiently often within the papers of a group formed a bibliography. Bibliographies in turn became new triggering files in an iterative procedure.
Results of this nontraditional method were compared with those of retrieval by standard American Institute of Physics (AIP) subject analysis of the same material. A combination of the two methods was also investigated to test the hypothesis that one could be used to augment or refine the other.
Nine cooperating physicists defined queries in terms of (1) the AIP classification scheme and (2) a list of articles likely to be cited in current relevant literature. The data base was 18 months of 1971-1972 physics journal articles (AIP SPIN tapes). The chief means of evaluating the results of the three retrieval approaches was a comparison of recall and precision values obtained from relevance judgments by the physicists.
Average precision for the AIP subject analysis was 17 per cent and for the citation processing 62 per cent. Based on an assumption of 100 per cent recall for the AIP analysis, average recall for the citation processing was 45 per cent. All but one physicist would prefer to have both the citation and subject approaches available for information retrieval. Sometimes, only a few relevant articles are desired; at other times comprehensiveness is necessary. However, if only one method were available, all except one participant would choose the subject approach. They felt that one must be able to retrieve all the relevant articles, even if that would mean examining 25-30 irrelevant notices for every relevant one. Several expressed the notion that looking at a file with a very high percentage of relevant articles made one wonder what was missing.|
|Appears in Collections:||Julie Hallmark Publications|
Files in This Item:
|A-1974-041ocr.rtf||This version is a searchable file, not submitted to quality control.||8.17 MB||RTF||View/Open|
|A-1974-041.tif||This version is a high resolution file, scanned at 400 DPI, for preservation and archival purposes.||217.55 MB||TIFF||View/Open|
|A-1974-041.pdf||This version is a low resolution Adobe proprietary file for access purposes.||11.61 MB||Adobe PDF||View/Open|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.