GLORIA

GEOMAR Library Ocean Research Information Access

  • 1
    Online Resource
    SAGE Publications ; 2008
    In: Journal of Information Science, SAGE Publications, Vol. 34, No. 2 ( 2008-04), p. 213-230
    Abstract: In current library practice, trained human experts usually carry out document cataloguing and indexing based on a manual approach. With the explosive growth in the number of electronic documents available on the Internet and digital libraries, it is increasingly difficult for library practitioners to categorize both electronic documents and traditional library materials using just a manual approach. To improve the effectiveness and efficiency of document categorization at the library setting, more in-depth studies of using automatic document classification methods to categorize library items are required. Machine learning research has advanced rapidly in recent years. However, applying machine learning techniques to improve library practice is still a relatively unexplored area. This paper illustrates the design and development of a machine learning based automatic document classification system to alleviate the manual categorization problem encountered within the library setting. Two supervised machine learning algorithms have been tested. Our empirical tests show that supervised machine learning algorithms in general, and the k-nearest neighbours (KNN) algorithm in particular, can be used to develop an effective document classification system to enhance current library practice. Moreover, some concrete recommendations regarding how to practically apply the KNN algorithm to develop automatic document classification in a library setting are made. To our best knowledge, this is the first in-depth study of applying the KNN algorithm to automatic document classification based on the widely used LCC classification scheme adopted by many large libraries.
    Type of Medium: Online Resource
    ISSN: 0165-5515 , 1741-6485
    Language: English
    Publisher: SAGE Publications
    Publication Date: 2008
    ZDB-ID: 439125-1
    ZDB-ID: 2025062-9
    SSG: 24,1
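
The abstract above names the k-nearest neighbours (KNN) algorithm as the basis of the classification system but does not describe its mechanics. The following is a minimal sketch of the general technique, not the authors' implementation: it assumes bag-of-words term counts, cosine similarity, and a majority vote over the k most similar training texts. The function names, the toy training texts, and the choice of k are all hypothetical; the labels "Z" (Bibliography, Library Science) and "GC" (Oceanography) are LCC classes picked only for illustration.

```python
import math
from collections import Counter


def vectorize(text):
    """Turn a text into a bag-of-words term-count vector."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[term] * b[term] for term in a if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def knn_classify(query, training, k=3):
    """Label a query text by majority vote of its k most similar training texts."""
    q = vectorize(query)
    scored = sorted(
        ((cosine(q, vectorize(text)), label) for text, label in training),
        key=lambda pair: pair[0],
        reverse=True,
    )
    votes = Counter(label for _, label in scored[:k])
    return votes.most_common(1)[0][0]


# Toy training set (hypothetical): short texts paired with LCC class labels.
TRAINING = [
    ("library cataloguing and classification of books", "Z"),
    ("bibliography and library science indexing", "Z"),
    ("information retrieval in digital libraries", "Z"),
    ("ocean currents and marine chemistry", "GC"),
    ("oceanography of deep sea currents", "GC"),
    ("marine sediment and ocean circulation", "GC"),
]

if __name__ == "__main__":
    print(knn_classify("automatic classification of library documents", TRAINING))
    print(knn_classify("deep ocean currents", TRAINING))
```

A production system would more likely use weighted term features (for example TF-IDF), stop-word removal, and a much larger labelled collection; this sketch leaves those out to keep the voting step visible.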