Abstract
The goal of Cross language information Retrieval (CLIR) is the mapping of query written in one language (source language) to document collection in another language (target language). Over the years, CLIR employs approaches like dictionary translation, machine translations and query expansion to output documents based on users query. While relevance of retrieved document ranks high for the success of any Information Retrieval operation, obtaining relevance is encumbered with several issues compounded by the methods employed in CLIR. Thus, the research examines some of these existing methods in CLIR towards extending the dictionary translator and query expansion model. Here, a Fuzzy Bilingual Dictionary (FBD) with a dual concept driven document-clustering technique was proposed with a bid to retrieve more relevant document across language. The resulting model was tested on 3000 dataset of computer science related articles with queries posed in Yoruba language. A better recall and precision rate of the retrieval operation was achieved from the experiments.
Similar content being viewed by others
References
Yang CC, Wei C, Li KW (2008) Cross-lingual thesaurus for multilingual knowledge management. Decis Support Syst 45(3):596–605. ISSN:0167-9236
Pimienta D, Prado D, Álvaro B (2009) Twelve years of measuring linguistic diversity in the Internet: balance and perspectives, pp 1–65
Kimura F, Maeda A, Hatano K, Miyazaki J, Uemura S (2008) Cross-language information retrieval by domain restriction using web directory structure. In: HICSS, IEEE Computer Society, p 135
Nie JY (2010) Cross-language information retrieval. Synthesis Lectures on Human Language Technologies. Morgan & Claypool Publishers
Adegbola T, Owolabi K, Odejobi T (2011) Localizing for Yoruba: experience, challenges and future directions. In: Conference on Human Language Technology for Development, Alexandria, Egypt, 2–5 May 2011
Cleverley PH, Burnett S (2015) Retrieving haystacks: a data driven information needs model for faceted search. J Inform Sci 41(1):97–113
Tsai P (2011) An Introduction to Cross Language Information Retrieval Approaches. pp 1–20. http://web.simmons.edu/~tsai/. Accessed 19 Aug 2014
SivaKumar AP, Premchand P, Govardhan A (2011) Indian language IR using latent semantic indexing. Int J Comput Sci Inf Tech IJCSIT 3(4):245–253. https://doi.org/10.5121/ijcsit.2011.3419
Oard DW (1998) A Comparative Study of Query and Document Translation for Cross-Language Information Retrieval. In: Proceedings of the Third Conference of the Association For Machine Translation in the Americas on Machine Translation and the information Soup (October 28–31, 1998)
Okabe M, Yamada S (2007) Semi supervised query expansion with minimal feedback. IEEE Trans Knowl Data Eng 19:1585{1589}
Sacaleanu B, Buitelaar P, Volk M (2003) A cross language document retrieval system based on semantic annotation. In: EACL., pp 231–234
Pirkola (2001) Dictionary-Based Cross-Language information retrieval: problems, methods, and research findings. J Inform Retrieval Arch 4(3–4):209–230, September–December 2001. Kluwer Academic Publishers Hingham, MA, USA. https://doi.org/10.1023/a:1011994105352
He B, Ounis I (2009) Studying Query Expansion Effectiveness. In: Proceedings of ECIR’09 European Conference in Information Retrieval
Gaillard B, Bouraoui JL, Guimier de Neef E, Boualem M (2010) Query expansion for cross language information retrieval improvement. Research Challenges in Information Science (RCIS), 2010 IEEE Explore, pp 337–342. https://doi.org/10.1109/rcis.2010.5507393
Manning CD, Raghavan P, Schütze H (2009). An introduction to information retrieval. Cambridge University Press, pp 177–194
Billerbeck B (2005) Efficient query expansion. PhD Thesis, RMIT University, Melbourne, Australia, September 2005
Onifade OFW (2016) Ibitoye AOJ (2016), “Fuzzy latent semantic query expansion model for enhancing information retrieval”. I.J. Mod Educ Comput Sci 8(2):49–53
Ayoade AM, Oyekunle VB, Onifade OFW (2014) Concept-based semantically-enhanced scientific paper recommender system. In: ISKO conf on Transition from Observation to Knowledge to Intelligence, 20–22 August, 2014, UniLag, Lagos, Nigeria
Onifade OFW, Osofisan AO (2011) Resolving ambiguous queries via fuzzy string matching and dynamic buffering techniques. Inform Intell Syst Technol Manag Commun Comput Inform Sci 141(3):198–205. Springer, Berlin Heidelberg. https://doi.org/10.1007/978-3-642-19423-8_21
Onifade OFW, Thiéry O, Osofisan AO, Duffing G (2010) Dynamic fuzzy-string matching model for information retrieval based on incongruous user’s queries. In: World Congress on Engineering Conference (WCE’10), London, UK, 30 June–2 July, 2010
Acknowledgements
This work was performed under the CV Raman fellowship. We acknowledged the Department of Computer Science and Engineering, IIT Kharagpur, India and Department of Computer Science, University of Ibadan, Nigeria.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Onifade, O.F.W., Ibitoye, A.O.J. & Mitra, P. Embedded Fuzzy Bilingual Dictionary model for cross language information retrieval systems. Int. j. inf. tecnol. 10, 457–463 (2018). https://doi.org/10.1007/s41870-018-0181-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41870-018-0181-5