Embedded Fuzzy Bilingual Dictionary model for cross language information retrieval systems

  • Original Research
International Journal of Information Technology

Abstract

The goal of cross-language information retrieval (CLIR) is to map a query written in one language (the source language) to a document collection in another language (the target language). Over the years, CLIR systems have employed approaches such as dictionary translation, machine translation and query expansion to return documents in response to a user's query. Although the relevance of retrieved documents is central to the success of any information retrieval operation, achieving it is hampered by several issues that are compounded by the methods employed in CLIR. This research therefore examines some of the existing CLIR methods with a view to extending the dictionary translation and query expansion models. A Fuzzy Bilingual Dictionary (FBD) with a dual concept-driven document-clustering technique is proposed in a bid to retrieve more relevant documents across languages. The resulting model was tested on a dataset of 3000 computer science related articles with queries posed in the Yoruba language. The experiments achieved better recall and precision for the retrieval operation.
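The abstract does not specify how the FBD is built, so the following is only a minimal Python sketch of the general idea of fuzzy dictionary-based query translation, not the authors' model. It assumes that the "fuzzy" degree of match between a query term and a dictionary headword can be approximated by a string-similarity ratio, and that each translation inherits that degree as a weight. The dictionary entries (written without tone marks), the function names, and the 0.6 threshold are all hypothetical choices made for illustration.

```python
from difflib import SequenceMatcher

# Toy bilingual dictionary: hypothetical illustrative entries only,
# with Yoruba headwords written without tone marks.
TOY_DICTIONARY = {
    "iwe": ["book"],
    "omi": ["water"],
    "ile": ["house", "home"],
}


def membership(term, headword):
    """Degree of match in [0, 1] between a query term and a headword.

    Assumption: a plain string-similarity ratio stands in for a fuzzy
    membership function.
    """
    return SequenceMatcher(None, term, headword).ratio()


def fuzzy_translate(term, dictionary, threshold=0.6):
    """Return {target_word: weight} for headwords that match the term."""
    weighted = {}
    for headword, targets in dictionary.items():
        score = membership(term.lower(), headword)
        if score >= threshold:
            for target in targets:
                # Keep the strongest evidence seen for each translation.
                weighted[target] = max(weighted.get(target, 0.0), score)
    return weighted


def translate_query(query, dictionary, threshold=0.6):
    """Translate a whitespace-separated query into weighted target terms."""
    weights = {}
    for term in query.split():
        for target, score in fuzzy_translate(term, dictionary, threshold).items():
            weights[target] = max(weights.get(target, 0.0), score)
    return weights


if __name__ == "__main__":
    # "iwee" is a deliberate misspelling of the headword "iwe".
    print(translate_query("iwee omi", TOY_DICTIONARY))
```

In this sketch a misspelled or unmarked query term still reaches its headword, but its translations carry a weight below 1, which a downstream ranking or clustering step could take into account.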

Acknowledgements

This work was performed under the CV Raman fellowship. We acknowledge the Department of Computer Science and Engineering, IIT Kharagpur, India, and the Department of Computer Science, University of Ibadan, Nigeria.

Author information

Corresponding author

Correspondence to Olufade F. W. Onifade.

About this article

Cite this article

Onifade, O.F.W., Ibitoye, A.O.J. & Mitra, P. Embedded Fuzzy Bilingual Dictionary model for cross language information retrieval systems. Int. j. inf. tecnol. 10, 457–463 (2018). https://doi.org/10.1007/s41870-018-0181-5

  • DOI: https://doi.org/10.1007/s41870-018-0181-5
