Abstract
Word prediction is treated as an efficient technique to enhance text entry rate. Existing word prediction systems predict a word when a user correctly enters the initial few characters of the word. In fact, a word prediction system fails if the user makes errors in the initial input. Therefore, there is a need to develop a word prediction system that predicts desired words while coping with errors in initial entries. This requirement is more relevant in the case of text entry in Indian languages, which are involved with a large set of alphabets, words with complex characters and inflections, phonetically similar sets of characters, etc. In fact, text composition in Indian languages involves frequent spelling errors, which presents a challenge to develop an efficient word prediction system. In this article, we address this problem and propose a novel word prediction system. Our proposed approach has been tried with Hindi, the national language of India. Experiments with users substantiate 43.77% keystroke savings, 92.49% hit rate, and 95.82% of prediction utilization with the proposed word prediction system. Our system also reduces the spelling error by 89.75%.
- Ahmed, U. Z., Bali, K., Choudhury, M., and VB, S. 2011. Challenges in designing input method editors for Indian languages: The role of word-origin and context. In Proceedings of the Workshop on Advances in Text Input Methods (WTIM).Google Scholar
- Alm, N., Arnott, L. J., and Newell, A. F. 1992. Prediction and conversational momentum in an augmentative communication system. Commun. ACM 35, 5, 46--57. Google ScholarDigital Library
- Balasubramaniam, L. 2006. Translation article knowledgebase - Spell it Right! (in the Context of Hindi). http://www.proz.com/translation-articles/articles/705/.Google Scholar
- Begum, R., Husain, S., Dhwaj, A., Misra, D., Bai, L., and Sangal, R. 2008. Dependency annotation scheme for Indian languages. In Proceedings of the International Joint Conference on Natural Language Processing.Google Scholar
- Bharati, A., Rao, P., Sangal, R., and Bendre, S. M. 2002. Basic statistical analaysis of corpus and cross comparision. In Proceedings of ICON.Google Scholar
- Bhatia, T. A History of the Hindi Grammatical Tradition: Hindi-Hindustani Grammar, Grammarians, History and Problems. E.J. Brill, Leiden, Netherlands.Google Scholar
- Boissiere, P. 2003. An overview of existing writing assistance system. In Proceedings of the IFRATH Workshop.Google Scholar
- Carlberger, A., Carlberger, J., Magnuson, T., Hunnicutt, M. S., Palazuelos-Cagigas, S. E., and Navarro, S. A. 1997a. Profet, a new generation of word prediction: An evaluation study. In Proceedings of the 2nd Workshop on NLP for Communication Aids.Google Scholar
- Carlberger, A., Magnuson, T., Carlberger, J., Wachtmeister, H., and Hunnicutt, S. 1997b. Probability-based word prediction for writing support in dyslexia. In Proceedings of the Fonetik Conference. 17--20.Google Scholar
- Carlberger, J. 1997. Design and Implementation of a Probabilistic Word Prediciton Program. M.S. thesis, Computer Science, Nada, KTH, Stockholm, Sweden.Google Scholar
- CDAC. 2010a. Indian language search engine technologies-problems and solutions. http://iplugin.cdac.in/search-engine.htm.Google Scholar
- CDAC. 2010b. Problems with existing unicode based engines. http://pune.cdac.in/html/gist/research-areas/set.aspx.Google Scholar
- Consortium, T. U. 2009. Unicode detail. http://www.unicode.org.Google Scholar
- Consortium, U. 2010. Unicode normalization forms. http://www.unicode.org/reports/tr15/.Google Scholar
- Consortium, U. 2011. South Asian scripts-I. http://www.unicode.org/versions/Unicode5.0.0/ch09.pdf.Google Scholar
- Constable, P. 2004. Proposal on clarification and consolidation of the function of ZERO WIDTH JOINER in Indic scripts. Review document, Unicode Consortium. http://www.unicode.org/review/pr-37.pdf.Google Scholar
- Cormen, T. H., Leiserson, C. E., Rivest, R. L., and Stein, C. 2001. Introduction to Algorithms 3rd Ed. MIT Press, Cambridge, MA. Google ScholarDigital Library
- Darragh, J. J. and Witten, I. H. 1991. Adaptive predictive text generation and the reactive keyboard. Interact. Comput. 3, 1, 27--50. Google ScholarDigital Library
- Dyke, J. A. V. 1991. Word prediction for disabled users: Applying natural language processing to enhance communication. M.S. thesis, University of Delaware.Google Scholar
- Fazly, A. 2002. The Use of Syntax in Word Completion Utilities. M.S. thesis, Department of Computer Science, University of Toronto.Google Scholar
- Fazly, A. and Hirst, G. 2003. Testing the efficacy of part-of-speech information in word completion. In Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics. 9--16. Google ScholarDigital Library
- Garay-Vitoria, N. and Abascal, J. 2004. A comparison of prediction techniques to enhance the communication rate. In User-Centered Interaction Paradigms for Universal Access in the Information Society, Vol. 3196, Springer, Berlin, 400--417.Google Scholar
- Garay-Vitoria, N. and Abascal, J. 2006. Text prediction systems: A survey. Univ. Access Inform. Soc. 4, 3, 188--203. Google ScholarDigital Library
- Garay-Vitoria, N. and González, J. A. 1997. Intelligent word prediction to enhance text input rate (a syntactic analysis based word prediction aid for people with severe motor and speech disability). In Proceedings of the 2nd International Conference on Intelligent User Interfaces. 241--244. Google ScholarDigital Library
- Ghosh, P. K. and Knuth, D. E. 1983. An approach to type design and text composition in Indian scripts. Ph.D. thesis, Stanford University.Google Scholar
- Go, K. and Endo, Y. 2008. Advances in human-computer interaction. A touchscreen software keyboard for finger typing. In Advances in Affective and Pleasurable Design, Yong Gu Ji Ed., 287--296.Google Scholar
- Google. 2010. Google Indic on-screen keyboard. http://www.google.co.in/.Google Scholar
- Google. 2014. Input tools. http://www.google.co.in/inputtools/.Google Scholar
- Group, T. R. 2011. Email statistics report, 2011-15. http://www.radicati.com/wp/wp-content/uploads/2011/05/ Email-Statistics-Report-2011-2015-Executive-Summary.pdf.Google Scholar
- Gupta, A. and Jamal, G. 2006. An analysis of reading errors of dyslexic readers in Hindi and English. Asia Pacific Disability Rehab. J. 17, 1, 73--86.Google Scholar
- Herold, M. 2004. The use of word prediction as a tool to accelerate the typing speed and increase the spelling accuracy of primary school children. Ph.D. thesis, University of Pretoria.Google Scholar
- Herold, M., Alant, E., and Bornman, J. 2008. Typing speed, spelling accuracy, and the use of word prediction. South African J. Educ. 28, 1, 117--134.Google Scholar
- Higginbotham, D. J. 1992. Evaluation of keystroke savings across five assistive communication technologies. Augmentative Alt. Commun. 8, 258--272.Google ScholarCross Ref
- IBM. 2011. SPSS - Statistical package for the social sciences. http://www-01.ibm.com/software/analytics/spss/products/statistics/.Google Scholar
- Ishida, R. 2010. An introduction to writing systems & unicode: A review of script characteristics affecting computer-based script support and unicode. http://people.w3.org/rishida/docs/unicode-tutorial.Google Scholar
- Isokoski, P. 2004. Manual text input: Experiments, models, and systems. Ph.D. thesis, Department of Computer Sciences, University of Tampere.Google Scholar
- Joshi, A., Ganu, A., Chand, A., and Mathur, V. P. G. 2004. Keylekh: A keyboard for text entry in Indic scripts. In Proceedings of the Conference on Extended Abstracts on Human factors in Computing Systems (CHI). 928--942. Google ScholarDigital Library
- Joshi, R., Shoff, K., and Mudur, S. 2003. A phonemic code based scheme for effective processing of Indian languages. In Proceedings of 23rd Internationalization and Unicode Conference.Google Scholar
- Jurafsky, D. and Martin, J. 2000. Speech and Language Processing. Prentice Hall, New Jersey. Google ScholarDigital Library
- Katz, S. 1987. Estimation of probabilities from sparse data for the language model component of a speech recognizer. IEEE Trans. Acoust., Speech, Signal Process. 35, 3, 400--401.Google ScholarCross Ref
- Klund, J. and Novak, M. 1995. If word prediction can help, which program do you choose. In Proceedings of the 18th Annual Conference on Rehabilitation Technology.Google Scholar
- Koester, H. H. and Levine, S. P. 1998. Model Simulations of User Performance with Word Prediction. Augmentative and Alternative Communication 14, 1, 25--35.Google ScholarCross Ref
- Koul, O. N. 2008. Modern Hindi Grammar. Dunwoody Press, Hyattsville.Google Scholar
- Kristensson, P. O. 2009. Five challenges for intelligent text entry methods. AI Mag. 30, 4, 85--94.Google ScholarCross Ref
- Kukich, K. 1992. Techniques for automatically correcting words in text. ACM Comput. Surv. 24, 4, 377--439. Google ScholarDigital Library
- Levinson, S. E. 1985. Structural methods in automatic speech recognition. Proc. IEEE 73, 11, 1625--1650.Google ScholarCross Ref
- Lipik. 2012.Lipik: A predictive text input system. http://www.lipik.Google Scholar
- MacArthur, C. A. 1998. Word processing with speech synthesis and word prediction: Effects on the dialogue journal writing of students with learning disabilities. Learn. Disabil. Quart. 21, 2, 151--166.Google ScholarCross Ref
- MacArthur, C. A., Graham, S., Haynes, J. B., and De La Paz, S. 1996. Spelling checkers and students with learning disabilities: Performance comparisons and impact on spelling. J. Special Educ. 30, 35--57.Google ScholarCross Ref
- MacKenzie, I. S., Kober, H., Smith, D., Jones, T., and Skepner, E. 2001. LetterWise: Prefix-based disambiguation for mobile text input. In Proceedings of the 14th Annual ACM Symposium on User Interface Software and Technology. 111--120. Google ScholarDigital Library
- MacKenzie, I. S., Shawn, I. S., Zhang, X., and Soukoreff, R. W. 1999. Text entry using soft keyboards. Behav. Inform. Technol., 18, 4, 235--244.Google ScholarCross Ref
- MacKenzie, I. S. and Tanaka-Ishii, K. 2007. Text Entry Systems: Mobility, Accessibility, Universality. Morgan Kaufmann, San Francisco, CA. Google ScholarDigital Library
- Manning, C. D. and Schütze, H. June 1999. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA. Google ScholarDigital Library
- Matiasek, J., Baroni, M., and Trost, H. 2002. FASTY - A multi-lingual approach to text prediction. In Proceedings of the 8th International Conference on Computers Helping People with Special Needs. 243--250. Google ScholarDigital Library
- Mohanan, T. 1994. Argument Structure in Hindi. Center for the Study of Language and Information, Leland Stanford Junior University, CA.Google Scholar
- NCIP. 1994. Writing with word prediction software. http://www.edc.org/NCIP/LIBRARY/wp/Profile.htm.Google Scholar
- Oulasvirta, A., Reichel, A., Li, W., Zhang, Y., Bachynskyi, M., Vertanen, K., and Kristensson, P. O. 2013. Improving two-thumb text entry on touchscreen devices. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 2765--2774. Google ScholarDigital Library
- Pedler, J. 2007. Computer correction of real word spelling errors in dyslexic text. Ph.D. thesis, Birkbeck, University of London.Google Scholar
- Ramanathan, A., Choudhary, H., Ghosh, A., and Bhattacharyya, P. 2009. Case markers and morphology: Addressing the crux of the fluency problem in English-Hindi SMT. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP. Volume 2, 800--808. Google ScholarDigital Library
- Samanta, D., Sarcar, S., and Ghosh, S. 2013. An approach to design virtual keyboards for text composition in Indian languages. Intern. J. Hum.-Comput. Interact. 29, 8.Google ScholarCross Ref
- Sarcar, S., Ghosh, S., Saha, P. K., and Samanta, D. 2010. Virtual keyboard design: State of the arts and research issues. In Proceedings of IEEE Students’ Technology Symposium. 289--299.Google Scholar
- Seymore, K. and Rosenfeld, R. 1996. Scalable backoff language models. In Proceedings of the 4th International Conference on Spoken Language. 232--235.Google Scholar
- Sharma, M. K., Dey, S., Saha, P. K., and Samanta, D. 2010. Parameters effecting the predictive virtual keyboard. In Proceedings of the IEEE Students’ Technology Symposium. 268--275.Google Scholar
- SLM, C. 1999. The CMU statistical language modeling (SLM) toolkit. http://homepages.inf.ed.ac.uk/lzhang10/slm.html.Google Scholar
- Stolberg, H. O., Norman, G., and Trop, I. 2004. Fundamentals of clinical research for radiologists. Amer. J. Radiol. 183, 1539--1544.Google Scholar
- Strassel, S., Maxwell, M., and Cieri, C. 2003. Linguistic resource creation for research and technology development: A recent experiment. ACM Trans. Asian Lang. Inform. Process. 2, 2, 101--117. Google ScholarDigital Library
- Swiffin, A. L., Arnott, J. L., and Newell, A. F. 1987. The use of syntax in a predictive communication aid for the physically handicapped. In Proceedings of the 10th Annual Conference on Rehabilitation Technology. 124--126.Google Scholar
- Tachyon Technologies. 2012. Typing Hindi with Quillpad. http://www.quillpad.in/index.html\#.UtznOfvhUdV.Google Scholar
- Trnka, K., McCaw, J., Yarrington, D., McCoy, K. F., and Pennington, C. 2009. User interaction with word prediction: The effects of prediction quality. ACM Trans. Access. Comput. 1, 3, 1--34. Google ScholarDigital Library
- Vanderheiden, G. C. and Kelso, D. P. 1987. Comparative analysis of fixed-vocabulary communication acceleration techniques. Augmentative Alt. Commun. 3, 4, 196--206.Google ScholarCross Ref
- Wandmacher, T. 2009. Adaptive Word Prediction and its Application in an Assistive Communication System. Ph.D. thesis, University of Tubingan.Google Scholar
- Wolf, E., Vembu, S., and Miller, T. 2006. On the use of topic models for word completion. Adv. Natural Lang. Process. 4139, 500--511. Google ScholarDigital Library
- Wood, M. 1996. Syntactic pre processing in single word prediction for disabled people. Ph.D. thesis, University of Bristol.Google Scholar
- Zordell, J. 1990. The use of word prediction and spelling correction software with mildly handicapped students. Clos. Gap 9, 1, 10--11.Google Scholar
Index Terms
- Word Prediction System for Text Entry in Hindi
Recommendations
A Unique Word Prediction System for Text Entry in Hindi
ICTCS '16: Proceedings of the Second International Conference on Information and Communication Technology for Competitive StrategiesWord prediction is very effective technique for improving efficiency of entering text. Current word prediction systems predict a word if and only if a user has not made mistake in the starting of some characters of the word. This is more applicable for ...
A POS-Based Word Prediction System for the Persian Language
GoTAL '08: Proceedings of the 6th international conference on Advances in Natural Language ProcessingWord prediction is the problem of guessing the words which are likely to follow in a given text segment by displaying a list of the most probable words that could appear in that position. In this research, we designed and implemented three word ...
Basic word completion and prediction for hebrew
SPIRE'12: Proceedings of the 19th international conference on String Processing and Information RetrievalThis research aims to improve keystroke savings for completion and prediction of Hebrew words. This task is very important to augmentative and alternative communication systems as well as to search engines, short messages services, and mobile phones. ...
Comments