Abstract
Online handwriting classification has become an open research problem as it serves as a preliminary step for handwriting recognition systems and applications in several other fields. This paper aims to extend the current trends and knowledge with multiple contributions in handwriting classification using spatiotemporal information. Firstly, it enriches the annotations of several publicly available online handwriting datasets, SenseThePen, IAM-onDB, and IAMonDo, for online handwriting classification and recognition tasks. The said datasets are updated with three distinguished levels of annotations, i.e., stroke, sequence, and line levels. The enriched annotations of these datasets extend their functionality for online handwriting classification at different levels for further research analysis. In addition to enrichment, it also unifies the annotation levels across the datasets, which enables the research community to benchmark proposed methods for comparative analysis using multiple datasets. All the datasets with enriched annotations are made publicly available for the research community as part of the IAMonSense dataset. Moreover, this paper presents a comprehensive benchmark of these datasets using multiple deep neural networks such as traditional convolutional neural networks (CNNs), graph convolutional networks(GCNs), attention-based neural networks, and transformers. These benchmarks can be used later on for further development in online handwriting classification.
Similar content being viewed by others
References
Liwicki, M., Bunke, H.: Recognition of Whiteboard Notes: Online, Offline and Combination. Series in machine perception and artificial intelligence. World Scientific, Singapore (2008). https://doi.org/10.1142/6854
Liwicki, M., Bunke, H.: Feature selection for hmm and BLSTM based handwriting recognition of whiteboard notes. IJPRAI 23, 907–923 (2009). https://doi.org/10.1142/S0218001409007417
Mandal, S., Prasanna, S.R.M., Sundaram, S.: GMM posterior features for improving online handwriting recognition. Expert Syst. Appl. 97, 421–433 (2018). https://doi.org/10.1016/j.eswa.2017.12.047
Van Phan, T., Nakagawa, M.: Text/non-text classification in online handwritten documents with recurrent neural networks. In: 2014 14th International Conference on Frontiers in Handwriting Recognition, pp. 23–28 (2014). https://doi.org/10.1109/ICFHR.2014.12
Ye, J.-Y., Zhang, Y.-M., Yang, Q., Liu, C.-L.: Contextual stroke classification in online handwritten documents with edge graph attention networks. SN Comput. Sci. 1(3), 1–13 (2020)
Weber, M., Liwicki, M., Schelske, Y.T.H., Schoelzel, C., Strauß, F., Dengel, A.: MCS for Online Mode Detection: Evaluation on Pen-Enabled Multi-touch Interfaces. In: 2011 International Conference on Document Analysis and Recognition, pp. 957–961 (2011). https://doi.org/10.1109/ICDAR.2011.194
Kholmatov, A., Yanikoglu, B.: Identity authentication using improved online signature verification method. Pattern Recogn. Lett. 26(15), 2400–2408 (2005)
Liwicki, M., Schlapbach, A., Bunke, H., Bengio, S., Mariéthoz, J., Richiardi, J.: Writer Identification for Smart Meeting Room Systems. Technical report (2005). Published in Seventh IAPR Workshop on Document Analysis Systems, DAS, 2006. http://publications.idiap.ch/downloads/reports/2005/liwiki-idiap-rr-05-70.pdf
Said, H.E.S., Baker, K.D., Tan, T.N.: Personal identification based on handwriting. In: Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170), vol. 2, pp. 1761–17642 (1998). https://doi.org/10.1109/ICPR.1998.712068
Sesa-Nogueras, E., Faundez-Zanuy, M., Roure-Alcobé, J.: Gender classification by means of online uppercase handwriting: a text-dependent allographic approach. Cogn. Comput. 8(1), 15–29 (2016). https://doi.org/10.1007/s12559-015-9332-1
Younas, J., Malik, M.I., Ahmed, S., Shafait, F., Lukowicz, P.: Sense the pen: classification of online handwritten sequences (text, mathematical expression, plot/graph). Expert Syst. Appl. 172, 114588 (2021). https://doi.org/10.1016/j.eswa.2021.114588
Younas, J., Fritsch, S., Pirkl, G., Ahmed, S., Malik, M.I., Shafait, F., Lukowicz, P.: What am i writing: Classification of on-line handwritten sequences. In: Intelligent Environments (Workshops). Ambient Intelligence and Smart Environments, vol. 23, pp. 417–426. IOS Press, Amsterdam (2018)
Delaye, A., Liu, C.-L.: Contextual text/non-text stroke classification in online handwritten notes with conditional random fields. Pattern Recognit. 47(3), 959–968 (2014). https://doi.org/10.1016/j.patcog.2013.04.017
Van Phan, T., Nakagawa, M.: Combination of global and local contexts for text/non-text classification in heterogeneous online handwritten documents. Pattern Recogn. 51, 112–124 (2016). https://doi.org/10.1016/j.patcog.2015.07.012
Bresler, M., Průša, D., Hlaváč, V.: Online recognition of sketched arrow-connected diagrams. Int. J. Doc. Anal. Recognit. (IJDAR) 19(3), 253–267 (2016). https://doi.org/10.1007/s10032-016-0269-z
Indermühle, E., Liwicki, M., Bunke, H.: IAMonDo-database: an online handwritten document database with non-uniform contents. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems. DAS ’10, pp. 97–104. Association for Computing Machinery, New York (2010). https://doi.org/10.1145/1815330.1815343
Liwicki, M., Bunke, H.: IAM-OnDB - an On-Line English Sentence Database Acquired from Handwritten Text on a Whiteboard. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition. ICDAR ’05, pp. 956–961. IEEE Computer Society, USA (2005). https://doi.org/10.1109/ICDAR.2005.132
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9(8), 1735–1780 (1997). https://doi.org/10.1162/neco.1997.9.8.1735
Graves, A., Schmidhuber, J.: Framewise phoneme classification with bidirectional lstm networks. In: Proceedings of 2005 IEEE International Joint Conference on Neural Networks, 2005., vol. 4, pp. 2047–2052 (2005). IEEE
Kipf, T.N., Welling, M.: Semi-Supervised Classification with Graph Convolutional Networks. arXiv (2016). arXiv:1609.02907
Veličković, P., Cucurull, G., Casanova, A., Romero, A., Liò, P., Bengio, Y.: Graph Attention Networks (2017). arXiv arXiv:1710.10903
Brody, S., Alon, U., Yahav, E.: How Attentive are Graph Attention Networks? (2021). arXiv arXiv:2105.14491
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I.: Attention Is All You Need (2017). arXiv arXiv:1706.03762
Livingston, O.B.: A handwriting and pen-printing classification system for identifying law violators. J. Criminal Law Criminol. 49, 487 (1959)
Moore, J.: Handwriting classification. Police J. 18(1), 39–61 (1945). https://doi.org/10.1177/0032258X4501800107
Tappert, C.C., Suen, C.Y., Wakahara, T.: The state of the art in online handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 12(8), 787–808 (1990). https://doi.org/10.1109/34.57669
Plamondon, R., Srihari, S.N.: Online and off-line handwriting recognition: a comprehensive survey. IEEE Trans. Pattern Anal. Mach. Intell. 22(1), 63–84 (2000). https://doi.org/10.1109/34.824821
Pham, V., Bluche, T., Kermorvant, C., Louradour, J.: Dropout Improves Recurrent Neural Networks for Handwriting Recognition. In: 2014 14th International Conference on Frontiers in Handwriting Recognition, pp. 285–290 (2014). https://doi.org/10.1109/ICFHR.2014.55
Michal, Kozielski, M., Doetsch, P., Ney, H.: Improvements in RWTH’s System for Off-Line Handwriting Recognition. In: 2013 12th International Conference on Document Analysis and Recognition, pp. 935–939 (2013). https://doi.org/10.1109/ICDAR.2013.190
Carbune, V., Gonnet, P., Deselaers, T., Rowley, H.A., Daryin, A., Calvo, M., Wang, L.-L., Keysers, D., Feuz, S., Gervais, P.: Fast multi-language LSTM-based online handwriting recognition. Int. J. Doc. Anal. Recognit. (IJDAR) 23(2), 89–102 (2020)
Nguyen, C.T., Nakagawa, M.: Finite state machine based decoding of handwritten text using recurrent neural networks. In: 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 246–251 (2016). https://doi.org/10.1109/ICFHR.2016.0055
Shivram, A., Ramaiah, C., Govindaraju, V.: A hierarchical Bayesian approach to online writer identification. Iet Biomet. 2(4), 191–198 (2013)
Shivram, A., Ramaiah, C., Govindaraju, V.: Data sufficiency for online writer identification: A comparative study of writer-style space vs. feature space models. In: 2014 22nd International Conference on Pattern Recognition, pp. 3121–3125 (2014). https://doi.org/10.1109/ICPR.2014.538
Ahmed, M., Rasool, A.G., Afzal, H., Siddiqi, I.: Improving handwriting based gender classification using ensemble classifiers. Expert Syst. Appl. 85, 158–168 (2017). https://doi.org/10.1016/j.eswa.2017.05.033
Illouz, E., Netanyahu, N.S., et al.: Handwriting-based gender classification using end-to-end deep neural networks. In: International Conference on Artificial Neural Networks, pp. 613–621. Springer (2018)
Ye, J.-Y., Zhang, Y.-M., Liu, C.-L.: Joint training of conditional random fields and neural networks for stroke classification in online handwritten documents. In: 2016 23rd International Conference on Pattern Recognition (ICPR), pp. 3264–3269 (2016). https://doi.org/10.1109/ICPR.2016.7900138
Degtyarenko, I., Deriuga, I., Grygoriev, A., Polotskyi, S., Melnyk, V., Zakharchuk, D., Radyvonenko, O.: Hierarchical recurrent neural network for handwritten strokes classification. In: ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 2865–2869 (2021). https://doi.org/10.1109/ICASSP39728.2021.9413412
Ott, F., Wehbi, M., Hamann, T., Barth, J., Eskofier, B., Mutschler, C.: The OnHW Dataset: Online Handwriting Recognition from IMU-Enhanced Ballpoint Pens with Machine Learning. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 4(3) (2020). https://doi.org/10.1145/3411842
Hamdi, Y., Boubaker, H., Alimi, A.M.: Data Augmentation Using Geometric, Frequency, and Beta Modeling Approaches for Improving Multi-Lingual Online Handwriting Recognition. Int. J. Doc. Anal. Recognit. 24(3), 283–298 (2021). https://doi.org/10.1007/s10032-021-00376-2
Tang, W., Long, G., Liu, L., Zhou, T., Jiang, J., Blumenstein, M.: Rethinking 1d-cnn for time series classification: a stronger baseline (2020). arXiv preprint arXiv:2002.10061
Liu, M., Ren, S., Ma, S., Jiao, J., Chen, Y., Wang, Z., Song, W.: Gated Transformer Networks for Multivariate Time Series Classification (2021). arXiv arXiv:2103.14438
Wang, M., Zheng, D., Ye, Z., Gan, Q., Li, M., Song, X., Zhou, J., Ma, C., Yu, L., Gai, Y., Xiao, T., He, T., Karypis, G., Li, J., Zhang, Z.: Deep Graph Library: A Graph-Centric, Highly-Performant Package for Graph Neural Networks (2019). arXiv:1909.01315
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix A: Classification result error
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Mustafid, A., Younas, J., Lukowicz, P. et al. IAMonSense: multi-level handwriting classification using spatiotemporal information. IJDAR 26, 303–319 (2023). https://doi.org/10.1007/s10032-023-00433-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-023-00433-y