Text Detection Based on Faster R-CNN Algorithm with Skip Pooling and Fusion of Hindi Handwritten Characters
DOI:
https://doi.org/10.25008/ijadis.v2i1.1197Keywords:
object detection, Faster R-CNN, skip pooling, guided anchor RPNAbstract
Significant learning is at present the standard system for object disclosure. Speedier territory based convolutional neural association (Faster R-CNN) has a basic circumstance in significant learning. It has stunning area impacts in standard scenes. Regardless, under unprecedented conditions, there can even now be inadmissible acknowledgment execution, for instance, the thing having issues like hindrance, contorting, or little size. This paper proposes a novel and improved estimation reliant on the Faster R-CNN framework got together with the Faster R-CNN figuring with skip pooling and mix of consistent information. This computation can improve the revelation execution under uncommon conditions dependent on Faster R-CNN. The improvement basically has three segments: The underlying portion adds a setting information incorporate extraction model after the conv5_3 of the convolutional layer; the resulting part adds skip pooling so the past can totally secure the coherent information of the article, especially for conditions where the thing is hindered and distorted; and the third part replaces the area recommendation association (RPN) with a more capable guided anchor RPN (GA-RPN), which can keep up the survey rate while improving the revelation execution. The last can get more positive information from different segment layers of the significant neural association figuring, and is especially centered around scenes with little articles. Differentiated and Faster R-CNN, you simply look once plan, (for instance, YOLOv3), single shot pointer, (for instance, SSD512), and other article revelation computations, the estimation proposed in this paper has an ordinary improvement of 6.857% on the mean typical precision (mAP) appraisal list while keeping up a particular audit rate. This unequivocally exhibits that the proposed methodology has higher ID rate and disclosure efficiency for this circumstance.
Downloads
Plum Analytics | Dimensions | |
References
Li, C. Dangerous Posture Monitoring for Undersea Diver Based on Frame Difference Method. J. Coast. Res. 2020, 103, 939-942. https://doi.org/10.2112/SI103-195.1
Aronniemi, M.; Sainio, J.; Lahtinen, J. Chemical state quantification of iron and chromium oxides using XPS: The effect of the background subtraction method. Surf. Sci. 2005, 578, 108-123. https://doi.org/10.1016/j.susc.2005.01.019
Dougherty, L.; Asmuth, J.; Blom, A.; Axel, L.; Kumar, R. Validation of an optical flow method for tag displacement estimation. IEEE Trans. Med. Imaging 1999, 18, 359-363. https://doi.org/10.1109/42.768845
Shi, D.; Zheng, L.; Liu, J. Advanced Hough Transform Using A Multilayer Fractional Fourier Method. IEEE Trans. Image Process. 2010, 19, 1558-1566. https://doi.org/10.1109/TIP.2010.2042102
Song, Y.-S.; Cho, S.-B.; Suh, I.H. Activity-Object Bayesian Networks for Detecting Occluded Objects in Uncertain Indoor Environment. In Proceedings of the Knowledge-Based Intelligent Information & Engineering Systems, International Conference, Kes, Melbourne, Australia, 14-16 September 2005. https://doi.org/10.1007/11553939_132
Shlezinger, N.; Farsad, N.; Eldar, Y.C.; Goldsmith, A.J. ViterbiNet: A Deep Learning Based Viterbi Algorithm for Symbol Detection. IEEE Trans. Wirel. Commun. 2020, 19, 3319-3331. https://doi.org/10.1109/TWC.2020.2972352
Piotr, D.; Wojek, C.; Schiele, B.; Perona, P. Pedestrian Detection: The State of the Art. IEEE Trans. Softw. Eng. 2011, 34, 743-761. https://doi.org/10.1109/TPAMI.2011.155
Viola, P.; Jones, M.J. Robust Real-Time Face Detection. Int. J. Comput. Vis. 2004, 57, 137-154. https://doi.org/10.1023/B:VISI.0000013087.49260.fb
Chen, B.-H.; Huang, S.-C. An Advanced Moving Object Detection Algorithm for Automatic Traffic Monitoring in Real-World Limited Bandwidth Networks. IEEE Trans. Multimedia 2014, 16, 837-847. https://doi.org/10.1109/TMM.2014.2298377
Zhang, J.; Wang, F.-Y.; Wang, K.; Lin, W.-H.; Xu, X.; Chen, C. Data-Driven Intelligent Transportation Systems. IEEE Trans. Intell. Transp. Syst. 2011, 12, 1624-1639. https://doi.org/10.1109/TITS.2011.2158001
Hua, X.; Wang, X.-Q.; Wang, D.; Huang, J.; Hu, X. Military Object Real-Time Detection Technology Combined with Visual Salience and Psychology. Electronics 2018, 7, 216. https://doi.org/10.3390/electronics7100216
Zhu, W.; Huang, W.; Lin, Z.; Yang, Y.; Huang, S.; Zhou, J. Data and feature mixed ensemble based extreme learning machine for medical object detection and segmentation. Multimed. Tools Appl. 2015, 75, 2815-2837. https://doi.org/10.1007/s11042-015-2582-9
Kanezaki, A.; Rodolà, E.; Cremers, D.; Harada, T. Learning Similarities for Rigid and Non-rigid Object Detection. In Proceedings of the 2014 2nd International Conference on 3D Vision, Tokyo, Japan, 8-11 December 2014; IEEE: Piscataway, NJ, USA, 2014; Volume 1, pp. 720-727. https://doi.org/10.1109/3DV.2014.61
Wang, J.; Chen, K.; Yang, S.; Loy, C.C.; Lin, D. Region Proposal by Guided Anchoring. In Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 16-20 June 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 2960-2969. https://doi.org/10.1109/CVPR.2019.00308
Kulkarni, A.; Callan, J.; Selective, S. Efficient and Effective Search of Large Textual Collections. ACM Trans. Inf. Syst. 2015, 33, 17.1-17.33. https://doi.org/10.1145/2738035
Wang, X.; Xiao, T.; Jiang, Y.; Shao, S.; Sun, J.; Shen, C. Repulsion Loss: Detecting Pedestrians in a Crowd. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition 2018, Salt Lake City, UT, USA, 18-23 June 2018; pp. 7774-7783. https://doi.org/10.1109/CVPR.2018.00811
Bell, S.; Zitnick, C.L.; Bala, K.; Girshick, R. Inside-outside net: Detecting objects in context with skip pooling and recurrent neural networks. arXiv 2015, arXiv:1512.04143. https://doi.org/10.1109/CVPR.2016.314
Cheng, G.; Han, J.; Zhou, P.; Xu, D. Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection. IEEE Trans. Image Process. 2019, 28, 265-278. https://doi.org/10.1109/TIP.2018.2867198
Fan, D.-P.; Ji, G.-P.; Sun, G.; Cheng, M.-M.; Shen, J.; Shao, L. Camouflaged Object Detection. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13-19 June 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 2774-2784. https://doi.org/10.1109/CVPR42600.2020.00285
Girshick, R.; Donahue, J.; Darrell, T.; Malik, J.; Malik, J. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 24-27 June 2014. https://doi.org/10.1109/CVPR.2014.81
He, K.; Zhang, X.; Ren, S.; Sun, J. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. In European Conference on Computer Vision; Springer: Cham, Switzerland, 2014. https://doi.org/10.1007/978-3-319-10578-9_23
Girshick, R. Fast R-CNN. In Proceedings of the 2015 IEEE International Conference on Computer Vision, Santiago, Chile, 7-13 December 2015. https://doi.org/10.1109/ICCV.2015.169
R. Rachmat and S. Suhartono, "Comparative Analysis of Single Exponential Smoothing and Holt's Method for Quality of Hospital Services Forecasting in General Hospital", Bulletin of Comp. Sci. & Electr. Eng., vol. 1, no. 2, pp. 80-86, Aug. 2020. https://doi.org/10.25008/bcsee.v1i2.8
Published
How to Cite
Issue
Section
License
Copyright (c) 2021 Abhishek Mehta, Subhashchandra Desai, Ashish Chaturvedi

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
