Self-supervised memory learning for scene text image super-resolution
Computerised recognition of low-resolution scene text images has been a persistent challenge. To improve the recognition performance, image quality enhancement via image super-resolution technology provides an intuitive solution. Typical deep learning-based scene text image super-resolution methods assume that the image quality degradation from high-resolution images to their corresponding low-resolution counterparts can be represented by mapping well-distributed samples, which limits their reconstruction performance in a practical text recognition system. For real-world scenarios this assumption typically does not hold since image degradations arise from multiple sources during image capture and processing. In this paper, to alleviate this problem, we propose a novel self-supervised end-to-end memory network model for scene text image super-resolution. In particular, after extracting enriched and finer representations from low-resolution text images via a spatial refinement block, we introduce a memory-based network to yield an improved super-resolution model that can handle complex degradation sources. Furthermore, to boost the effectiveness of our method, we design a multi-term loss to exploit textual structure information, where, in addition to the traditional reconstruction loss, we embed a character perceptual loss and a boundary enhancement loss. Extensive experiments on different datasets demonstrate that our proposed MNTSR method effectively improves the recognition accuracy for several scene text image recognition models and achieves state-of-the-art results.
The source code is made available at https://github.com/xyzhu1/MNTSR
Funding
National Natural Science Foundation of China under Grant 62472443 and 62076255
Hunan Provincial Natural Science Foundation, China 2024JJ3032
Frontier Cross Project of Central South University, China (2023QYJC008)
General Project of Xiangjiang Lab, China (23XJ03008)
Key Research and Development Plan of Hunan province, China (2023SK2027)
History
School
- Science
Published in
Expert Systems with ApplicationsVolume
258Publisher
Elsevier Ltd.Version
- VoR (Version of Record)
Rights holder
© The Author(s)Publisher statement
This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/)Acceptance date
2024-08-26Publication date
2024-09-29Copyright date
2024ISSN
0957-4174eISSN
1873-6793Publisher version
Language
- en