Loughborough University
Browse

Self-supervised memory learning for scene text image super-resolution

Download (2.59 MB)
journal contribution
posted on 2025-03-05, 09:13 authored by Kehua Guo, Xiangyuan Zhu, Gerald Schaefer, Rui Ding, Hui FangHui Fang

Computerised recognition of low-resolution scene text images has been a persistent challenge. To improve the recognition performance, image quality enhancement via image super-resolution technology provides an intuitive solution. Typical deep learning-based scene text image super-resolution methods assume that the image quality degradation from high-resolution images to their corresponding low-resolution counterparts can be represented by mapping well-distributed samples, which limits their reconstruction performance in a practical text recognition system. For real-world scenarios this assumption typically does not hold since image degradations arise from multiple sources during image capture and processing. In this paper, to alleviate this problem, we propose a novel self-supervised end-to-end memory network model for scene text image super-resolution. In particular, after extracting enriched and finer representations from low-resolution text images via a spatial refinement block, we introduce a memory-based network to yield an improved super-resolution model that can handle complex degradation sources. Furthermore, to boost the effectiveness of our method, we design a multi-term loss to exploit textual structure information, where, in addition to the traditional reconstruction loss, we embed a character perceptual loss and a boundary enhancement loss. Extensive experiments on different datasets demonstrate that our proposed MNTSR method effectively improves the recognition accuracy for several scene text image recognition models and achieves state-of-the-art results.

The source code is made available at https://github.com/xyzhu1/MNTSR


Funding

National Natural Science Foundation of China under Grant 62472443 and 62076255

Hunan Provincial Natural Science Foundation, China 2024JJ3032

Frontier Cross Project of Central South University, China (2023QYJC008)

General Project of Xiangjiang Lab, China (23XJ03008)

Key Research and Development Plan of Hunan province, China (2023SK2027)

History

School

  • Science

Published in

Expert Systems with Applications

Volume

258

Publisher

Elsevier Ltd.

Version

  • VoR (Version of Record)

Rights holder

© The Author(s)

Publisher statement

This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/)

Acceptance date

2024-08-26

Publication date

2024-09-29

Copyright date

2024

ISSN

0957-4174

eISSN

1873-6793

Language

  • en

Depositor

Dr Hui Fang. Deposit date: 26 August 2024

Article number

125247

Usage metrics

    Loughborough Publications

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC