Loughborough University
Browse

A review of artificial intelligence in herbarium specimen image analysis

journal contribution
posted on 2025-10-13, 08:26 authored by Yuyue GuoYuyue Guo, Haibin CaiHaibin Cai, Gemma Bramley, Hannah Atkins, Baihua LiBaihua Li, Stephanos TheodossiadesStephanos Theodossiades
The digitisation of hundreds of millions of herbarium specimen images and their labels has created an unprecedented resource for taxonomy, ecology, and conservation, motivating the development of artificial intelligence (AI) solutions. Automated analysis of these high-resolution scans faces significant challenges, including data imbalance, information loss, model interpretability and explainability, and scalable Open-Set Recognition (OSR). This paper provides an in-depth algorithm-level review of AI methodologies for herbarium image classification, tracing the development from classical classification models like Convolutional Neural Network (CNN) and Vision Transformer (ViT) to cutting-edge multimodal frameworks. In addition to classification, the review further investigates vision-based analytical tasks critical to herbarium image analysis, including specimen image segmentation, label text identification using Large Language Model (LLM), and Human-in-the-Loop (HITL) quality assurance strategies. Furthermore, this review reveals practical challenges in specimen image analysis along with their promising solutions and potential future directions.<p></p>

Funding

The Central England NERC Training Alliance 2 (CENTA2)

Natural Environment Research Council

Find out more...

History

School

  • Science

Published in

Artificial Intelligence Review

Publisher

Springer (part of Springer Nature)

Version

  • AM (Accepted Manuscript)

Rights holder

© The Author(s)

Acceptance date

2025-09-23

ISSN

0269-2821

eISSN

1573-7462

Language

  • en

Depositor

Miss Yuyue Guo. Deposit date: 10 October 2025

Usage metrics

    Loughborough Publications

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC