The digitisation of hundreds of millions of herbarium specimen images and their labels has created an unprecedented resource for taxonomy, ecology, and conservation, motivating the development of artificial intelligence (AI) solutions. Automated analysis of these high-resolution scans faces significant challenges, including data imbalance, information loss, model interpretability and explainability, and scalable Open-Set Recognition (OSR). This paper provides an in-depth algorithm-level review of AI methodologies for herbarium image classification, tracing the development from classical classification models like Convolutional Neural Network (CNN) and Vision Transformer (ViT) to cutting-edge multimodal frameworks. In addition to classification, the review further investigates vision-based analytical tasks critical to herbarium image analysis, including specimen image segmentation, label text identification using Large Language Model (LLM), and Human-in-the-Loop (HITL) quality assurance strategies. Furthermore, this review reveals practical challenges in specimen image analysis along with their promising solutions and potential future directions.<p></p>
Funding
The Central England NERC Training Alliance 2 (CENTA2)