UNC-Chapel Hill study shows AI can dramatically speed up digitizing natural history collections – EurekAlert!


News Release 5-Dec-2025

Image: A UNC research team checks a plant specimen at the UNC Herbarium. Credit: Shanna Oberreiter

University of North Carolina at Chapel Hill

A new study from UNC-Chapel Hill researchers shows that advanced artificial intelligence tools, specifically large language models (LLMs), can accurately determine the locations where plant specimens were originally collected, a process known as georeferencing. This task has traditionally been slow, expensive and dependent on significant manual effort. The team found that LLMs can complete this work with near-human accuracy while being significantly faster and more cost-effective. 

“Our study explores how large language models can take on one of the biggest bottlenecks in digitizing plant collections,” said Yuyang Xie, first author and postdoctoral researcher in the department of biology at UNC. “We are pioneering the use of these tools for georeferencing, a breakthrough that will accelerate the digitization of plant specimens and unlock new possibilities for ecological research.”

The research set out to answer a central question: Can AI automate one of the most time-consuming steps in digitizing natural history collections? The Carolina team found that it can. LLMs not only georeferenced specimens with an error margin of less than 10 kilometers, outperforming traditional methods, but also completed the task in a fraction of the time and at a fraction of the cost.
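For readers who want a concrete sense of what a georeferencing error in kilometers means, the sketch below computes the great-circle distance between a predicted location and a reference location and checks it against a 10-kilometer threshold. It is an illustration only: the release does not describe how the study measured error, so the haversine formula, the function names and the sample coordinates are all assumptions rather than the authors' method.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # mean Earth radius; a spherical approximation


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two points in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def within_threshold(predicted, reference, threshold_km=10.0):
    """True if the predicted (lat, lon) lies within threshold_km of the reference."""
    return haversine_km(*predicted, *reference) < threshold_km


# Hypothetical coordinates, chosen only to exercise the functions.
reference = (35.9132, -79.0558)
close_guess = (35.95, -79.05)
far_guess = (36.1, -79.0558)

print(within_threshold(close_guess, reference))
print(within_threshold(far_guess, reference))
```

A spherical Earth model is accurate enough at this scale; a study needing finer precision might use an ellipsoidal distance instead.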

“Recent advances in LLMs can potentially transform the georeferencing process, making it faster and more accurate,” said Xiao Feng, corresponding author and assistant professor in the department of biology at UNC. “This gives researchers unprecedented opportunities to advance our understanding of global biodiversity distributions.”

The implications are significant. An estimated 2–3 billion herbarium specimens exist worldwide, but only a small fraction have been digitized. Without digital records and spatial data, researchers face major limitations in tracking biodiversity loss, understanding species movement under climate change and analyzing ecosystem shifts. By deploying AI-powered georeferencing, scientists may soon be able to rapidly digitize vast natural history collections that have remained largely inaccessible.

“This technology allows us to unlock millions of records that are currently sitting in cabinets,” said Xie. “With the power of LLMs, we can rapidly digitize plant specimen data that will be critical for addressing global environmental challenges.”

Traditional approaches to georeferencing rely on manual interpretation, specialized software, or multiple rounds of expert review. The UNC study is among the first to apply LLMs to this task and to show they can outperform existing methods in accuracy, efficiency, and scalability. This new approach opens the door to digitizing natural history collections at a speed never before possible. 

The research paper is available online in Nature Plants at: https://www.nature.com/articles/s41477-025-02162-y  

Continue/Read Original Article Here: UNC-Chapel Hill study shows AI can dramatically speed up digitizing natural history collections | EurekAlert!

