Sunday, February 15, 2026

Curation at scale: Integrating AI into community curation

Award Number: R01LM013871
ORGANIZATION: NATIONAL LIBRARY OF MEDICINE
OPDIV: NIH
AWARD CLASS: DISCRETIONARY
AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)
PERIOD OF PERFORMANCE START DATE: 09/01/2021
PERIOD OF PERFORMANCE END DATE: 05/31/2025

Project Summary

Biological knowledgebases are a critical resource for researchers and accelerate scientific discoveries by providing manually curated, machine-readable data collections. However, the aggregation and manual curation of biological data is a labor-intensive process that relies almost entirely on professional biocurators. Two approaches have been advanced to help with this problem: natural language processing (NLP), which here covers text mining (TM) and machine learning (ML), and engagement of researchers (community curation). However, neither of these approaches alone is sufficient to address the critical need for increased efficiency in the biocuration process.

Our solution to these challenges is an NLP-enhanced community curation portal, Author Curation to Knowledgebase (ACKnowledge). The ACKnowledge system, currently implemented for the C. elegans literature, couples statistical methods and text mining algorithms to enhance community curation of research articles. We propose to strengthen and expand ACKnowledge by adding other species to our pipeline, incorporating more sophisticated machine learning models, and presenting sentence-level entity and concept extraction for more detailed author curation. In addition, we will develop an Author Curation Portal (ACP) to allow authors to easily upload and curate their own documents.

Taken together, these enhancements will allow us to maximize community curation efforts by leveraging author expertise in multiple areas of biology, while at the same time supporting authors with as much AI-assisted curation as possible. This reciprocal interaction will improve not only the content of knowledgebases but also the AI methods themselves, as we will receive valuable feedback on our models. By developing an Author Curation Portal, we will further empower authors to participate in the curation process and alert knowledgebases to key information that can, and should, be readily discoverable in accordance with FAIR (Findable, Accessible, Interoperable, and Reusable) data principles.


Issue Date FY	Funding FY	Legal Entity Name	Legal Entity Address	Legal Entity City	Legal Entity State	Legal Entity Zip Code	Legal Entity County	Legal Entity Country	Assistance Listing	Award Code	Budget Year	Action Date	Action Type	Action Amount

Issue Date FY: 2025 ( Subtotal = $0 )
2025	2024	CALIFORNIA INSTITUTE OF TECHNOLOGY	1200 E CALIFORNIA BLVD	PASADENA	CA	91125	LOS ANGELES	USA	Medical Library Assistance	000	4	7/16/2025	NON-COMPETING CONTINUATION	$0
														Subtotal = $0

Issue Date FY: 2024 ( Subtotal = $355,938 )
2024	2024	CALIFORNIA INSTITUTE OF TECHNOLOGY	1200 E CALIFORNIA BLVD	PASADENA	CA	91125	LOS ANGELES	USA	Medical Library Assistance	000	4	5/30/2024	NON-COMPETING CONTINUATION	$355,938
														Subtotal = $355,938

Issue Date FY: 2023 ( Subtotal = $355,938 )
2023	2023	CALIFORNIA INSTITUTE OF TECHNOLOGY	1200 E CALIFORNIA BLVD	PASADENA	CA	91125	LOS ANGELES	USA	Medical Library Assistance	000	3	4/25/2023	NON-COMPETING CONTINUATION	$355,938
														Subtotal = $355,938

Issue Date FY: 2022 ( Subtotal = $355,938 )
2022	2022	CALIFORNIA INSTITUTE OF TECHNOLOGY	1200 E CALIFORNIA BLVD	PASADENA	CA	91125	LOS ANGELES	USA	Medical Library Assistance	000	2	5/12/2022	NON-COMPETING CONTINUATION	$355,938
														Subtotal = $355,938

Issue Date FY: 2021 ( Subtotal = $355,938 )
2021	2021	CALIFORNIA INSTITUTE OF TECHNOLOGY	1200 E CALIFORNIA BLVD	PASADENA	CA	91125	LOS ANGELES	USA	Medical Library Assistance	000	1	9/1/2021	NEW	$355,938
														Subtotal = $355,938

Grand Total All Awards = $1,423,752
