Data analysis tools for leveraging massive public data to improve hypothesis-driven research

Award Number: R35GM144128
ORGANIZATION: NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES
OPDIV: NIH
AWARD CLASS: DISCRETIONARY
AWARD ACTIVITY TYPE: SCIENTIFIC/HEALTH RESEARCH (INCLUDES SURVEYS)
PERIOD OF PERFORMANCE START DATE: 04/01/2022
PERIOD OF PERFORMANCE END DATE: 02/28/2027

Project summary

There is a crisis of reproducibility and replicability of scientific results. This crisis is an increasing source of concern both in the scientific and popular press. The crisis is so acute that the United States Congress is currently investigating reproducibility of the scientific process. At the heart of this crisis is a collection of problems including small sample sizes, under-powered studies, under-trained data analysts, and an inability to directly leverage prior results in the statistical analysis of smaller, hypothesis-driven experiments using high-throughput technologies.

Advances in technology have dramatically reduced the cost and difficulty of collecting high-throughput molecular data. Large collections of raw data are increasingly publicly available but are usually incorporated into individual analyses by NIGMS and other investigators on an ad hoc basis. Meanwhile, the other costs of running a designed, hypothesis-driven study have not decreased at the same speed with technological advances. It is still expensive to identify, recruit, collect, and follow up samples even if the high-throughput measurements themselves are cheap. Despite the incredible amount of available public data, it is still common practice to perform statistical inference in these hypothesis-driven experiments study by study, only indirectly including previous data, estimates, and results. As a result, findings from these studies may be highly variable, unreliable, or unreplicable.

Our group has focused on developing statistical methods, data resources, software, and training that allow researchers to borrow strength empirically from public repositories, large-scale data generation projects, and crowd-sourced data to improve inference in individual, hypothesis-driven studies. We propose to build on our work in developing statistical data sources, methods, software, and training that facilitate and speed the work of our biological and medical collaborators. The result will be a research community that can take advantage of public data already collected at a large cost to the NIH to improve power, reduce required sample sizes, and improve replication in many new hypothesis-driven molecular studies of development and disorder.


Issue Date FY	Funding FY	Legal Entity Name	Legal Entity Address	Legal Entity City	Legal Entity State	Legal Entity Zip Code	Legal Entity COUNTY	Legal Entity COUNTRY	Assistance Listing	Award Code	Budget Year	Action Date	Action Type	Action Amount

Issue Date FY: 2026 ( Subtotal = $431,963 )
2026	2026	FRED HUTCHINSON CANCER CENTER	1100 FAIRVIEW AVE N	SEATTLE	WA	98109	KING	USA	Biomedical Research and Research Training	000	6	2/23/2026	NON-COMPETING CONTINUATION	$431,963
														Subtotal = $431,963

Issue Date FY: 2025 ( Subtotal = $430,679 )
2025	2025	FRED HUTCHINSON CANCER CENTER	1100 FAIRVIEW AVE N	SEATTLE	WA	98109	KING	USA	Biomedical Research and Research Training	001	5	4/4/2025	NON-COMPETING CONTINUATION	$430,679
2025	2024	FRED HUTCHINSON CANCER CENTER	1100 FAIRVIEW AVE N	SEATTLE	WA	98109	KING	USA	Biomedical Research and Research Training	000	4	11/8/2024	NON-COMPETING CONTINUATION	$0
														Subtotal = $430,679

Issue Date FY: 2024 ( Subtotal = $429,434 )
2024	2024	FRED HUTCHINSON CANCER CENTER	1100 FAIRVIEW AVE N	SEATTLE	WA	98109	KING	USA	Biomedical Research and Research Training	000	4	2/16/2024	NON-COMPETING CONTINUATION	$386,491
2024	2024	FRED HUTCHINSON CANCER CENTER	1100 FAIRVIEW AVE N	SEATTLE	WA	98109	KING	USA	Biomedical Research and Research Training	001	4	6/18/2024	NON-COMPETING CONTINUATION	$42,943
														Subtotal = $429,434

Issue Date FY: 2023 ( Subtotal = $428,224 )
2023	2023	FRED HUTCHINSON CANCER CENTER	1100 FAIRVIEW AVE N	SEATTLE	WA	98109	KING	USA	Biomedical Research and Research Training	001	3	2/23/2023	NON-COMPETING CONTINUATION	$428,224
2023	2022	THE JOHNS HOPKINS UNIVERSITY	3400 N CHARLES ST	BALTIMORE	MD	21218	BALTIMORE CITY	USA	Biomedical Research and Research Training	000	1	11/13/2022	NEW	$0
														Subtotal = $428,224

Issue Date FY: 2022 ( Subtotal = $431,270 )
2022	2022	FRED HUTCHINSON CANCER CENTER	1100 FAIRVIEW AVE N	SEATTLE	WA	98109	KING	USA	Biomedical Research and Research Training	002	2	8/30/2022	CHANGE OF GRANTEE / TRAINING INSTITUTION / AWARDING INSTITUTION	$404,433
2022	2022	JOHNS HOPKINS UNIVERSITY, THE	3400 N CHARLES ST	BALTIMORE	MD	21218	BALTIMORE CITY	USA	Biomedical Research and Research Training	000	1	3/29/2022	NEW	$409,375
2022	2022	JOHNS HOPKINS UNIVERSITY, THE	3400 N CHARLES ST	BALTIMORE	MD	21218	BALTIMORE CITY	USA	Biomedical Research and Research Training	001	1	8/29/2022	NEW	-$382,538
														Subtotal = $431,270

Grand Total All Awards = $2,151,570
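
The subtotals and grand total above can be cross-checked from the individual action amounts. The snippet below is a minimal sketch, not part of the award record: the amounts are transcribed by hand from the table, and the names `ACTIONS` and `subtotals_by_issue_fy` are hypothetical, chosen only for illustration.

```python
from collections import defaultdict

# (issue_date_fy, action_amount_usd) pairs transcribed by hand from the
# table above. Zero-dollar and negative actions are kept so the sums match.
ACTIONS = [
    (2026, 431_963),
    (2025, 430_679),
    (2025, 0),
    (2024, 386_491),
    (2024, 42_943),
    (2023, 428_224),
    (2023, 0),
    (2022, 404_433),
    (2022, 409_375),
    (2022, -382_538),
]


def subtotals_by_issue_fy(actions):
    """Sum action amounts per issue-date fiscal year (hypothetical helper)."""
    totals = defaultdict(int)
    for fiscal_year, amount in actions:
        totals[fiscal_year] += amount
    return dict(totals)


subtotals = subtotals_by_issue_fy(ACTIONS)
grand_total = sum(subtotals.values())

# Print in the same newest-first order the table uses.
for fiscal_year in sorted(subtotals, reverse=True):
    print(f"Issue Date FY {fiscal_year}: ${subtotals[fiscal_year]:,}")
print(f"Grand total: ${grand_total:,}")
```

Note that the FY 2022 subtotal nets out the negative action recorded on 8/29/2022 against the original award to Johns Hopkins, which is why it is lower than the sum of the two positive actions that year.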
