Skip to main content
NIH Intramural Research Program, Our Research Changes Lives

Navigation controls

  • Search
  • Menu

Social follow links

  • Podcast
  • Instagram
  • Twitter
  • YouTube
  • LinkedIn

Main navigation

  • About Us
    • What Is the IRP?
    • History
    • Honors
      • Nobel Prize
      • Lasker Award
      • Breakthrough Prize
      • Shaw Prize
      • Presidential Early Career Award for Scientists and Engineers (PECASE)
      • Presidential Medal of Freedom
      • National Medal of Science
      • Searle Scholars
      • The National Academy of Sciences
      • The National Academy of Medicine
      • The National Academy of Engineering
      • The American Academy of Arts and Sciences
      • National Medal of Technology & Innovation
      • Samuel J. Heyman Service to America Medals
      • Crafoord Prize
      • Fellows of the Royal Society
      • Canada Gairdner Awards
    • Organization & Leadership
    • Our Programs
      • NCI
      • NEI
      • NHGRI
      • NHLBI
      • NIA
      • NIAAA
      • NIAID
      • NIAMS
      • NIBIB
      • NICHD
      • NIDA
      • NIDCD
      • NIDCR
      • NIDDK
      • NIEHS
      • NIMH
      • NIMHD
      • NINDS
      • NINR
      • NLM
      • CC
      • NCATS
      • NCCIH
    • Research Campus Locations
    • Contact Information
  • Our Research
    • Scientific Focus Areas
      • Biomedical Engineering & Biophysics
      • Cancer Biology
      • Cell Biology
      • Chemical Biology
      • Chromosome Biology
      • Clinical Research
      • Computational Biology
      • Developmental Biology
      • Epidemiology
      • Genetics & Genomics
      • Health Disparities
      • Immunology
      • Microbiology & Infectious Diseases
      • Molecular Biology & Biochemistry
      • Molecular Pharmacology
      • Neuroscience
      • RNA Biology
      • Social & Behavioral Sciences
      • Stem Cell Biology
      • Structural Biology
      • Systems Biology
      • Virology
    • Principal Investigators
      • View by Investigator Name
      • View by Scientific Focus Area
    • Accomplishments
      • View All Accomplishments by Date
      • View All Health Topics
      • The Body
      • Health & Wellness
      • Conditions & Diseases
      • Procedures
    • Accelerating Science
      • Investing in Cutting-Edge Animal Models
      • Creating Cell-Based Therapies
      • Advancing Computational and Structural Biology
      • Combating Drug Resistance
      • Developing Novel Imaging Techniques
      • Charting the Pathways of Inflammation
      • Zooming in on the Microbiome
      • Uncovering New Opportunities for Natural Products
      • Stimulating Neuroscience Research
      • Pursuing Precision Medicine
      • Unlocking the Potential of RNA Biology and Therapeutics
      • Producing Novel Vaccines
    • Research in Action
      • View All Stories
      • No Tech? No Problem!
      • Battling Blood-Sucking Bugs
      • Unexpected Leads to Curb Addiction
      • Shaping Therapies for Sickle Cell Disease
    • Trans-IRP Research Resources
      • Supercomputing
    • IRP Review Process
    • Commercializing Inventions
  • NIH Clinical Center
    • Clinical Center Facilities
    • Clinical Faculty
    • Advancing Translational Science
    • Clinical Trials
      • Get Involved with Clinical Research
      • Physician Resources
  • News & Events
    • In the News
    • I am Intramural Blog
    • Speaking of Science Podcast
    • SciBites Video Shorts
    • The NIH Catalyst Newsletter
    • Events
  • Careers
    • Faculty-Level Scientific Careers
    • Trans-NIH Scientific Recruitments
      • Stadtman Tenure-Track Investigators
        • Science, the Stadtman Way
      • Lasker Clinical Research Scholars
      • Independent Research Scholar
    • Scientific & Clinical Careers
    • Administrative Careers
  • Research Training
    • Program Information
    • Training Opportunities
    • NIH Work/Life Resources
The NIH Catalyst: A Publication About NIH Intramural Research

National Institutes of Health • Office of the Director | Volume 33 Issue 4 • July–August 2025

From ToxPipe to FAIRkit

NIH-Built AI Chatbots Are Helping Scientists Sift Through the Data

BY PAIGE JARREAU, NIA; and THE NIH CATALYST STAFF

illustration of a chatbot

NIH researchers are creating an array of AI-powered tools to address unique intramural challenges.

Artificial intelligence (AI) tools are taking root across NIH, reshaping how researchers access information, analyze data, and advance biomedical discovery. From generative chatbots that streamline scientific queries to machine learning models that help harmonize massive datasets, AI is proving to be a powerful partner in tackling complex hypotheses in research topics spanning from toxicology to dementia and beyond. There are many, so let’s chat about ’em!

Chatbot creation for NIH biomedical research

Speakers at a June 11 NIH Library event that featured members from the NIH Generative AI Community of Practice showcased a range of AI-driven chatbot initiatives under development across the agency. Speakers and topics at a roundtable discussion, archived on the NIH Library YouTube channel, included:

  • “Generative AI Chatbots in the NIH Landscape: Foundations, Opportunities, and Considerations” by Alicia Lillich, NIH Library
  • “Chatbot for the Intramural Research Program, or ChIRP,” by Steevenson Nelson, OD
  • “ToxPipe: Chatbots and Retrieval-Augmented Generation on Toxicological Data Streams” by Trey Saddler, NIEHS
  • “CARDbiomedbench: Biomedical Benchmark of Chatbots, CARD.AI Arena, CARD.AI, FAIRkit” by Faraz Faghri, NIA
  • “AI Chatbots: Opportunities and Considerations at NLM” by Dianne Babski, NLM
  • “Using AI to Create a Travel Chatbot” by Fiona Vaughans, NCI

Lillich, an emerging technology specialist at the NIH Library, presented opportunities and ethical considerations in deploying generative AI chatbots within NIH’s information ecosystem. These tools, which use large language models (LLMs) to assist researchers, have the potential to improve literature discovery, scientific synthesis, and internal knowledge management, she noted.

Nelson shared ChIRP updates and reminded attendees that it was designed to respond to NIH intramural queries within a more secure environment than public LLMs offer. Read more about ChIRP in this recent NIH Catalyst article. ChIRP is being reimagined as a tool for all of NIH, encompassing both research and administrative tasks. The new and improved tool will be rolled out to the entire NIH community later this summer.

Saddler, a data scientist in the NIEHS Division of Translational Toxicology, highlighted ToxPipe, a chatbot-enabled platform that lets users explore toxicology databases through an intuitive interface powered by LibreChat. Saddler also demonstrated ChemBioTox, which uses autonomous AI agents to answer toxicology questions such as “What are the exposure levels of bisphenols?.” Responses are generated through multistep reasoning. The functionality can be evaluated via open-source tools that allow scientists to rate accuracy and refine results.

Faghri from NIA’s Center for Alzheimer’s and Related Dementias (CARD), presented several AI-driven platforms, including CARDBiomedBench, CARD.AI Arena, and FAIRkit. These tools, which are described in more detail below, “are using AI to better describe diseases, predict disease progression, and identify new drug targets,” said Faghri, a computer scientist in the advanced analytics expert group at CARD. “AI-powered tools are helping us solve problems that weren’t solvable before.”

Headshot of Faraz Faghri

CREDIT: NIA, CARD

Faraz Faghri

Advancing data harmonization

CARD’s advanced analytics expert group is applying AI to one of biomedical research’s greatest challenges—data harmonization. Different research groups collect different types of patient data, including genetic profiles, imaging, and environmental exposures. These datasets are often incompatible by default, which complicates, if not impedes, large-scale analyses. Standardization of so many data points is impossible to achieve yet incredibly necessary as biomedical research speeds toward a future of open access data and the application of machine learning.

The DIVER (Data Inventory and Verification Environment for Research) platform, which uses OpenAI’s GPT models to automate the creation of common data elements (CDEs), may help. Faghri’steam applied DIVER to 31 dementia-related datasets and achieved interoperability scores of up to 60% when combining data from the Alzheimer’s Disease Neuroimaging Initiative and the Global Parkinson’s Genetic Program (PMID: 39484274). By automating what is typically a labor-intensive and error-prone process, DIVER enabled them to merge datasets and perform cross-study comparisons, which is a critical step toward identifying early biomarkers and validating therapeutic targets.

Predicting missing data

Missing or incomplete data, especially in electronic health records (EHRs), is another persistent obstacle in biomedical research. Traditional data collection and analysis techniques often fall short in capturing the complexity of health care information consistently.

The CARD team developed a machine learning framework called MUSE (Multimodal Unsupervised Embedding), which helps to predict missing values in patient data. MUSE uses graph neural networks to analyze the relationships among patient data across multiple data types such as brain scans, cognitive scores, and biomarkers. Rather than addressing each data gap in isolation, MUSE models the entire patient ecosystem to generate more accurate predictions.

The model improved predictions of Alzheimer’s disease progression by more than 3% compared with standard approaches. “There’s value in retaining data even from patients with large missing segments,” Faghri said. “We’re trying to figure out the broader structure of the data system and see where people with missing data might fit in. Graph neural networks help us connect the dots.”

An AI web crawler for 508 compliance

Dianne Babski, director of the NLM User Services and Collection Division, presented on NLM’s efforts to pilot human-centered AI. Babski demonstrated the NLM Web Accessibility Assistant, which was created by Dan Wendling to aid in making webpage content more accessible for users. The assistant identifies accessibility issues to help ensure Section 508 compliance and improve user accessibility. The assistant recommends fixes and provides code to make those webpage changes. To date, it has flagged over 67 unique error types across 9,000 website pages.

Babski and Nick Weber, acting director of CIT’s Office of Scientific Computing Services, co-chair the NIH Generative AI Community of Practice group. Hundreds of NIHers attend the group’s monthly meetings. Will we see you there?

The NIH Office of Science Policy is currently seeking input on responsible development of generative AI tools using controlled-access human genomic data. NIH encourages staff and stakeholders to comment on best practices for mitigating data leakage while promoting innovation. Comments are due by July 16, 2025, and a roundtable discussion will follow on July 17. Submit your feedback and learn more here: NIH Comment Form.

Additional resources shared at the event

  • GitHub: Learn more about NIH GitHub by emailing GitHub@nih.gov.

CARD tools and benchmarks on GitHub:

  • https://github.com/NIEHS/ToxPipe
    • https://github.com/NIH-CARD/CARDBiomedBench

GitHub Copilot clone: https://continue.dev

  • CARD.AI Arena: https://cardai-arena-809832168532.us-central1.run.app/
  • PubTator Central (NLM): extracts information from PubMed abstracts and articles to create annotations of biomedical concepts for use with AI
  • Prompt engineering tips and tricks: https://www.promptingguide.ai
  • Blog post by CARD’s advanced analytics expert group: Can GPT-4.5, Claude 3.7, and Gemini 2.0 Keep Up with Biomedical Research?

Watch the entire June 11 event to learn more about the tools discussed above and much more. 

This page was last updated on Thursday, July 10, 2025

  • Issue Overview
  • Features
    • Calling All Creators and Innovators to the BETA Center Makerspace
    • From ToxPipe to FAIRkit
    • The Reservoir Within: How Nitrate Secures Our Nitric Oxide Supply
    • WSA Hosts Annual Anita B. Roberts Lecture
  • Departments
    • From the Deputy Director for Intramural Research
    • Catalytic Research
    • News You Can Use
    • Snapshots
    • Colleagues: Recently Tenured
    • The SIG Beat
    • Photographic Moment
    • Announcements
  • Issue Contents
  • Download this issue as a PDF

Catalyst menu

  • Current Issue
  • Previous Issues
  • About The NIH Catalyst
  • Contact The NIH Catalyst
  • Share Your Story
  • NIH Abbreviations

Subscribe Today!

Subscribe to The NIH Catalyst Newsletter and receive email updates.

Subscribe

Get IRP Updates

Subscribe

  • Email
  • Print
  • Share Twitter Facebook LinkedIn

Main navigation

  • About Us
    • What Is the IRP?
    • History
    • Honors
    • Organization & Leadership
    • Our Programs
    • Research Campus Locations
    • Contact Information
  • Our Research
    • Scientific Focus Areas
    • Principal Investigators
    • Accomplishments
    • Accelerating Science
    • Research in Action
    • Trans-IRP Research Resources
    • IRP Review Process
    • Commercializing Inventions
  • NIH Clinical Center
    • Clinical Center Facilities
    • Clinical Faculty
    • Advancing Translational Science
    • Clinical Trials
  • News & Events
    • In the News
    • I am Intramural Blog
    • Speaking of Science Podcast
    • SciBites Video Shorts
    • The NIH Catalyst Newsletter
    • Events
  • Careers
    • Faculty-Level Scientific Careers
    • Trans-NIH Scientific Recruitments
    • Scientific & Clinical Careers
    • Administrative Careers
  • Research Training
    • Program Information
    • Training Opportunities
    • NIH Work/Life Resources
  • Department of Health and Human Services
  • National Institutes of Health
  • USA.gov

Footer

  • Home
  • Contact Us
  • IRP Brand Materials
  • HHS Vulnerability Disclosure
  • Web Policies & Notices
  • Site Map
  • Search