Research Topics
Over my career, my work has focused on developing and applying computer technologies to support research efforts, primarily in the domains of clinical, translational, and basic biomedical research. Much of this work has focused on building computer systems and cyberinfrastructures to acquire, process, manage, and analyze research data and to turn that data into useful knowledge.
The Office of Data Science (ODS), which I direct, was established to assist NIEHS and Environmental Health researchers in promoting FAIR+ principals for research data. The FAIR+ principals call for data to be findable, accessible, interoperable, reproducible, and computable. ODS staff work across multiple projects, collaborations, and communities of practice towards this goal. ODS projects have included a focus on: extracting knowledge from unstructured sources, principally literature; helping promote the development and adoption of data and metadata standards; building data systems to help researchers in accessing toxicology and climate change and health data; and more recently in applying natural language processing techniques and large language models to help guide research efforts and priorities.
Biography
I received a B.S. degree in in Physics in 1989 and a Ph.D. in computer science in 1999, both from the University of North Carolina at Chapel Hill. My work career has been split in three parts. My early career was in industry where I worked for several companies with a focus on software engineering, data analysis, and data mining in multiple domain areas prior to joining BD Technologies where my work shifted to developing biomedical data management and analysis platforms in support of stem cell research. I later joined the Renaissance Computing Institute (RENCI) where I created and led their biomedical informatics group and their data science group, served as PI, Investigator, or Key Personnel on multiple NIH, NSF, and DHS grants, and served as Chief Technology Officer. I began working at the NIEHS in 2017.
Selected Publications
- Cox S, Dong X, Rai R, Christopherson L, Zheng W, Tropsha A, Schmitt C. A semantic similarity based methodology for predicting protein-protein interactions: Evaluation with P53-interacting kinases. J Biomed Inform. 2020;111:103579.
- Walker VR, Schmitt CP, Wolfe MS, Nowak AJ, Kulesza K, Williams AR, Shin R, Cohen J, Burch D, Stout MD, Shipkowski KA, Rooney AA. Evaluation of a semi-automated data extraction tool for public health literature-based reviews: Dextr. Environ Int. 2022;159:107025.
- Holmgren SD, Boyles RR, Cronk RD, Duncan CG, Kwok RK, Lunn RM, Osborn KC, Thessen AE, Schmitt CP. Catalyzing Knowledge-Driven Discovery in Environmental Health Sciences through a Community-Driven Harmonized Language. Int J Environ Res Public Health. 2021;18(17).
- Fecho K, Bizon C, Miller F, Schurman S, Schmitt C, Xue W, Morton K, Wang P, Tropsha A. A Biomedical Knowledge Graph System to Propose Mechanistic Hypotheses for Real-World Environmental Health Observations: Cohort Study and Informatics Application. JMIR Med Inform. 2021;9(7):e26714.
- Chan LE, Thessen AE, Duncan WD, Matentzoglu N, Schmitt C, Grondin CJ, Vasilevsky N, McMurry JA, Robinson PN, Mungall CJ, Haendel MA. The Environmental Conditions, Treatments, and Exposures Ontology (ECTO): connecting toxicology and exposure to human health and beyond. J Biomed Semantics. 2023;14(1):3.
Related Scientific Focus Areas
This page was last updated on Friday, October 25, 2024