Prioritizing cases from a multi-institutional cohort for a dataset of pathologist annotations
Journal Title
Journal of Pathology Informatics
Publication Type
Research article
Abstract
OBJECTIVE: With the increasing energy surrounding the development of artificial intelligence and machine learning (AI/ML) models, the use of the same external validation dataset by various developers allows for a direct comparison of model performance. Through our High Throughput Truthing project, we are creating a validation dataset for AI/ML models trained in the assessment of stromal tumor-infiltrating lymphocytes (sTILs) in triple negative breast cancer (TNBC). MATERIALS AND METHODS: We obtained clinical metadata for hematoxylin and eosin-stained glass slides and corresponding scanned whole slide images (WSIs) of TNBC core biopsies from two US academic medical centers. We selected regions of interest (ROIs) from the WSIs to target regions with various tissue morphologies and sTILs densities. Given the selected ROIs, we implemented a hierarchical rank-sort method for case prioritization. RESULTS: We received 122 glass slides and clinical metadata on 105 unique patients with TNBC. All received cases were female, and the mean age was 63.4 years. 60% of all cases were White patients, and 38.1% were Black or African American. After case prioritization, the skewness of the sTILs density distribution improved from 0.60 to 0.46 with a corresponding increase in the entropy of the sTILs density bins from 1.20 to 1.24. We retained cases with less prevalent metadata elements. CONCLUSION: This method allows us to prioritize underrepresented subgroups based on important clinical factors. In this manuscript, we discuss how we sourced the clinical metadata, selected ROIs, and developed our approach to prioritizing cases for inclusion in our pivotal study.
Publisher
Elsevier
Keywords
Data; Prioritization; Sampling; Validation
Department(s)
Laboratory Research
Open Access at Publisher's Site
https://doi.org/10.1016/j.jpi.2024.100411
Terms of Use/Rights Notice
Refer to copyright notice on published article.


Creation Date: 2026-01-13 04:29:43
Last Modified: 2026-01-13 04:29:53
An error has occurred. This application may no longer respond until reloaded. Reload 🗙