HEALTH AI BIAS

DATATHON '25

Thank you to everyone who took part in Datathon '24! We’re excited to welcome you back in 2025! You can pre-register using the link below, and we’ll let you know as soon as registration opens.

18-24 AUG’25

Atlanta, Georgia, United States

BREATHTAKING

Join us at the Health AI Bias Datathon '25! Collaborate, compete, and create solutions to tackle biases in AI, ensuring a fairer future for all.

Special Guests

Featured Speakers

Judy Gichoya

Emory University

Hari Trivedi

Emory University

Janice Newsome

Emory University

MinJae Woo

Clemson University

Mornin Feng

National University of Singapore

Po-Chih Kuo

National Tsing Hua University

Leo Anthony Celi

Harvard Medical School

Saptarshi P.

Indiana University Purdue University Indianapolis

Enzo Ferrante

Universidad de Buenos Aires

Marly Van Assen

Emory University

schedule

Agenda

7:00 am - 4:30 pm

Location - HSRB Rollins Auditorium

1760 Haygood Dr NE, Atlanta, GA 30322

7:00 am - 8:00 am

Continental Breakfast & Registration

8:00 am - 8:30 am

Introduction & Welcome

A brief welcome to the summer school, covering expectations and basic rules, information on facilities, and a transition to the icebreaker.

Hari Trivedi

Emory University

Judy Gichoya

Emory University

8:30 am - 9:00 am

Icebreaker

Hari Trivedi

Emory University

Judy Gichoya

Emory University

9:00 am - 9:30 am

Technical Setup

9:30 am - 9:50 am

Break

9:50 am - 10:30 am

"I've got a blank space, baby!"

Addressing "Missingness" in EHR Data

Wesley Anderson

Critical Path Institute

Smith Heavner

Critical Path Institute

10:30 am - 12:00 pm

Tabular Data 2: End-to-end model development using Emory ICU data

Learn to handle tabular data through Exploratory Data Analysis (EDA) and review top papers on descriptive analysis to extract meaningful insights from ICU datasets.

Atika Rahman Paddo

Oklahoma State University

Saptarshi P.

Indiana University Purdue University Indianapolis

12:00 pm - 1:00 pm

Lunch

1:00 pm - 2:00 pm

Tabular Data 3: Statistics for machine learning using the National Inpatient Sample Dataset

A comprehensive overview of key statistics and trends within the National Inpatient Sample dataset, tailored for machine learning applications.

MinJae Woo

Clemson University

2:00 pm - 2:30 pm

Emory CXR: Patient’s Imaging Journey in the Health Care System

Overview of the imaging informatics journey in healthcare systems.

Judy Gichoya

Emory University

2:30 pm - 3:00 pm

Break

3:00 pm - 3:45 pm

Emory CXR: Intro to CXR Problem Formulation and Opportunities

This session will cover the types of chest X-rays, including different views and portable options. We will discuss use cases such as opportunistic screening, explain how to convert images to numerical data, and describe the importance of channels.

Marly Van Assen

Emory University

3:45 pm - 4:30 pm

Emory CXR: Data Preprocessing

In this hands-on session, participants will learn how to open a chest X-ray image and perform image preprocessing.

Theo Dapamede

Emory University

7:30 am - 4:30 pm

Location - HSRB Rollins Auditorium

1760 Haygood Dr NE, Atlanta, GA 30322

7:30 am - 8:20 am

Continental Breakfast & Registration

8:20 am - 8:30 am

Debrief & Housekeeping

Review of the day’s activities and important housekeeping notes.

Hari Trivedi

Emory University

Judy Gichoya

Emory University

8:30 am - 9:30 am

Finding the Correct Answer (Ground Truths)

Explore the types of labels needed (class, mask, bbox) and their sources (PCR, sequencing, reports, other modalities). Discuss the history of labels, hidden stratifications, label correlation, and alternative approaches for label extraction using segmentation, ImaGenome, and generative models.

Judy Gichoya

Emory University

9:30 am - 10:00 am

Break

10:00 am - 12:00 pm

Models on the CXR Dataset

Understand the end-to-end model pipeline, including data processing, results representation, and potential pitfalls.

Frank Li

Emory University

Po-Chih Kuo

National Tsing Hua University

12:00 pm - 1:00 pm

Lunch

1:00 pm - 2:30 pm

Inference and Stats on the CXR Dataset

Delve into inference using models, metrics for inference, and statistical analysis.

Frank Li

Emory University

Po-Chih Kuo

National Tsing Hua University

2:30 pm - 3:00 pm

Break

3:00 pm - 3:45 pm

EMBED: Introduction to Breast Screening and Diagnostic Pathways

An introduction to breast screening and diagnostic pathways.

3:45 pm - 4:30 pm

Introduction to the EMBED Dataset

Overview of the EMBED dataset and its applications.

Hari Trivedi

Emory University

7:30 am - 4:30 pm

Location - HSRB Rollins Auditorium

1760 Haygood Dr NE, Atlanta, GA 30322

7:30 am - 8:20 am

Continental Breakfast & Registration

8:20 am - 8:30 am

Debrief

Review of the day’s activities and important housekeeping notes.

Hari Trivedi

Emory University

Judy Gichoya

Emory University

8:30 am - 10:00 am

EMBED - Data Engineering

Curate a patient cohort with MagView and metadata, focusing on cancer/no cancer, normal vs. abnormal classifications, and stratification by breast density. Address common pitfalls and create a test set to highlight key issues.

Beatrice Brown-Mulry

Emory University

10:00 am - 10:30 am

Break

10:30 am - 11:00 am

EMBED - Data Engineering: Making Use of ROIs

Learn how to effectively leverage Regions of Interest (ROIs) in data engineering for enhanced analysis and insights.

Beatrice Brown-Mulry

Emory University

11:00 am - 12:00 pm

EMBED - Data Preprocessing

Prepare a dataset for training by performing DICOM preprocessing, stratifying by subgroups, applying VOI LUT, masking breast tissue, standardizing image orientation, and addressing preprocessing challenges.

Theo Dapamede

Emory University

12:00 pm - 1:00 pm

Lunch

1:00 pm - 2:00 pm

EMBED - Training Whole Image Classifier

Training a whole image classifier or other model for BI-RADS 0 vs. 1/2 categories.

Du Hao

National University of Singapore

Mornin Feng

National University of Singapore

2:00 pm - 3:00 pm

EMBED - Model Evaluation and Inference

Utilize our model for running inference, calculating statistical metrics, and evaluating subgroups. Extract imaging findings and analyze demographics, including race, ethnicity, age, density, and pathology.

Beatrice Brown-Mulry

Emory University

3:00 pm - 3:30 pm

Break

3:30 pm - 4:30 pm

EMBED - Model Evaluation and Inference

Utilize our model to run inference, calculate statistical metrics, and evaluate subgroups. Discuss appropriate metrics like F1, AUC, specificity, and sensitivity, along with bootstrapping, class balancing, and potential pitfalls in model performance evaluation.

Beatrice Brown-Mulry

Emory University

7:30 am - 4:30 pm

Location - Goizueta Business School 208

1300 Clifton Road, Atlanta, GA 30322

7:30 am - 8:20 am

Continental Breakfast & Registration

8:20 am - 8:30 am

Debrief

Hari Trivedi

Emory University

Judy Gichoya

Emory University

8:30 am - 9:15 am

An overview of bias and unfairness in AI/ML

Prof. Odom

Gabriel J. Odom

Florida International University

9:15 am - 9:30 am

EMBED - Breast MRI Overview

Hari Trivedi

Emory University

9:30 am - 10:00 am

Break

10:00 am - 12:00 pm

EMBED - Breast MRI Model

Classify background parenchymal enhancement (BPE) on breast MRI. Select and extract data from 1,000 MRIs, explore the data, and fine-tune pre-trained 3D models. Evaluate and compare the performance of 3D classifiers, stacked image classifiers, middle images, and eigen images.

Rohan Satya Isaac

Emory University

12:00 pm - 1:00 pm

Lunch

1:00 pm - 2:00 pm

Radiopathomics

Exploring the intersection of radiology and pathology through advanced data analysis techniques.

Bolin Song

Emory University

2:00 pm - 2:30 pm

Humanizing AI: The Patient's Influence in Reducing Bias in Women's Health

In AI-driven healthcare, especially in women's health and maternal care, it's crucial to recognize that clinicians and patients are key stakeholders directly impacted by these technologies. This presentation will highlight the importance of considering the human aspect in developing equitable and effective AI tools.

Tia Pope

North Carolina Agricultural and Technical State University

2:30 pm - 3:00 pm

Break

3:00 pm - 3:45 pm

Bias in AI

Examination of biases present in AI models and their implications in healthcare.

Enzo Ferrante

Universidad de Buenos Aires

3:45 pm - 4:30 pm

AWS SageMaker Workshop

Learn to use AWS SageMaker to build, train, and deploy machine learning models efficiently.

7:30 am - 8:00 pm

Location - HSRB Rollins Auditorium

1760 Haygood Dr NE, Atlanta, GA 30322

7:30 am - 8:30 am

Continental Breakfast & Registration

8:30 am - 9:00 am

Welcome to Emory

Janice Newsome

Emory University

Hari Trivedi

Emory University

Judy Gichoya

Emory University

9:00 am - 10:00 am

Keynote: Why AI Needs Feminism

Dr. Lauren Klein discusses the critical role of feminism in shaping the future of artificial intelligence.

Lauren Klein

Emory University

10:00 am - 10:30 am

Fairness in AI-based Software Development from a Software Engineering Perspective

Explore strategies and insights on developing AI technologies that contribute positively to society.

Chitsutha Soomlek

Khon Kaen University

10:30 am - 11:00 am

Break

11:00 am - 12:00 pm

Deploying AI in Real-World Settings

Gain insights from early evaluations of AI-Doc with Theo, understand real-world validation through the PRECISE trial with Siva Bhavani, and explore external validation of mammogram AI models with Hari. Join a panel Q&A session to delve deeper into deploying AI in practical applications. Moderator: Dr. Enzo Ferrante

Theo Dapamede

Emory University

Hari Trivedi

Emory University

12:00 pm - 1:00 pm

Breast Cancer Disparities: A Grand Challenge

Keynote by Lauren McCullough addressing the significant disparities in breast cancer, followed by a Q&A session to discuss challenges and solutions. Moderator: Dr. Janice Newsome

Lauren E. McCullough

Emory University

Janice Newsome

Emory University

1:00 pm - 2:00 pm

Lunch

2:00 pm - 2:15 pm

Scaling collaborative learning - Update from the MIT Critical Data Network

Sadia Afreen

2:15 pm - 3:00 pm

Keynote: AI Bias to AI by Us, for All of Us

Dr. Leo Celi explores the transition from AI bias to creating inclusive AI designed by and for everyone, followed by a Q&A session. Moderator: Dr. Po-Chih Kuo

Leo Anthony Celi

Harvard Medical School

3:00 pm - 4:00 pm

Moving Beyond Race for Disparities Research

Explore strategies that move beyond race and sex groups for disparities research. Learn best practices for collecting skin tone data with Ian Wong, and delve into genomics and ancestry considerations for health equity research with Manoj Bhasin. Join a panel Q&A session to discuss these approaches. Moderator: Dr. Idris Muhammad

An-Kwok Ian Wong

Duke University

Manoj Bhasin

Emory University

Mornin Feng

National University of Singapore

4:30 pm - 8:00 pm

Reception Location - Miller Ward Alumni House

A shuttle bus will transport participants from the Symposium to the Reception on Friday, August 23rd. The bus will run continuously between HSRB and Miller Ward from 4:15 pm to 8:30 pm. Participants returning to HSRB can take the C Shuttle to the Starvine parking deck to reach their cars. The first bus leaves HSRB at 4:15 pm, and the last bus returns to HSRB at 8:30 pm.

4:30 pm - 8:00 pm

Opening Reception

Join us for food, entertainment, and networking as we kick off the datathon with an exciting opening ceremony.

Janice Newsome

Emory University

Hari Trivedi

Emory University

Judy Gichoya

Emory University

7:15 am - 6:00 pm

Location - School of Medicine (SOM, Room 130)

100 Woodruff Circle, Atlanta, GA 30322

7:15 am - 7:30 am

Datathon - Doors Open

7:30 am - 8:00 am

Continental Breakfast & Registration

8:00 am - 8:15 am

Welcome to Datathon

Hari Trivedi

Emory University

Judy Gichoya

Emory University

8:15 am - 8:25 am

Dataset Description: EMBED

Beatrice Brown-Mulry

Emory University

8:25 am - 8:35 am

Dataset Description: Emory CXR

Frank Li

Emory University

8:35 am - 8:45 am

Dataset Description: Emory ICU Dataset

Atika Rahman Paddo

Oklahoma State University

8:45 am - 8:55 am

Dataset Description: EnCoDE

An-Kwok Ian Wong

Duke University

8:55 am - 9:15 am

Problem description and group overview

Atika Rahman Paddo

Oklahoma State University

Judy Gichoya

Emory University

9:15 am - 9:30 am

Group Photo

9:15 am - 9:30 am

Team transitions to rooms

9:30 am - 11:00 am

Datathon Begins

11:00 am - 11:15 am

Mentoring Session

11:15 am - 1:00 pm

Datathon Continues

1:00 pm - 2:00 pm

Lunch

2:00 pm - 4:30 pm

Datathon Continues

4:30 pm - 5:00 pm

Check-In with Groups

5:00 pm - 6:00 pm

Group Exercise

6:00 pm - 7:30 pm

Dinner

7:15 am - 2:30 pm

Location - School of Medicine (SOM, Room 130)

100 Woodruff Circle, Atlanta, GA 30322

7:15 am - 7:30 am

Datathon - Day 2 - Doors Open

7:30 am - 8:00 am

Continental Breakfast

8:00 am - 8:15 am

Debrief

8:15 am - 11:00 am

Datathon Continues

11:00 am - 11:15 am

Mentoring Session

11:15 am - 12:00 pm

Final Slides & Code Submission

12:00 pm - 1:00 pm

Lunch

1:00 pm - 2:00 pm

Presentations

Each group is allocated 5 minutes for its presentation.

2:00 pm - 2:30 pm

Judging

2:00 pm - 2:30 pm

Wrap Up & Results

HALL VIEW

Lecture Halls

ticket

Need a scholarship? Apply for it here.

$800

INCLUDES

Summer School: 19th to 22nd August
Symposium: 23rd August
Datathon: 24th and 25th August

$100

INCLUDES

Symposium: 23rd August
Datathon: 24th and 25th August

$50

INCLUDES

Symposium: 23rd August

DON'T MISS OUT

join us!

Don’t miss out on this opportunity.