Diagnosing the Problem: Automated Metrics for Evaluating AI-Generated Medical Information

As the use of AI to obtain health information becomes more widespread, it brings both the potential to improve health literacy and access and the risk of spreading misinformation and perpetuating disparities. Diagnosing the Problem introduces a generalizable metric-based framework for evaluating the quality and safety of AI-generated medical communication by asking: what makes a response readable, helpful, and accurate? To operationalize these metrics, the project also includes a Python-based tool that automates scoring and generates detailed, user-friendly reports, making it easier to assess and improve how AI engages with medical information.

Intern: Gail McKinney

Mentor: Samantha Levy (AOS)

Physical Science

Error Detection on Noisy Quantum Devices

Characteristics of Geomagnetic Storms and Their Drivers

Evaluating and Developing Tools for Open Source Underwater Acoustic Databases

Baltimore at Risk: Strengthening Soil with Natural Biopolymers

Aerospace and Space-Related Engineering

The Success of Flight Begins on the Ground: Designing Non-Flight Mechanical Systems for Space Missions

AntHill: Designing a Futuristic Lunar Rover that Helps Establish Long-Term Human Presence on the Moon

NOVA: Amateur Weather Satellite Simulator

Modeling GPS for Mars

Revolutionizing Space Communication Testing

Autonomous Safe-Landing Drone with Machine Learning-Based Detection and Navigation

Application of Fault Management Engineering to Spacecraft Design

Mini COTS Dragonfly Avionics Box

Constructing Dragonfly: Engineering the Dragonfly Assembly Flow

Dragonfly MMRTG Integration: Fairing Simulator Assembly

Analysis of Neutral Wind Shears in the Low Thermosphere

SEEcrets Revealed: Photodiode Space Radiation Vulnerability Exposed Through Pulsed-Laser Single Event Effect Testing

Applying Fractional Calculus to Autonomous Control of Flight Systems

Engineering

Computer-Controlled Digital Embroidery Machine for eTextiles

Polyurea Aerogels for Body Armor Augmentation

Hyperspectral Imaging: The Solution to Your Grainy Images

Identifying Aircraft Type Using ADS-B Transponder Phase Information

Electric Unfolding Rotorcraft Observational Platform Armament (EUROPA)

Quadcopter Digital Twin and Optimization

Experimental Rope Drive Testing for Continuous Chromatography

Low Budget Embedded System Synthesizer Design for Improved Music Education Opportunity

Ambient Intelligence for Patient Care Environments

Free-Space Optical Comms: How Lasers Can Transmit Data

Programming and Computer Science

A Spotify Data-Driven Web App for Clustering Tracks Across Eras

Plug-and-Play Robotics: Modular Sensor Streaming for Any Mission

SmartSched: Empowering Business Owners and Clients Through Seamless Appointments

Gitlab Ticket Creation Tool

Math and Data Analysis

How Do Brains Develop? Insights into Neurological Development from Electron Microscopy Data

Modeling and Analysis of Radar Beams Intersection

Modeling and Mitigating Obesity Spread Using Graph Theory and Decentralized Control

Strategic Tracking with Conflicting Filters: A Study of Kalman and Particle Filters in Nonlinear Game-Theoretic Systems

IT and Cybersecurity

Crack The Hashword - Using Reinforcement Learning For Faster Password Cracking

VictimVM: Misconfiguring a Virtual Machine to Develop a Hacker Mindset

Insider Threat Analysis and Direct Access Forensics

Environmental, Earth and Life Sciences

Cost-Effective Fabrication of Hollow Fibers Using Selective Laser Sintering (SLS) with AI-Optimized Motor Vibration

CT Image-Based FE Modeling for Femoral Lesion Assessment

AI and Machine Learning

Automatic Speech Recognition: Sanskrit to English

Can AI Really Do Physics? Investigating AI Agents' Conceptual and Mathematical Reasoning Through Textbook Physics Problems

Aerial Imagery Reconnaissance for Collaborative Search & Rescue

Diagnosing the Problem: Automated Metrics for Evaluating AI-Generated Medical Information

Large Language Models for Improving Human-AI Collaboration

Prompted to Lie? Evaluating How Prompt Specificity Influences Hallucinations in Large Language Models

Intelligence with Integrity: Policy Guidelines for Responsible AI Systems

AI Wargaming for Future Command and Control in ARMA 3 Simulation

Modeling the Spread of Rumors in Social Networks Using LLMs

Detecting adversarial backdoor attacks on computer vision systems

Predictive Macroeconomic Trends Using Large Language Models

Creating a Reconnaissance Blind Chess Bot using Agentic AI

Parking Spotter Upgrades

LLM-Driven Embodied AI Task Generation via Genesis

A Data Processing Pipeline to Enable Machine Learning

SCENT-AI: An AI Agent that plays Foxhound Through Fog

Understanding AI Models for Global Weather Forecasting

Leveraging Large Language Models for Robotic Control

Applying Transformers to Recognize American Sign Language: An Odyssey

Agentic AI for Collaborative Problem Solving

LLM-Assisted Agentic Market Analysis (LLAAMA)

Synthetic image generation and discrimination from NIST GenAI challenge

STEM Communications and Education

GPT-Generated Health Visuals

Artificial Intelligence and STEM Education

SciOly PrepBot: An Agentic AI Framework for Creating Practice Tests

Modeling Music Listening Behaviors Spread Using Graph Theory

Other

REACT II: Researching Eye Tracking to Assess Community Trust

Images for the Artemis III Mission: Assessing Field Test Images Collected by Astronauts

Baltimore at Risk: Strengthening Soil with Natural Biopolymers
