Multi-Agent LLM Game for Healthcare Decision Support
Overview
I am a Research Assistant in the College of Engineering and Applied Science at the University of Cincinnati, working under Prof. Tommaso Giovannelli. My work focuses on multi-agent large language model systems applied to decision-making under uncertainty in healthcare operations.
For my capstone project (Spring 2026), I designed and developed an educational game that combines mathematical optimization with AI-driven structured debate. The game teaches students how different assumptions about uncertainty lead to fundamentally different optimal decisions — and that choosing among them requires understanding trade-offs, not just running a solver.
Role: Research Assistant – Multi-Agent LLM Systems
Institution: University of Cincinnati, Department of Mechanical and Materials Engineering
Advisor: Prof. Tommaso Giovannelli
Dates: Oct 2025 – Present
The Game
Players take on the role of hospital administrators allocating limited resources (nurses to shifts, or operating room blocks to surgical specialties) under uncertain patient demand. The game proceeds through five steps:
- Decide — Allocate resources using slider controls with real-time cost tracking
- Compare to Optimal — See how your decision compares to three optimization solutions computed under expected, moderate, and worst-case demand assumptions
- Observe AI Debate — Watch three LLM agents (optimistic, pessimistic, mediator) debate the trade-offs across two rounds, culminating in a synthesized final recommendation
- Revise — Adjust your allocation informed by the optimization results and the debate
- Evaluate — Stress-test your initial and revised decisions against multiple demand scenarios
The game implements two healthcare scenarios: Emergency Department Nurse Scheduling (30-bed ED, three 8-hour shifts) and Operating Room Block Allocation (35 weekly blocks across three surgical specialties).
System Architecture
- Backend: Python/Flask server with a scipy-based optimization engine using the SLSQP (Sequential Least Squares Programming) algorithm
- Multi-Agent Debate: Three LLM agent personas (optimistic, pessimistic, mediator) via the Anthropic Claude API (claude-sonnet-4-20250514), with a structured two-round debate protocol plus mediator synthesis
- Guardrails: Allocation clamping, budget constraints, output format validation — inspired by research on autonomous multi-agent supply chain systems
- Orchestrator: Controls data flow between agents, enforcing what each agent sees at each debate round
- Frontend: React browser interface with a five-step progress flow, slider controls, live debate display, and stress-test evaluation grids
Results
Across three live debate sessions on the nurse scheduling scenario:
- The optimistic agent consistently recommended the cost-minimized solution (23 nurses, $11,500), while the pessimistic agent proposed 30–34 nurses with maximum afternoon staffing
- All three mediator syntheses fell within a narrow range: 27–29 total nurses at $13,500–$14,500
- Agents showed genuine position changes between rounds — the optimistic agent increased its afternoon allocation after seeing the others’ proposals, and the pessimistic agent revised downward after triggering the budget guardrail
- The structured debate consistently surfaced the core trade-off: a $4,000 gap (35% cost increase) between cost-minimizing and safety-maximizing solutions
Publication
Gamage, T. (2026). “A Multi-Agent LLM Game for Healthcare Decision Support: Combining Mathematical Optimization with AI-Driven Debate for Experiential Learning.” Capstone Project, University of Cincinnati.
Read the Paper
View Research Group