Projects

# Agent Type IC EC RC
1HumanBaseline0.900.660.94
2Human SimulacraRAG0.790.630.87
3Li et al. (2025)Prompting0.730.590.98
4DeepPersonaPrompting0.720.540.92
5Character.aiCommercial0.710.710.46
6Twin 2K 500Prompting0.530.260.95
7Consistent LLMFine-tuned0.310.300.14
8OpenCharacterFine-tuned0.160.150.14
PICon

A multi-turn interrogation framework for evaluating persona agent consistency. Applies interrogation methodology to systematically probe LLM-based persona agents through logically chained questions, exposing contradictions in internal, external, and retest consistency.

CXReasonAgent

An evidence-grounded diagnostic reasoning agent for chest X-rays. Integrates an LLM with clinically grounded diagnostic tools to produce responses based on explicit image-derived evidence such as quantitative measurements, spatial observations, and visual overlays.