arxiv:2605.16679
Zeyu Tang
zeyutang
AI & ML interests
Trustworthy AI
Recent Activity
authored a paper about 1 month ago
Fantastic Bugs and Where to Find Them in AI Benchmarks authored a paper about 1 month ago
CHI-Bench: Can AI Agents Automate End-to-End, Long-Horizon, Policy-Rich Healthcare Workflows? liked a dataset about 1 month ago
actava/chi-bench