LMGame-GamingAgent
LLM/VLM gaming agents and model evaluation through games.
RL training of LLMs/VLMs in multi-turn environments.
Side‑by‑side leaderboards for Model (no harness) and Agent (harness‑enabled) performance.
Official hub for LMGame resources, docs, and blog updates.
Published on arXiv (submitted to NeurIPS ’25), 2025
Introduces lmgame‑Bench, a unified Gym‑style benchmark that tests LLM agents across platformer, puzzle, and narrative games—addressing vision brittleness, prompt variance, and data contamination.
Published in ICML MAS Workshop, 2025
Introduces a perception–memory–reasoning harness that consistently boosts LLM/VLM gameplay across classic and modern game suites, uncovering module‑specific performance patterns.