Publications
*Participant name in bold works at KRAFTON
Filter
AgentVidBench: A Multi-Hop Video Question Answering Benchmark for Evaluating MLLM Agents
AgentVidBench: A Multi-Hop Video Question Answering Benchmark for Evaluating MLLM Agents
AsyncOPD: How Stale Can On-Policy Distillation Be?
AsyncOPD: How Stale Can On-Policy Distillation Be?
Online Agent-as-a-Judge: Situation-Generating Evaluation for Interactive Agents
Online Agent-as-a-Judge: Situation-Generating Evaluation for Interactive Agents
Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling
Mitigating Perceptual Judgment Bias in Multimodal LLM-as-a-Judge via Perceptual Perturbation and Reward Modeling
Meta-Harness: End-to-End Optimization of Model Harnesses
Meta-Harness: End-to-End Optimization of Model Harnesses
Uniform Spectral Growth under Factor-wise Muon Orthogonalization in Matrix Factorization and LoRA
Uniform Spectral Growth under Factor-wise Muon Orthogonalization in Matrix Factorization and LoRA
Pruning and Distilling Mixture-of-Experts into Dense Language Models
Pruning and Distilling Mixture-of-Experts into Dense Language Models
AMUSE: Anytime Muon with Stable Gradient Evaluation
AMUSE: Anytime Muon with Stable Gradient Evaluation
RoDAC: A Robust Data-centric Anti-Cheat Framework for Fair Online Competitive Gaming
RoDAC: A Robust Data-centric Anti-Cheat Framework for Fair Online Competitive Gaming
Identifiable Token Correspondence for World Models