Dongjun Kim
AI and LLM Interpretability Researcher
Hi, I'm Jun, a researcher at Korea University with a deep commitment to advancing our understanding of Large Language Models. My research focuses on mechanistic interpretability, where I aim to reverse-engineer the algorithms and structures within LLMs to reveal how they process information, make decisions, and exhibit emergent behaviors. By making these systems more transparent and predictable, I strive to enhance their reliability, safety, and alignment with human values.
I am driven by the curiosity to understand the principles of Deep Neural Networks. My work seeks to bridge the gap between theoretical advancements in AI and their practical applications, contributing to the development of AI systems that are not only highly capable but also aligned with ethical standards and societal goals. Through rigorous research and collaboration, I aim to help build a future where AI technologies are both transformative and responsible.
As a researcher, I am dedicated to pushing boundaries in interpretability, safety, and reasoning within LLMs, in the belief that understanding the mechanisms behind intelligence is essential for unlocking its potential while mitigating its risks. If you share this interest or have ideas in interpretability or safety, feel free to reach out; I am always eager to collaborate with like-minded researchers.
Research Interests
Mechanistic Interpretability
My primary focus is on reverse-engineering LLMs to uncover their internal circuits, algorithms, and decision-making processes. This includes:
- Analyzing transformer architectures to map dependencies between attention heads, layers, and emergent behaviors
- Developing sparse autoencoders as tools for isolating interpretable features in high-dimensional latent spaces
- Investigating causal relationships between model components and specific capabilities through probing techniques
By understanding these mechanisms, I aim to make LLMs more transparent while providing insights into their strengths and limitations.
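As a minimal illustration of the sparse-autoencoder idea above, the sketch below (a toy NumPy example with random, untrained weights; the dimensions and names are illustrative, not from any specific model) shows the core mechanics: an overcomplete dictionary of ReLU features encodes model activations, and the training loss trades reconstruction error against an L1 sparsity penalty on the features.

```python
import numpy as np

rng = np.random.default_rng(0)

def sae_forward(x, W_enc, b_enc, W_dec, b_dec):
    """One forward pass of a sparse autoencoder over activations x.
    ReLU features; sparsity is encouraged by an L1 penalty in the loss."""
    f = np.maximum(x @ W_enc + b_enc, 0.0)   # feature activations
    x_hat = f @ W_dec + b_dec                # reconstruction of x
    return f, x_hat

def sae_loss(x, x_hat, f, l1_coeff=1e-3):
    recon = np.mean((x - x_hat) ** 2)        # reconstruction term
    sparsity = l1_coeff * np.mean(np.abs(f)) # L1 sparsity term
    return recon + sparsity

d_model, d_feat, batch = 16, 64, 8           # overcomplete: d_feat > d_model
x = rng.normal(size=(batch, d_model))        # stand-in for residual-stream activations
W_enc = rng.normal(scale=0.1, size=(d_model, d_feat))
W_dec = rng.normal(scale=0.1, size=(d_feat, d_model))
f, x_hat = sae_forward(x, W_enc, np.zeros(d_feat), W_dec, np.zeros(d_model))
loss = sae_loss(x, x_hat, f)
```

In practice the encoder and decoder would be trained on activations collected from a real model, and the learned feature directions are what one then tries to interpret.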
AI Safety
Ensuring the safe deployment of advanced AI systems is critical to my work. My research in AI safety focuses on:
- Developing scalable frameworks for aligning model behavior with human values through fine-tuning and reinforcement learning
- Detecting and mitigating biases in language models using interpretability-driven methods
- Building robust anomaly detection systems that identify harmful or unexpected behaviors during deployment
My goal is to create methodologies that safeguard against risks while enabling reliable and ethical applications of AI.
Mechanistic Anomaly Detection
Mechanistic anomaly detection involves identifying unexpected or harmful behaviors in LLMs by analyzing their internal mechanisms. I focus on:
- Developing tools that trace causal pathways within neural networks to diagnose sources of anomalous outputs
- Designing self-monitoring models capable of detecting deviations from intended behavior during real-world use
- Applying interpretability techniques to improve system robustness under adversarial or high-stakes conditions
This research aims to keep AI systems reliable even when they face complex or unpredictable environments.
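The causal-pathway tracing mentioned above can be sketched with activation patching on a toy network (this is an illustrative NumPy example, not the actual method used in my research; the two-layer MLP stands in for a transformer component). The idea: run the model on a clean input and an anomalous one, patch each hidden unit's clean activation into the anomalous run, and rank units by how much of the output gap each patch recovers.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy 2-layer network standing in for one transformer component
W1 = rng.normal(size=(4, 8))
W2 = rng.normal(size=(8, 2))

def forward(x, patch=None):
    """Forward pass; optionally overwrite one hidden unit (activation patching)."""
    h = np.tanh(x @ W1)
    if patch is not None:
        idx, value = patch
        h = h.copy()
        h[idx] = value
    return h @ W2

x_clean = rng.normal(size=4)
x_anom = x_clean + rng.normal(scale=2.0, size=4)  # "anomalous" input

h_clean = np.tanh(x_clean @ W1)
base_gap = np.abs(forward(x_anom) - forward(x_clean)).sum()

# Causal effect of each hidden unit: how much of the output gap is
# recovered when its clean activation is patched into the anomalous run?
effects = []
for i in range(8):
    patched = forward(x_anom, patch=(i, h_clean[i]))
    effects.append(base_gap - np.abs(patched - forward(x_clean)).sum())
```

Units with the largest recovered gap are the most plausible causal sources of the anomalous output, which is the diagnostic signal a mechanistic anomaly detector would act on.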
Reasoning Models & Agent Systems
Understanding how LLMs reason and interact as agents is an emerging focus of my work. This includes:
- Investigating multi-step reasoning processes within transformer-based architectures
- Developing agent systems that leverage LLMs for planning, decision-making, and interactive problem-solving tasks
- Exploring how compositionality in neural networks enables structured reasoning across diverse domains
By advancing reasoning models, I aim to enable LLMs to perform complex tasks reliably while maintaining interpretability.
Retrieval-Augmented Generation (RAG)
RAG combines retrieval systems with generative models to enhance factual accuracy and groundedness. My work in this area focuses on:
- Designing retrieval pipelines optimized for domain-specific knowledge integration
- Exploring hybrid architectures that improve factual consistency in generative outputs
- Reducing hallucinations by embedding retrieval mechanisms directly into transformer workflows
Through RAG techniques, I aim to bridge the gap between generative capabilities and real-world reliability.
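A stripped-down sketch of the RAG loop described above (pure-stdlib Python with a toy bag-of-words "embedding"; a real pipeline would use a dense encoder, a vector index, and an LLM call in place of the returned prompt):

```python
from collections import Counter
import math

# Toy document store; in practice this would be a vector database
docs = {
    "doc1": "sparse autoencoders isolate interpretable features",
    "doc2": "retrieval augmented generation grounds model outputs",
    "doc3": "agent systems plan and act over multiple steps",
}

def embed(text):
    """Bag-of-words stand-in for a dense embedding model."""
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, k=1):
    """Rank documents by similarity to the query and return the top k ids."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(docs[d])), reverse=True)
    return ranked[:k]

def build_prompt(query, k=1):
    """Ground the generator by prepending retrieved context to the query."""
    context = "\n".join(docs[d] for d in retrieve(query, k))
    return f"Context:\n{context}\n\nQuestion: {query}"

prompt = build_prompt("how does retrieval augmented generation help?")
```

The grounding step is the key design choice: the generator only ever sees the query together with retrieved evidence, which is what reduces hallucination relative to closed-book generation.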
Computational Neuroscience
Computational neuroscience bridges the gap between biological neural systems and artificial intelligence, offering insights into how natural intelligence can inspire better AI systems. My work in this area focuses on:
- Studying biological neural mechanisms to inform the development of more efficient and interpretable AI architectures
- Drawing parallels between attention mechanisms in transformers and cognitive processes observed in the brain
- Exploring how memory, reasoning, and learning in biological systems can be modeled computationally to improve LLM performance
By leveraging principles from neuroscience, I aim to enhance our understanding of both natural and artificial intelligence, driving innovations in AI research.
Education and Research Journey
My academic path reflects an evolving fascination with complex systems—from spatial computing to neural architectures—driven by the fundamental question: "How do intelligent systems truly work?" This journey has crystallized into a focused mission to reverse-engineer AI while ensuring its safe and ethical development.
Korea University, Seoul, South Korea
As a Master's researcher in the NLP&AI Lab under Dr. Heui-Seok Lim, I have been involved in multiple government and industry-funded projects, including collaborations with the Ministry of Food and Drug Safety and KT Gen AI Lab. My work focuses on advancing LLM interpretability, AI safety, and retrieval-augmented generation (RAG) systems. Key contributions include:
- Developing a novel knowledge editing method for domain-specific applications, enabling precise updates to LLMs without retraining
- Designing automatic attack detection frameworks (harmfulness/bias detection) for AI safety through automated red-teaming techniques
- Creating an advanced RAG agent system capable of dynamic knowledge retrieval and integration for real-time decision-making tasks
These projects integrate cutting-edge interpretability techniques with practical applications, bridging theoretical AI research with real-world deployment challenges. Recent work includes a new probing method for tracing causal pathways in transformer architectures, currently under review at ACL 2025.
University of South Florida, Tampa, FL
During my B.S. in Computer Science, I gained foundational expertise in spatial computing and agent-based modeling through research under distinguished mentors:
- Worked with Dr. Edwin Michael on building a city-scale digital twin for Hillsborough County, developing agent-based models to simulate pandemic dynamics and inform public health policies
- Collaborated with Dr. Wanwan Li on automatic room mapping in augmented reality (AR) systems, focusing on view direction-driven SLAM algorithms for dynamic environments
- Explored neural scene representation networks for AR applications, optimizing spatial intelligence systems for real-time use cases
These experiences solidified my passion for understanding complex architectures and their applications in real-world scenarios. My contributions resulted in co-authored publications and practical tools that supported public health planning during the COVID-19 pandemic.
Recent Publications
Exploring Coding Spot: Understanding Parametric Contributions to LLM Coding Performance
Kim, D., Kim, M., Chun, Y., Park, C., Lim, H. arXiv preprint arXiv:2412.07113, 2024.
Read Paper
Exploring Inherent Biases in LLMs within Korean Social Context: A Comparative Analysis of ChatGPT and GPT-4
Lee, S., Kim, D., Jung, D., Park, C., Lim, H. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 93–104, 2024.
Read Paper
CitySEIRCast: an agent-based city digital twin for pandemic analysis and simulation
Bilal, S., Zaatour, W., Alonso Otano, Y., Saha, A., Newcomb, K., Kim, S., Kim, J., Ginjala, R., Groen, D., Michael, E. Complex & Intelligent Systems, 11(1), pp. 1–29, 2025.
Read Paper