Dongjun Kim - AI Research Engineer

Hi, I'm Dongjun Kim

I turn deep research insights into real-world AI systems.

I am an AI Research Engineer at Upstage, building LLM evaluation systems and infrastructure. Previously, I was in the NLP&AI Lab at Korea University, advised by Dr. Heuiseok Lim.

Independent AI Foundation Model Project (WBL)

Evaluation Data

Collaboration with NC AI, ETRI (NC AI Consortium)

Implemented an evaluation framework (50+ benchmarks) covering reasoning, safety, and robustness with reproducible pipelines and CI triggers.
Designed a unified metric layer and Weights & Biases dashboards for time-series tracking, regression checks, and cross-model slice analysis.
Conducted pre- and post-training analyses on successive checkpoints, identified failure modes, and provided recommendations used for data and recipe updates.
Introduced contamination checks (deduplication and overlap scans) and standardized runbooks to support fair, comparable evaluations across systems.
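As an illustration of what a deduplication and overlap scan can look like, here is a minimal sketch. It is not the project's actual pipeline: the function names (`ngrams`, `overlap_ratio`, `dedupe`), the whitespace tokenization, and the default 8-gram window are all assumptions chosen for the example.

```python
from typing import Iterable, List, Set, Tuple


def ngrams(text: str, n: int = 8) -> Set[Tuple[str, ...]]:
    """Return the set of lowercase, whitespace-tokenized n-grams in text."""
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def overlap_ratio(benchmark_item: str, corpus_docs: Iterable[str], n: int = 8) -> float:
    """Fraction of the item's n-grams that also appear in the corpus.

    A high ratio suggests the benchmark item may have leaked into the
    training corpus. The threshold for "high" is left to the caller.
    """
    item_grams = ngrams(benchmark_item, n)
    if not item_grams:
        return 0.0  # item shorter than n tokens: nothing to compare
    corpus_grams: Set[Tuple[str, ...]] = set()
    for doc in corpus_docs:
        corpus_grams |= ngrams(doc, n)
    return len(item_grams & corpus_grams) / len(item_grams)


def dedupe(items: Iterable[str]) -> List[str]:
    """Drop items that are identical after case and whitespace normalization."""
    seen: Set[str] = set()
    unique: List[str] = []
    for item in items:
        key = " ".join(item.lower().split())
        if key not in seen:
            seen.add(key)
            unique.append(item)
    return unique
```

Exact n-gram matching is cheap and easy to audit, but it misses paraphrased leaks; a production scan would likely pair it with fuzzy matching.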

KoLEG: Korean Legal Knowledge Editing

Training Data

Korea University NLP&AI Lab · KT Corporation

Co-developed an on-the-fly legal knowledge editing framework with continuous retrieval and timestamp-aware sequential updates (team of 8).
Led crawling and construction of the Korean Legislative Amendment dataset, aligning implementation periods and applying high-precision filtering.
Contributed mechanistic interpretability analyses on locality and generalization in edited knowledge.
Built expert evaluation demos and protocols for human assessment, enabling attorney review and iterative error analysis.

KULLM 3 · KULLM R · Ko-Gemma Training

Training

NLP&AI Lab, Korea University

Contributed to post-training on a team of 10, including instruction tuning, the training framework, and multilingual and code-switching datasets.
Curated code and math corpora, established quality gates, and ran capability evaluations for coding and math.
KULLM Reasoning: implemented reinforcement learning with GRPO and custom reward functions, and applied adaptive response length.
Observed improved Korean reasoning and math accuracy versus a Qwen3 reasoning baseline while reducing unnecessary verbosity in internal evaluations.
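For readers unfamiliar with GRPO, the core idea is that each sampled response is scored relative to the other responses drawn for the same prompt. The sketch below shows that group-relative normalization together with a purely hypothetical length-aware reward. The function names, the token budget, and the penalty weight are assumptions for illustration and do not describe the rewards used in KULLM R.

```python
from statistics import mean, pstdev
from typing import List


def length_penalized_reward(correct: bool, n_tokens: int,
                            budget: int = 200, penalty: float = 0.1) -> float:
    """Hypothetical reward: 1 for a correct answer, minus a penalty that
    grows linearly with the number of tokens beyond the budget."""
    base = 1.0 if correct else 0.0
    excess = max(0, n_tokens - budget) / budget
    return base - penalty * excess


def group_advantages(rewards: List[float], eps: float = 1e-6) -> List[float]:
    """Group-relative advantages: standardize each reward against the mean
    and standard deviation of its own group of sampled responses.

    Population standard deviation is used here as an assumption; some
    implementations use the sample standard deviation instead.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four sampled responses to one prompt: (is_correct, token_count).
samples = [(True, 150), (True, 600), (False, 120), (False, 900)]
rewards = [length_penalized_reward(c, n) for c, n in samples]
advantages = group_advantages(rewards)
```

In this toy group, the short correct response receives the largest advantage, which is the kind of pressure that discourages unnecessary verbosity.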

Self-Improving Leaderboard

Agents Evaluation

Auto-Generated Benchmarks From Real-Time Data

Implemented daily crawlers across multiple news categories, real-time QA generation, and automated multi-LLM evaluation on daily-refreshed data.
Launched a live leaderboard with time-aware ranking and quarterly stability/volatility metrics to track consistency over time.
Maintained scheduling, monitoring, and data hygiene to support regular refreshes and clear longitudinal comparisons; operated on Hugging Face Spaces.
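One simple way to quantify consistency over time is to convert each day's scores into ranks and measure how much each model's rank moves. The sketch below does that under stated assumptions: the source does not define the leaderboard's stability or volatility metrics, so `daily_ranks` and `rank_volatility` are hypothetical names and the definition (population standard deviation of daily rank) is an illustrative choice.

```python
from statistics import pstdev
from typing import Dict, List


def daily_ranks(daily_scores: List[Dict[str, float]]) -> Dict[str, List[int]]:
    """Turn per-day score tables into per-model rank histories.

    Rank 1 is the best score of the day; ties are broken by model name
    so the result is deterministic.
    """
    history: Dict[str, List[int]] = {}
    for scores in daily_scores:
        ordered = sorted(scores, key=lambda m: (-scores[m], m))
        for rank, model in enumerate(ordered, start=1):
            history.setdefault(model, []).append(rank)
    return history


def rank_volatility(ranks: List[int]) -> float:
    """Population standard deviation of a model's daily ranks.

    0.0 means the model held the same position every day.
    """
    return pstdev(ranks) if len(ranks) > 1 else 0.0


# Toy data: two models evaluated on two consecutive days.
scores_by_day = [
    {"model_a": 0.9, "model_b": 0.8},
    {"model_a": 0.7, "model_b": 0.8},
]
history = daily_ranks(scores_by_day)
volatility = {m: rank_volatility(r) for m, r in history.items()}
```

A quarterly figure could then be obtained by restricting `daily_scores` to the days in that quarter before computing the ranks.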

Top Achievements

Education

Korea University, Master's in Computer Science

University of South Florida, Bachelor's in Computer Science

Publications

Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks

KoLEG: On-the-Fly Korean Legal Knowledge Editing with Continuous Retrieval

MMA-ASIA: A Multilingual and Multimodal Alignment Framework for Culturally-Grounded Evaluation

Exploring Coding Spot: Understanding Parametric Contributions to LLM Coding Performance

From Snapshot to Stream: A Self-Improving Leaderboard for Robust and Evolving Natural Language Processing (NLP) Evaluation

Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval

KITE: A Benchmark for Evaluating Korean Instruction-Following Abilities in Large Language Models

Exploring Inherent Biases in LLMs within Korean Social Context: A Comparative Analysis of ChatGPT and GPT-4

CitySEIRCast: an agent-based city digital twin for pandemic analysis and simulation

Projects

Independent AI Foundation Model Project (WBL)

KoLEG: Korean Legal Knowledge Editing

KULLM 3 · KULLM R · Ko-Gemma Training

Self-Improving Leaderboard

Experience

AI Research Engineer 2026.03 – Present

RLHF Data Trainer 2023.03 – 2024.01

AR/VR Software Engineering Intern 2023.05 – 2023.11

High Performance Computing Intern 2022.08 – 2023.05

Mixed Reality Research Assistant 2022.01 – 2023.05

Contact