feat: update knowledge distillation tutorial for using vllm with Qwen model #2960

RexBearIU · 2026-01-16T10:50:01Z

Description

This pull request updates the knowledge distillation tutorial for MaxText. The guide now uses Qwen3-32B as the teacher model, served through vLLM, and Llama-3.1-8B as the student. It streamlines setup with Hyperdisk storage and adds new scripts and commands for dataset generation and fine-tuning. The instructions have been clarified, conversion steps that are unnecessary for the teacher have been removed, and the fine-tuning process has been updated for the latest MaxText and vLLM workflows.
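For reviewers unfamiliar with the flow, here is a minimal sketch of the dataset-generation step: the teacher produces a completion for each prompt, and the pairs become fine-tuning data for the student. This is not the script added in this PR. The function names, the record fields (`prompt`, `completion`), the Hugging Face model ID `Qwen/Qwen3-32B`, and the sampling settings are all assumptions made for illustration.

```python
from typing import Callable, Iterable


def build_distillation_records(
    prompts: Iterable[str],
    generate: Callable[[list[str]], list[str]],
) -> list[dict[str, str]]:
    """Pair each prompt with the teacher's completion (hypothetical record layout)."""
    prompt_list = list(prompts)
    completions = generate(prompt_list)
    if len(completions) != len(prompt_list):
        raise ValueError("teacher returned a different number of completions than prompts")
    return [
        {"prompt": p, "completion": c.strip()}
        for p, c in zip(prompt_list, completions)
    ]


def vllm_teacher(
    model_name: str = "Qwen/Qwen3-32B",  # assumed model ID, not taken from the PR
    max_tokens: int = 256,               # assumed generation budget
) -> Callable[[list[str]], list[str]]:
    """Return a generate() function backed by vLLM's offline LLM API."""
    # Imported lazily so build_distillation_records works without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(model=model_name)
    params = SamplingParams(temperature=0.0, max_tokens=max_tokens)

    def generate(batch: list[str]) -> list[str]:
        # Take the first candidate completion for each prompt.
        return [out.outputs[0].text for out in llm.generate(batch, params)]

    return generate
```

On a machine with vLLM and enough accelerator memory, `build_distillation_records(prompts, vllm_teacher())` would yield the teacher-labelled records. Passing the teacher in as a callable lets the pairing logic be checked with a stub before any model is loaded.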

Tests

Manually triggered the distillation pipeline and monitored the execution flow step-by-step. Confirmed that the training loop finished and resources were released.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
I have necessary comments in my code, particularly in hard-to-understand areas.
I have run end-to-end tests and provided workload links above if applicable.
I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

docs/tutorials/posttraining/knowledge_distillation.md

codecov · 2026-01-21T08:18:32Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

tools/data_generation/generate_distillation_data_vllm.py

docs/tutorials/posttraining/knowledge_distillation.md

RexBearIU force-pushed the jackyf/docs/distillation branch from 14091ae to 2fb1059 Compare January 19, 2026 14:24

RexBearIU marked this pull request as ready for review January 19, 2026 14:29

RexBearIU requested review from A9isha, RissyRan, bvandermoon, gagika, gobbleturk, jacoguzo, jiangjy1982, richjames0, shralex and vipannalla as code owners January 19, 2026 14:29

SurbhiJainUSC reviewed Jan 20, 2026

docs/tutorials/posttraining/knowledge_distillation.md (outdated, resolved)

SurbhiJainUSC reviewed Jan 20, 2026

docs/tutorials/posttraining/knowledge_distillation.md (outdated, resolved)

SurbhiJainUSC reviewed Jan 20, 2026

docs/tutorials/posttraining/knowledge_distillation.md (outdated, resolved)

RexBearIU force-pushed the jackyf/docs/distillation branch from 2fb1059 to 8005986 Compare January 21, 2026 08:05

RexBearIU requested review from NicoGrande, NuojCheng, aireenmei, hengtaoguo, jesselu-google, khatwanimohit and suexu1025 as code owners January 21, 2026 08:05

RexBearIU force-pushed the jackyf/docs/distillation branch 2 times, most recently from 84aa2ed to eb215d2 Compare January 21, 2026 09:28

SurbhiJainUSC reviewed Jan 22, 2026

tools/data_generation/generate_distillation_data_vllm.py (outdated, resolved)

SurbhiJainUSC reviewed Jan 22, 2026

tools/data_generation/generate_distillation_data_vllm.py (outdated, resolved)

SurbhiJainUSC reviewed Jan 22, 2026

docs/tutorials/posttraining/knowledge_distillation.md (outdated, resolved)

SurbhiJainUSC reviewed Jan 22, 2026

docs/tutorials/posttraining/knowledge_distillation.md (outdated, resolved)

RexBearIU force-pushed the jackyf/docs/distillation branch 4 times, most recently from d3bd2e7 to f813340 Compare January 23, 2026 09:25

SurbhiJainUSC approved these changes Jan 23, 2026

feat: update knowledge distillation tutorial for using vllm with Qwen model (4b59129)

RexBearIU force-pushed the jackyf/docs/distillation branch from f813340 to 4b59129 Compare January 26, 2026 07:49

xuefgu approved these changes Jan 27, 2026
