Add multi-model support for Qwen 2.5 and Qwen3 Coder #9
+1,023
−49
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
Extends PR #8 to support multiple Qwen Coder models with model-specific configurations.
Changes
Model Configuration Registry (
src/training/models.py)SamplingConfigandModelConfigdataclassesAPI Enhancements (
src/training/serve.py)modelparameter/modelsendpoint to list available configurationskeep_alivesettings (300s for MoE models)CLI Improvements (
scripts/run_ollama.py)--model-keyfor registry-based model selection--list-modelsto display available modelsCross-Platform Support
.gitattributesfor consistent line endingsAvailable Models
qwen2.5-coder-32bqwen2.5-coder-14bqwen2.5-coder-7bqwen3-coder-30bTest plan
🤖 Generated with Claude Code