Model Adaptation List (Continuously Updated)

Note

If you need other software or models adapted, please contact hpc@hkust-gz.edu.cn.

Training

| Full Model Name | Adaptation Status | Guidance Documentation |
| --- | --- | --- |
| qwen3-30b-a3b | Adapted | Train/fine-tune using the MindSpeed-LLM framework, guide: Reference Document; LlamaFactory training reference: Reference Document; training script: Reference Document |
| Qwen2.5-Coder-32B/14B/7B-Instruct; Qwen3-Coder-30B-A3B-Instruct | Adapted | Fine-tuning using the MindSpeed-LLM framework, reference guides: Reference Document, Reference Document |
| Qwen3-8B-Instruct | Adapted | Full-parameter SFT fine-tuning using the MindSpeed-LLM framework: Reference Document; LlamaFactory training reference: Reference Document; training script: Reference Document |
| Qwen/Qwen3-VL-8B-Instruct | Adapted | Fine-tuning using the MindSpeed-MM framework: Reference Document; LlamaFactory training reference: Reference Document; training script: Reference Document |
| Qwen/Qwen3-8B | Adapted | Full-parameter SFT fine-tuning using the MindSpeed-LLM framework: Reference Document; LlamaFactory training reference: Reference Document; training script: Reference Document |
| Qwen/Qwen3-14B | Adapted | Full-parameter SFT fine-tuning using the MindSpeed-LLM framework: Reference Document; LlamaFactory training reference: Reference Document; training script: Reference Document |
| Qwen/Qwen3-235B-A22B-Thinking-2507 | Adapted | MindSpeed-LLM installation guide: Reference Document; large-model instruction fine-tuning: Reference Document; fine-tuning script: Reference Document |
| Qwen3-8B | Adapted | Full-parameter SFT fine-tuning using the MindSpeed-LLM framework: Reference Document |
| Qwen3-8B | Adapted | Reference Document |
| Qwen3-8B | Adapted | Reference Document |
| Qwen3-8B | Adapted | Reference links: Reference Document, Reference Document |
| Qwen3-8B | Adapted | Full-parameter SFT fine-tuning using the MindSpeed-LLM framework: Reference Document; LlamaFactory training reference: Reference Document; training script: Reference Document |
| Wan 2.2 | Adapted | Fine-tuning using the MindSpeed-MM framework: Reference Document; MindSpeed-MM fine-tuning practice for the Wan2.2-T2V-A14B model: Reference Document |
| Qwen2.5-72B-Instruct | Adapted | MindSpeed-LLM installation guide: Reference Document; pre-training: Reference Document; LoRA fine-tuning: Reference Document; model LoRA fine-tuning script: Reference Document; pre-training: Reference Document |
| Qwen3-8b | Adapted | MindSpeed-LLM preset dense large models: Reference Document; installation guide: Reference Document; fine-tuning guide: Reference Document |
| Qwen3-32b | Adapted | MindSpeed-LLM preset dense large models: Reference Document; installation guide: Reference Document; fine-tuning guide: Reference Document |
| Qwen3-VL-32b | Adapted | Fine-tuning using the MindSpeed-MM framework: Reference Document; LlamaFactory training reference: Reference Document; training script: Reference Document |
| Qwen3(VL)-4B/8B | Adapted | MindSpeed-MM reinforcement learning: Reference Document |
| Qwen2-7B / LLaMA2-7B | Adapted | MindSpeed-LLM preset dense large models: Reference Document; installation guide: Reference Document; LoRA fine-tuning: Reference Document; LLaMA2-7B fine-tuning script: Reference Document |
| Qwen-3 series, such as 14B, 32B; LLaMA-3.2 8B/14B | MindSpeed-LLM (per its official website) supports LLaMA3.2-1B/3B; VeRL does not currently support LLaMA-3.2 8B/14B or qwen3-14b | DAPO operation instructions: Reference Document; installation guide: Reference Document; Qwen3-32B MindSpeed-RL reinforcement learning script: Reference Document |
| LLaMA3-8B-Instruct / LLaMA3.1-8B-Instruct | Adapted | MindSpeed-LLM preset dense large models: Reference Document; LoRA fine-tuning: Reference Document; installation guide: Reference Document; LlamaFactory training reference: Reference Document; LlamaFactory framework training script: Reference Document |
| DeepSeek-R1-Distill-Llama-70B or Llama-3-70B | Adapted | MindSpeed-LLM installation guide: Reference Document; distributed pre-training: Reference Document; llama3-70B pre-training script: Reference Document |
| DeepSeek V3.2 | Adapted | MindSpeed-LLM installation guide: Reference Document; fine-tuning script: Reference Document; model fine-tuning script: Reference Document |
| openai/gpt-oss-120b | gpt-oss-20b supported; 120B not yet | Installation guide: Reference Document; operation instructions: Reference Document; gpt-oss-20b model fine-tuning script: Reference Document |

Inference

Note

Some of the reference documents are general deployment guides for the same framework and can be used as a reference.

| Full Model Name | Inference Engine | Adaptation Status | Reference Document |
| --- | --- | --- | --- |
| Qwen3-VL-30B-A3B-Instruct | vLLM | Adapted | Reference Document, Reference Document |
| qwen3-30b-a3b | vLLM | Adapted | Reference Document |
| Qwen3-VL-235B-A22B | vLLM | Adapted | Reference Document |
| Qwen3-VL-32B-Thinking | vLLM | Adapted | Reference Document |
| Qwen2.5-Coder-32B/14B/7B-Instruct; Qwen3-Coder-30B-A3B-Instruct | vLLM, SGLang | Adapted | Reference Document, Reference Document |
| Qwen3-8B | MindIE / vLLM | Adapted | Reference Document, Reference Document |
| Qwen2-7B | vLLM / MindIE | Adapted | Reference Document, Reference Document |
| Qwen/Qwen3-8B | vLLM | Adapted | Reference Document |
| Qwen/Qwen3-14B | vLLM | Adapted | Reference Document |
| Qwen/Qwen2.5-7B-Instruct | vLLM | Adapted | Reference Document |
| Qwen/Qwen2.5-VL-7B-Instruct | vLLM | Adapted | Reference Document |
| Qwen/Qwen2.5-14B-Instruct | vLLM | Adapted | Reference Document |
| Qwen/Qwen3-VL-Embedding-2B | vLLM | Adapted | Reference Document |
| Qwen/Qwen3-VL-Embedding-8B | vLLM | Adapted | Reference Document |
| Qwen/Qwen3-Embedding-8B | vLLM | Adapted | Reference Document |
| Qwen/Qwen3-Embedding-4B | vLLM | Adapted | Reference Document |
| Qwen/Qwen3-Embedding-0.6B | vLLM | Adapted | Reference Document |
| Qwen/Qwen3-235B-A22B-Thinking-2507 | LiteLLM, vLLM | Adapted | Reference Document, Reference Document, Reference Document |
| Qwen3-8B | vLLM | Adapted | Reference Document |
| Qwen3-8B | SGLang | Adapted | Reference Document, Reference Document |
| Qwen3-8B | vLLM | Adapted | Reference Document |
| Qwen3-32B | vLLM | Adapted | Reference Document |
| Qwen3-235B | vLLM | Adapted | Reference Document |
| Qwen3-VL-235B | vLLM | Adapted | Reference Document |
| Qwen3-Omni | vLLM | Adapted | Reference Document |
| Qwen3(VL)-4B/8B/32B | vLLM | Adapted | Reference Document |
| Qwen3-235B-A22B / Qwen3-235B-A22B-W8A8 | vLLM / omni_infer | Adapted | Reference Document |
| Wan 2.2 |  | Adapted | Reference Document |
| DeepSeek-R1-Distill-70B (Int8/W8A8 quantized version) | vLLM or MindIE | Adapted | Reference Document |
| DeepSeek V3.2 | LiteLLM, vLLM | Adapted | Reference Document, Reference Document, Reference Document |
| DeepSeek-V3 | vLLM | Adapted | Reference Document |
| DeepSeek-OCR | vLLM | Adapted | Reference Document |
| Kimi-K2-Thinking | LiteLLM, vLLM | Adapted | Reference Document, Reference Document, Reference Document |
| Kimi-Audio | PyTorch | Adapted | None available |
| LLaMA3-8B-Instruct | MindIE | Adapted | Reference Document |
| openai/gpt-oss-120b | LiteLLM, vLLM | Adapted | Reference Document, Reference Document, Reference Document |
| Whisper-Large-V3 | PyTorch | Adapted | Reference Document |
| BAAI/bge-base-en-v1.5 | vLLM | TEI adapted; vLLM not yet adapted | Reference Document |
| BAAI/bge-large-en-v1.5 | vLLM | TEI adapted; vLLM not yet adapted | Reference Document |
| LLaDA2.0-flash | SGLang | Submitted to the model adaptation team, migration adaptation in progress; successfully served and called based on SGLang, version 1.20 | Reference Document |
| HunyuanVideo-1.5 |  | Adapted | Reference Document |
| speaker-diarization-3.1 | PyTorch | Adapted | Reference Document |