LMMs Engine Documentation
Welcome to the LMMs Engine documentation! LMMs Engine is a flexible, extensible framework for training large multimodal models (LMMs), with support for a range of model architectures, datasets, and training strategies.
Getting Started
User Guide
Developer Guide
Reference
Models
- BAGEL Model Training Guide
- Qwen-VL Model Training Guide
- dLLM (Diffusion Language Model) Training
- FLA Models (DGN) Training
- RAE-SigLip Training
- SiT (Scalable Interpolant Transformer) Training
- WanVideo Training
- Qwen2.5 LLM Training
- Qwen2.5-Omni Training
- Qwen3-VL MoE Training
- Qwen3-MoE Training
- Qwen3-Omni MoE Training
Troubleshooting