LoRA / QLoRA fine-tuning
adapter training with low VRAM
// official site: github.com ↗
Axolotl is a config-driven LLM fine-tuning toolkit by OpenAccess AI Collective — supports LoRA/QLoRA/full-parameter training, multi-GPU via DeepSpeed/FSDP, and broad model coverage (Llama, Mistral, Qwen, Gemma, ChatGLM, Phi, etc.). The industry standard for production LLM fine-tuning.
Axolotl is a config-driven LLM fine-tuning toolkit by OpenAccess AI Collective — supports LoRA/QLoRA/full-parameter training, multi-GPU via DeepSpeed/FSDP, and broad model coverage (Llama, Mistral, Qwen, Gemma, ChatGLM, Phi, etc.). The industry standard for production LLM fine-tuning.
When AI startups fine-tune their own LLMs, Axolotl is the most common choice.
Concrete scenarios where teams pick Axolotl over the SaaS alternative.
adapter training with low VRAM
supervised fine-tuning end-to-end
preference alignment / RLHF alternatives
extend base model on new domain
DeepSpeed ZeRO 1/2/3, FSDP
alpaca, sharegpt, jsonl formats supported
If your team profile matches one of these, Axolotl is a strong fit out of the box.
fine-tuning custom LLMs for their domain
publishing fine-tuned model variants
training internal-data-aware models
keeping training data on-prem
offering managed fine-tuning to clients
When evaluating self-hosted options for this category, here are the dimensions on which Axolotl consistently lands above the alternatives.
The stack you'll plug Axolotl into — services, protocols, and adjacent apps in the BluixApps catalog.
/root/bluixapps/axolotl.txtbluixapps_ensure_nvidia_runtimeOperational guidance from running this in production — what to lock down, what surprises people.