Member-only story

A Beginner’s Guide to LLM Fine-Tuning

How to fine-tune Llama and other LLMs with one tool

8 min readAug 30, 2023

The growing interest in Large Language Models (LLMs) has led to a surge in tools and wrappers designed to streamline their training process.

Popular options include FastChat from LMSYS (used to train Vicuna) and Hugging Face’s transformers/trl libraries (used in my previous article). In addition, each big LLM project, like WizardLM, tends to have its own training script, inspired by the original Alpaca implementation.

In this article, we will use Axolotl, a tool created by the OpenAccess AI Collective. We will use it to fine-tune a Code Llama 7b model on an evol-instruct dataset comprised of 1,000 samples of Python code.

🤔 Why Axolotl?

The main appeal of Axolotl is that it provides a one-stop solution, which includes numerous features, model architectures, and an active community. Here’s a quick list of my favorite things about it:

Configuration: All parameters used to train an LLM are neatly stored in a yaml config file. This makes it convenient for sharing and reproducing models. You can see an example for Llama 2 here.
Dataset Flexibility: Axolotl allows the specification of multiple datasets with varied prompt formats such as…

TDS Archive

A Beginner’s Guide to LLM Fine-Tuning

How to fine-tune Llama and other LLMs with one tool

🤔 Why Axolotl?

Published in TDS Archive

Written by Maxime Labonne