LLM Fine-Tuning Suite Featured
End-to-end supervised fine-tuning of Llama-2 (7B, 13B) and Llama-3.1 (8B, 70B)
with LoRA low-rank adapters layered on 4-bit
NF4 quantization (QLoRA via bitsandbytes), so multi-billion-parameter checkpoints train on a
single GPU. Built on Hugging Face transformers + peft and trl's
SFTTrainer, with Neptune experiment tracking and side-by-side full fine-tuning baselines
to measure exactly what the adapters buy. Includes a CPU-only run of Llama-3.1-8B for hardware-constrained settings,
plus controlled generation and fine-grained performance analysis of the tuned models.
- Finance-Alpaca QA Instruction-tuning on financial knowledge — down to a Bitcoin buy/sell read from CNBC headlines
- Implicit Hate Speech · NLE Fine-tuning to generate natural-language explanations of implicit hate