Unsloth Studio — Fine-tuning Dataset Formats

Unsloth Studio supports several dataset formats depending on your fine-tuning goal. Files can be uploaded directly as JSONL, JSON, CSV, Parquet, PDF, or DOCX. Format Overview 1. Raw Text (Continued Pretraining) Used to inject domain knowledge without any structure. The model learns from continuous prose. T h e m i t o c h o n d r i a i s t h e p o w e r h o u s e o f t h e c e l l . A T P s y n t h e s i s o c c u r s v i a o x i d a t i v e p h o s p h o r y l a t i o n . . . Best for: books, articles, documentation dumps, codebases. ...

April 23, 2026 · 5 min