Unsloth Studio — Fine-tuning Dataset Formats
Unsloth Studio supports several dataset formats depending on your fine-tuning goal. Files can be uploaded directly as JSONL, JSON, CSV, Parquet, PDF, or DOCX.

Format Overview

1. Raw Text (Continued Pretraining)

Used to inject domain knowledge without any structure. The model learns from continuous prose.

    The mitochondria is the powerhouse of the cell. ATP synthesis occurs via oxidative phosphorylation...

Best for: books, articles, documentation dumps, codebases.

...
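As a rough illustration of preparing raw text for upload, the sketch below splits a long document into paragraph-aligned chunks and writes them as JSONL, one record per line. The source does not specify the schema Unsloth Studio expects, so the `"text"` field name is an assumption borrowed from a common dataset convention, and `chunk_text`, `write_jsonl`, and the `max_chars` limit are hypothetical names chosen for this example.

```python
import json
from pathlib import Path


def chunk_text(text: str, max_chars: int = 2000) -> list[str]:
    """Group paragraphs into chunks of roughly max_chars characters.

    Paragraphs are never split, so a single paragraph longer than
    max_chars is kept whole as its own chunk.
    """
    chunks: list[str] = []
    current = ""
    for para in (p.strip() for p in text.split("\n\n")):
        if not para:
            continue  # skip empty paragraphs left by extra blank lines
        candidate = f"{current}\n\n{para}" if current else para
        if len(candidate) <= max_chars or not current:
            current = candidate
        else:
            chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks


def write_jsonl(chunks: list[str], out_path: str) -> int:
    """Write one JSON object per line and return the record count.

    The "text" key is an assumption, not a documented requirement.
    """
    with Path(out_path).open("w", encoding="utf-8") as f:
        for chunk in chunks:
            f.write(json.dumps({"text": chunk}, ensure_ascii=False) + "\n")
    return len(chunks)
```

Calling `write_jsonl(chunk_text(raw_document), "corpus.jsonl")` would produce a file with one prose chunk per line. Splitting on paragraph boundaries rather than at fixed offsets keeps sentences intact, which suits a format where the model learns from continuous prose.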