Moneycontrol

S1 AI model explained: What is it and all you need to know about this ‘$50’ language model

Researchers from Stanford University and the University of Washington have developed an AI reasoning model called s1, which was trained for under $50 using a process called distillation.

February 10, 2025 / 18:12 IST

The s1 model is built on Qwen2.5, an open-source language model from Alibaba Cloud, and was fine-tuned on outputs from Google’s Gemini 2.0 Flash Thinking Experimental model.

S1 AI model development details

The research team trained s1 on a small dataset of 1,000 questions rather than a larger dataset of 59,000 questions, having determined that the smaller set was sufficient for strong reasoning performance. The model was refined using distillation, a technique in which a smaller model is trained on the outputs of a larger AI model so that it picks up the larger model’s reasoning patterns.
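For readers who want to see the idea in code, the following is a minimal sketch of how a distillation dataset might be assembled: questions are drawn from a large pool and a larger "teacher" model supplies the reasoning and answer for each. Everything here is illustrative. `toy_teacher` is a hypothetical stand-in for a real reasoning model, the arithmetic questions are invented, and uniform random sampling is an assumption, since the article does not say how the 1,000 questions were chosen.

```python
import random
from dataclasses import dataclass


@dataclass
class Example:
    question: str
    reasoning: str  # the teacher's step-by-step trace
    answer: str     # the teacher's final answer


def toy_teacher(question):
    """Hypothetical stand-in for a large reasoning model.

    It only understands questions of the form "What is A + B?".
    """
    parts = question.rstrip("?").split()
    a, b = int(parts[2]), int(parts[4])
    return f"Add {a} and {b}.", str(a + b)


def build_distillation_set(questions, teacher, k, seed=0):
    """Pick k questions and record the teacher's output for each one."""
    rng = random.Random(seed)
    chosen = rng.sample(questions, k)  # assumption: uniform random choice
    examples = []
    for q in chosen:
        reasoning, answer = teacher(q)
        examples.append(Example(q, reasoning, answer))
    return examples


# Sizes taken from the article: a 59,000-question pool cut down to 1,000.
pool = [f"What is {i} + {i + 1}?" for i in range(59_000)]
dataset = build_distillation_set(pool, toy_teacher, k=1_000)
```

The resulting `dataset` is what the smaller "student" model would then be fine-tuned on.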

To train s1, the researchers used 16 Nvidia H100 GPUs, a far smaller compute budget than most large-scale AI models require. The training process used supervised fine-tuning (SFT), in which the model is explicitly taught to reproduce the example responses in its training dataset.
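As a rough illustration of what supervised fine-tuning optimizes, the sketch below computes the usual SFT objective: the average negative log-likelihood the model assigns to the target tokens. It is a toy in plain Python, not the researchers' training code. The three-token vocabulary and the logits are made up, and masking out the prompt tokens is a common convention assumed here rather than something the article describes.

```python
import math


def softmax(logits):
    """Turn raw scores into probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sft_loss(logits_per_step, target_ids, loss_mask):
    """Mean negative log-likelihood of the target tokens where mask == 1."""
    total, count = 0.0, 0
    for logits, target, keep in zip(logits_per_step, target_ids, loss_mask):
        if keep:
            total += -math.log(softmax(logits)[target])
            count += 1
    if count == 0:
        raise ValueError("loss_mask selects no tokens")
    return total / count


# Toy setup: 3-token vocabulary, 3 positions.
# Position 0 is treated as prompt (masked out); positions 1-2 are the response.
targets = [1, 0, 2]
mask = [0, 1, 1]

# A model with no preference between tokens...
uniform_loss = sft_loss([[0.0, 0.0, 0.0]] * 3, targets, mask)

# ...versus one that already leans toward the target tokens.
confident_logits = [[0.0, 0.0, 0.0], [2.0, 0.0, 0.0], [0.0, 0.0, 2.0]]
confident_loss = sft_loss(confident_logits, targets, mask)
```

Training nudges the model's weights so that this loss falls, which is the sense in which the model learns to mimic its training examples: `confident_loss` comes out lower than `uniform_loss` because that model puts more probability on the target tokens.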