Qwen2-Math Is Now Open Source! A First Look at Synthetic Math Data Generation
The Alibaba Tongyi team has open-sourced Qwen2-Math, its next-generation family of math-specialized models, released in three parameter sizes. The base models are pre-trained on a mathematical corpus, and the instruction-tuned versions are further improved with a reward mechanism and rejection sampling. In reported mathematical evaluations, Qwen2-Math surpasses mainstream models, positioning it as the most advanced math-specialized model at release. This article discusses the advantages of synthetic data generation, such as easing privacy concerns and producing better-structured data, walks through model download and inference, and demonstrates ways to generate mathematical data for educational use. The model can supply high-quality data for mathematical modeling, educational technology, and model fine-tuning.
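To make the idea of reward-guided rejection sampling concrete, here is a minimal sketch in Python. It is not the Tongyi team's pipeline: `toy_generator`, `toy_reward`, and `rejection_sample` are hypothetical names invented for illustration, the "model" is a random guesser for simple addition problems, and the reward is an exact-match check. The sketch only shows the general pattern of sampling several candidates, scoring each one, and keeping those that clear a threshold.

```python
import random
from typing import Callable, List, Tuple

Problem = Tuple[int, int]  # hypothetical problem type: "compute a + b"


def toy_generator(problem: Problem, rng: random.Random) -> int:
    """Hypothetical stand-in for a model: a noisy guess at a + b."""
    a, b = problem
    return a + b + rng.choice([-1, 0, 0, 1])


def toy_reward(problem: Problem, answer: int) -> float:
    """Hypothetical reward: 1.0 for the exact sum, 0.0 otherwise."""
    a, b = problem
    return 1.0 if answer == a + b else 0.0


def rejection_sample(
    problem: Problem,
    generate: Callable[[Problem, random.Random], int],
    reward: Callable[[Problem, int], float],
    n_candidates: int = 8,
    threshold: float = 0.5,
    seed: int = 0,
) -> List[int]:
    """Sample n_candidates answers and keep those scoring >= threshold."""
    rng = random.Random(seed)
    candidates = [generate(problem, rng) for _ in range(n_candidates)]
    # Reject every candidate whose reward falls below the threshold.
    return [c for c in candidates if reward(problem, c) >= threshold]


if __name__ == "__main__":
    kept = rejection_sample((2, 3), toy_generator, toy_reward, n_candidates=16)
    print(f"kept {len(kept)} of 16 candidates: {kept}")
```

In a real setting the generator would be a language model producing full solutions and the reward would come from a learned scorer or an answer checker. The accepted samples are what would be kept as training or synthetic data.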