Skip to content

v3.0.2

Latest
Compare
Choose a tag to compare
@Jintao-Huang Jintao-Huang released this 07 Jan 15:28
· 3 commits to main since this release

中文版

新特性

  1. 支持使用swift app开启可视化推理创空间,参考这里
  2. 支持大模型的RM和PPO训练,参考这里
  3. 支持SequenceClassification模型(含BERT)的BNB/GPTQ量化,参考这里
  4. 支持reward model的推理、部署和BNB/GPTQ量化

新模型

  1. ZhipuAI/cogagent-9b-20241220
  2. Reward Models: Shanghai_AI_Laboratory/internlm2-1_8b-reward系列, Qwen/Qwen2-Math-RM-72B系列, AI-ModelScope/Skywork-Reward-Llama-3.1-8B系列, AI-ModelScope/GRM_Llama3.1_8B_rewardmodel-ft系列
  3. AIDC-AI/Ovis1.6-Gemma2-27B, AIDC-AI/Ovis1.6-Llama3.2-3B
  4. PowerInfer/SmallThinker-3B-Preview

新数据集

  1. PowerInfer/LONGCOT-Refine-500K, PowerInfer/QWQ-LONGCOT-500K

English Version

New Features

  1. Support for using swift app to launch a visual inference creative space, see here
  2. Support for RM and PPO training of large models, see here
  3. Support for BNB/GPTQ quantization of SequenceClassification models (including BERT), see here
  4. Support for inference, deployment, and BNB/GPTQ quantization of reward models

New Models

  1. ZhipuAI/cogagent-9b-20241220
  2. Reward Models: Shanghai_AI_Laboratory/internlm2-1_8b-reward series, Qwen/Qwen2-Math-RM-72B series, AI-ModelScope/Skywork-Reward-Llama-3.1-8B series, AI-ModelScope/GRM_Llama3.1_8B_rewardmodel-ft series
  3. AIDC-AI/Ovis1.6-Gemma2-27B, AIDC-AI/Ovis1.6-Llama3.2-3B
  4. PowerInfer/SmallThinker-3B-Preview

New Datasets

  1. PowerInfer/LONGCOT-Refine-500K, PowerInfer/QWQ-LONGCOT-500K

What's Changed

New Contributors

Full Changelog: v3.0.1...v3.0.2