Goku Submitted Paper on AI Training Breakthrough

05/23/2025

Goku Technologies recently published a paper on Step-wise Adaptive Integration of Supervised Fine-tuning and Reinforcement Learning (SASR), which is available on arXiv and will be submitted to NeurIPS. SASR is an evolutionary training method for LLMs. Earlier LLMs relied on supervised fine-tuning (SFT), while DeepSeek split SFT and reinforcement learning (RL) into separate stages. SASR is a new framework that uses an adaptive algorithm to dynamically adjust the training weights of SFT and RL throughout a single continuous training process, which further enhances the efficiency of LLMs.
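
To make the idea of adjusting SFT and RL weights step by step more concrete, here is a minimal sketch in Python. It is not the algorithm from the paper: the weighting rule, the function names (sft_weight, combined_loss, run_schedule), and the reference_loss parameter are all assumptions chosen for illustration. The only point is that a single training loop can blend the two objectives with a weight recomputed at every step, rather than running SFT and RL as separate stages.

```python
from typing import List, Tuple


def sft_weight(sft_loss: float, reference_loss: float) -> float:
    """Hypothetical adaptive rule (an assumption, not the paper's formula).

    The SFT weight grows when the current SFT loss is high relative to a
    reference loss, and shrinks toward RL as the SFT loss falls.
    """
    if sft_loss < 0 or reference_loss <= 0:
        raise ValueError("sft_loss must be >= 0 and reference_loss must be > 0")
    return sft_loss / (sft_loss + reference_loss)


def combined_loss(sft_loss: float, rl_loss: float, alpha: float) -> float:
    """Blend the two objectives with SFT weight alpha and RL weight 1 - alpha."""
    return alpha * sft_loss + (1.0 - alpha) * rl_loss


def run_schedule(
    sft_losses: List[float], rl_losses: List[float], reference_loss: float
) -> List[Tuple[float, float]]:
    """Recompute the weight at every step and return (alpha, blended loss) pairs."""
    results = []
    for sft, rl in zip(sft_losses, rl_losses):
        alpha = sft_weight(sft, reference_loss)
        results.append((alpha, combined_loss(sft, rl, alpha)))
    return results


if __name__ == "__main__":
    # Illustrative numbers only: the SFT loss falls over three steps,
    # so the weight shifts gradually from SFT toward RL.
    for alpha, loss in run_schedule([3.0, 1.0, 0.5], [1.0, 1.0, 1.0], 1.0):
        print(f"alpha={alpha:.2f} blended_loss={loss:.2f}")
```

In this sketch the weight moves smoothly between the two objectives instead of switching at a fixed stage boundary. The rule SASR actually uses is described in the paper linked below.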

AI has been integral to Goku’s investment process since 2018. While this paper is not directly focused on quantitative investing, our R&D deepens our understanding of how we can better use tools such as LLMs to further improve risk-adjusted returns for our investors.

You can find the paper at the link below.

[2505.13026] Step-wise Adaptive Integration of Supervised Fine-tuning and Reinforcement Learning for Task-Specific LLMs
