Chinese artificial intelligence startup DeepSeek and Tsinghua University have jointly developed a new model that can reportedly improve the reasoning ability of large language models, reduce the amount of training to reduce operating costs, and may be applied to the upcoming next-generation large model R2.
Combined with reports from Bloomberg and the South China Morning Post, DeepSeek and researchers at Tsinghua University have collaborated to detail in a paper published on Friday (April 4) a new reinforcement learning method that can improve the efficiency of artificial intelligence models.
They developed an approach that combines "Generalist Reward Modeling" (GRM) and "Self-Principled Critique Tuning," Make large language models answer general query questions better and faster.
In the paper, the researchers say the new approach helps AI models better follow human preferences and outperform existing methods and models on various benchmarks by rewarding more accurate and understandable responses. The results show that you can get better performance using fewer computing resources.
The paper calls the new model "DeepSeek-GRM" and it will be released as open source, but it doesn't give a specific timeline. The Massachusetts Institute of Technology Technology Review reports that the new training method may be applied to DeepSeek's next-generation large model R2.
DeepSeek's R1, a low-cost large model launched in January this year, attracted global attention. In February, Reuters quoted people familiar with the matter as saying that the company, eager to capitalize on its rising popularity, may bring forward the R2 model, which was originally scheduled to be released in May.
Recently, the statements made by Wale Edun, Nigeria's Minister of Finance and Minister of Economic Coordination, and Olayemi Cardoso, the governor of the Central Bank, during the Spring Meetings of the International Monetary Fund and the World Bank in 2025, attempted to demonstrate the achievements of Nigeria's economic reforms and its future resilience.
Recently, the statements made by Wale Edun, Nigeria's Minis…
It has been nearly a hundred days since Trump returned to t…
On Sunday, a suspect was charged with murder for driving in…
Soybeans, corn and other agricultural products are widely g…
Recently, the three major U.S. stock index futures have col…
Recently, the White House announced that Trump will hold a …