•DeepSeek was founded in December 2023 by Liang Wenfeng, and released its first AI large language model the following year. Not much is known about Liang, who graduated from Zhejiang University with degrees in electronic information engineering and computer science.11 hours ago.
•DeepSeek-Coder and DeepSeek-Math were used to generate 20K code-related and 30K math-related instruction data, then combined with an instruction dataset of 300M tokens. This was used for SFT. RL with GRPO.
•DeepSeek is a Chinese AI company, which just a week ago launched its latest AI model, which it calls R1. The company said the model was particularly good at problem solving, performing on par with OpenAI’s o1 reasoning model—but at a fraction of the cost per use.
•He told Sky News: “What truly sets DeepSeek apart is its accessibility thanks to open-weight models. “Unlike centralised models, its open-source versions can be run locally, providing perfect data privacy. “Organisations are already deploying full models internally, ensuring complete control over sensitive information.