Shanghai Jieyue Xingchen Intelligent Technology Co., Ltd, known as StepFun, is an artificial intelligence (AI) company based in Shanghai, China. It has been dubbed one of China's "AI Tiger" companies by investors.
On 25 February 2026, it was reported that StepFun was seeking an initial public offering on the Hong Kong Stock Exchange.[4]
StepFun focuses on multimodal models, which are designed to understand multiple types of input data, such as text, video and audio.[5]
Products
At the World Artificial Intelligence Conference in July 2024, StepFun officially launched Step-2, a trillion-parameter large language model (LLM), along with the Step-1.5V multimodal model and the Step-1X image generation model.[6]
In February 2025, StepFun and Geely jointly announced that they were open-sourcing two multimodal large models, Step-Video-T2V and Step-Audio, for developers worldwide.[7][8]
In April 2025, Step-R1-V-Mini was released. It is a multimodal reasoning model designed for visual interpretation and image understanding.[5]
In July 2025, StepFun released Step 3.[9] The Model-Chip Ecosystem Innovation Alliance aimed to optimize Step 3 for domestic chips.[10]
In February 2026, Step-3.5-Flash, a mixture-of-experts model with 196 billion total parameters, of which 11 billion are active, was released under the free and open-source Apache 2.0 license. It supports tool use and a 256k-token context window.[11][12]