Zhang Chi says ByteDance trains a large model in about 6 months vs Google's 3, a pace that hinders China's AI progress. Seed aims high but lags in practice; after DeepSeek, it shifted toward reinforcement learning.Zhang Chi, a former ByteDance Seed engineer and now a PKU assistant professor, discusses in Into Asia that ByteDance’s full large-model cycle takes about six months, while Google is rumored to need around three, making iteration speed a key bottleneck for Chinese AI development. He describes Seed’s culture of bench-based optimization and notes that, despite apparent parity on paper, Chinese models lag in real-world use. Seed aims to be globally top but has not caught up; after the DeepSeek era, the team pivoted toward reinforcement learning to close the gap.

AirdropBlackHole

2026-04-24 09:31:17

Abstract generation in progress

According to monitoring by Dongcha Beating, Zhang Chi, a former engineer from ByteDance’s Seed team and now an assistant professor at Peking University, revealed in the podcast “Into Asia” that it takes about six months for ByteDance to complete one round of large model training (pre-training plus post-training), while Google is rumored to only need three months. He believes that the speed of iteration is one of the core reasons why Chinese companies find it difficult to catch up. Zhang worked at ByteDance for about a year, and his math team’s focus was more research-oriented. He described the team’s positioning as ‘more for publicity,’ which differs from the teams responsible for model delivery in pre-training and post-training. Zhang described the internal ‘benchmaxxing’ culture at Seed: team leaders evaluate performance based on the benchmarks they are responsible for, and everyone is focused on scoring, ‘but this does not translate into a good experience in actual use.’ He stated that on paper, the models of large Chinese companies can match the leading models in the U.S., but in practice, they are ‘not good enough.’ The goal of Seed is to be globally top-notch, ‘but unfortunately, I do not think we have caught up,’ and even the goal of being number one domestically ‘has not been achieved.’ By the end of 2024, Seed believed it had caught up with GPT-4o, but after the release of DeepSeek, the team realized that the gap still existed, and when he joined, the entire group was urgently shifting towards reinforcement learning.

This page may contain third-party content, which is provided for information purposes only (not representations/warranties) and should not be considered as an endorsement of its views by Gate, nor as financial or professional advice. See Disclaimer for details.

Reward
like
Comment
Repost
Share

Comment

Add a comment

No comments

Trending Topics
View More
#
WCTCTradingKingPK
153.53K Popularity
#
CryptoMarketSeesVolatility
218.7K Popularity
#
rsETHAttackUpdate
66.46K Popularity
#
US-IranTalksStall
172.76K Popularity
#
ETHMemeCoinFLORKSurges
35.65K Popularity

Sitemap

Former ByteDance Seed Engineer: ByteDance Takes Six Months for One Iteration, Google Allegedly Only Three Months

Trending Topics

WCTCTradingKingPK

CryptoMarketSeesVolatility

rsETHAttackUpdate

US-IranTalksStall

ETHMemeCoinFLORKSurges

Pin