2025-11-25
Train Stages: Pretrain, Mid-Train(CT), SFT, RL
Artificial Intelligence
RL Algorithms: PPO-RLHF & GRPO-family
2025-11-19
Bridging the Gap: Challenges and Trends in Multimodal RL.
2025-10-11
Way 2 Wealth Freedom
OOW
2025-09-19
Pytorch 2.5: Dataset & Dataloader
Programming
2025-09-15
Why Choose Quantitative Finance
2025-05-25
Blind Date 1st(2)
2025-05-11
Blind Date 1st
2025-05-10
Blind Date Tips
2025-04-17
Ideas around Vision-Language Models (VLMs) / Reasoning Models
Shaojie Tan
Computer Architecture & ๐๐๐
Anhui, Hefei, China
Posts
475
Categories
36
Tags
546
2025-12-15
QCC: Quality Control Circle
Overview
2025-12-11
SGLang
AI
2025-12-10
DiffSynth & ms-swift
VeOmni
2025-12-09
Pip Cache
Tutorials