SHAOJIE'S BOOK

Posted 2026-02-27Updated 2026-03-04Artificial Intelligence4 minutes read (About 659 words)

Business Trip: 2601-2602 verl + DanceGRPO

导言

ZJ内部出差，从0到1完成verl + MindSpeed MM + DanceGRPO算法的 t2v RL，达成reward快速持续上升。

Posted 2026-02-02Updated 2026-03-04Artificial Intelligence24 minutes read (About 3561 words)

导言

AI浪潮下，一开始是代码补全，之后是Vibe Coding，现在是Agent（规范驱动开发(Spec-driven Development)），后续趋势是Agent Team/Swarm。作为一个程序员，应当以什么姿势拥抱AI时代的代码编程，是需要持续关注的问题。

Posted 2026-02-02Updated 2026-03-04Artificial Intelligence12 minutes read (About 1806 words)

My Digital Worker : Target 1

导言

第一阶段的目标: 接入api模型，完成每日的工作相关基础的信息收集和整理归档。
第二阶段的目标: 无监管处理较简单事项；
第三阶段的目标: 参与构建复杂系统，和辅助重要决策。

Posted 2026-02-02Updated 2026-03-04Artificial Intelligence9 minutes read (About 1326 words)

My Digital Worker : AutoMoneyMaker - AutoTrader

导言

量化交易一直是最火的自动赚钱的途径：

经过调研，个人量化从技术上是可行的。
加上现在agent coding能力起来了。
原本是自己在写AQTP仓，但是发现了 zvt 这个偏个人的研究策略仓，和更关注实盘高频模拟的 vnpy
现在把精力转移到开源仓的使用
zvt 仓的使用和二次开发上；（思路、可视化、数据库、策略拓展性都感觉OK）
QUANTAXIS 通过 Rust 加速；
AI 向 Qbot 和 microsoft/qlib 方法尝试。Qbot还支持接入飞书。

Posted 2026-02-02Updated 2026-03-04Artificial Intelligence9 minutes read (About 1362 words)

My Digital Worker

导言

Agent 概念与 OpenClaw 的爆火，本质上反映了人们对个人数字员工（Digital Worker）能力的期待：它不只是一个对话式 AI，而是一个可以在真实工作流中长期运行、承担任务、放大个人生产力的“虚拟员工”。

我真正关心的问题是：如何为自己的具体工作场景配置合适的数字员工，使其在时间与认知两个维度上对个人效率形成倍增效应。

Posted 2025-11-26Updated 2026-03-04Programming31 minutes read (About 4720 words)

Pytorch 7 ：Memory Optimization(Freeing GPU/NPU Memory Early)

导言

对于不使用的python对象，如何释放？
python 的对象管理机制
del，empty_cache , gc_collect的原理

Posted 2025-11-26Updated 2026-03-04Programming3 minutes read (About 510 words)

Pytorch 8 ：Hyperparameter

导言

learning rate、clip_norm、梯度累计、micro bs 这些通用超参，应该如何调整。

Posted 2025-09-19Updated 2026-03-04Programming18 minutes read (About 2766 words)

Pytorch 2.5 ：Dataset & Dataloader

导言

数据集与数据加载器：学习如何使用torch.utils.data.Dataset和DataLoader来加载和处理数据。
数据预处理：介绍常用的数据预处理方法，如归一化、数据增强等。

Posted 2025-01-02Updated 2026-03-04Artificial Intelligence3 minutes read (About 491 words)

AI Model Visualization

导言

作为一个AI初学者，总是遇到以下场景：

客户正在基于NV开发一个AI模型，需要同步的做昇腾适配。手上只有NV下的代码。
往往很难将论文里的AI模型的图，和代码里的每一层以及参数对应起来。

设计期望：

在模型开发的过程中，能简单插入，来明确当前模块的大致信息。
1. 名称，类型(卷积层，池化层)，输入/输出/参数, 执行的时间(第一次)。
可视化
格式兼容cpprinter。
能体现出TP，CP等并行策略的效果。

大致思路：

还是借助chrome://tracing格式，来设计类似PyPrinter的工具。
早期可以使用VizTracer代替。

Posted 2023-12-17Updated 2026-03-04Artificial Intelligence4 minutes read (About 544 words)

Deploy OpenLLM to one A100

导言

Practice is the best teacher in learning.

Categories

Subscribe for updates

follow.it

Links

Recents

Archives

Tags