Category: Artificial Intelligence

Posted 2023-12-18 | Updated 2026-03-11 | Artificial Intelligence | 15 minutes read (About 2182 words)

Deploy Stable Diffusion to A100

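Introduction

Image inference mostly goes through GUIs (ComfyUI, Stable Diffusion WebUI) [^2]
Training is based on kohya-trainer and a GUI; tagged anime-style image data can be scraped from danbooru.
Models and method implementations (such as the LyCORIS framework?) can be downloaded for free from civitai.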

Posted 2023-12-18 | Updated 2026-03-11 | Artificial Intelligence | 3 minutes read (About 467 words)

CV Model

Introduction

Related to AIGC image generation.

Posted 2023-12-18 | Updated 2026-03-11 | Artificial Intelligence | a few seconds read (About 47 words)

Inference Basic

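Introduction

RL involves inference, and the details of the inference pipeline are not very clear to me.

warmup, computing the KV cache
chunked prefill, reducing the GPU memory used by prefill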
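
To make the chunked prefill idea concrete, here is a minimal pure-Python sketch, not how any particular inference engine implements it. It assumes single-head causal attention with the query, key, and value vectors already given (a real model would compute them from the hidden states of each chunk). The function names `attend`, `full_prefill`, and `chunked_prefill` are hypothetical. The point is that processing the prompt chunk by chunk while appending to a KV cache gives the same outputs as one big prefill, while each step only handles `chunk_size` queries at a time.

```python
import math


def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    total = sum(es)
    return [e / total for e in es]


def attend(q, keys, values):
    # Scaled dot-product attention of one query over the given keys/values.
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
    weights = softmax(scores)
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]


def full_prefill(qs, ks, vs):
    # Causal attention over the whole prompt in one pass:
    # token t attends to tokens 0..t.
    return [attend(qs[t], ks[: t + 1], vs[: t + 1]) for t in range(len(qs))]


def chunked_prefill(qs, ks, vs, chunk_size):
    # Process the prompt in chunks; earlier chunks are only seen via the KV cache.
    k_cache, v_cache, outputs = [], [], []
    for start in range(0, len(qs), chunk_size):
        end = min(start + chunk_size, len(qs))
        # Append this chunk's keys/values to the cache.
        k_cache.extend(ks[start:end])
        v_cache.extend(vs[start:end])
        # Only the queries of the current chunk are live in this step.
        for t in range(start, end):
            outputs.append(attend(qs[t], k_cache[: t + 1], v_cache[: t + 1]))
    return outputs
```

In this sketch the saving comes from the number of queries processed per step: the attention scores alive at once are at most `chunk_size` rows instead of one row per prompt token. The KV cache itself still grows to the full prompt length.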

Posted 2023-12-18 | Updated 2026-03-11 | Artificial Intelligence | 9 minutes read (About 1409 words)

Inference Optimization

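Introduction

Training has to compute and update gradients, so it is generally compute-bound, whereas inference is generally memory-bound.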
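
One rough way to see the compute-bound versus memory-bound split is arithmetic intensity, the number of FLOPs per byte moved, for a single matrix multiply. The sketch below is a back-of-the-envelope estimate under stated assumptions: every operand is read or written exactly once, elements are 2 bytes (fp16/bf16), and caches are ignored. The helper name `matmul_arithmetic_intensity` and the 4096 sizes are illustrative only.

```python
def matmul_arithmetic_intensity(m, k, n, bytes_per_elem=2):
    # (m x k) activations times (k x n) weights -> (m x n) output.
    flops = 2 * m * k * n  # one multiply and one add per term
    # Assumption: each matrix crosses the memory bus exactly once.
    bytes_moved = bytes_per_elem * (m * k + k * n + m * n)
    return flops / bytes_moved


# Decode-like step: one token against a 4096 x 4096 weight matrix.
decode_like = matmul_arithmetic_intensity(1, 4096, 4096)

# Training-like or prefill-like step: 4096 tokens against the same weights.
batch_like = matmul_arithmetic_intensity(4096, 4096, 4096)

print(f"m=1:    {decode_like:.2f} FLOPs/byte")
print(f"m=4096: {batch_like:.2f} FLOPs/byte")
```

With one token per step, the weights are loaded for roughly one FLOP per byte, so memory bandwidth is the limit. With thousands of tokens per step, the same weights are reused across the batch and the intensity rises by about three orders of magnitude.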

Posted 2023-12-18 | Updated 2026-03-11 | Artificial Intelligence | 27 minutes read (About 4004 words)

AI Training Optimization

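Introduction

Training has to compute and update gradients, so it is generally compute-bound, whereas inference is generally memory-bound.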

Posted 2023-12-18 | Updated 2026-03-11 | Artificial Intelligence | 7 minutes read (About 1090 words)

[LLM]: DeepSeekV3

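Introduction

I was originally on the multimodal team but got pulled in to optimize TX's dspv3 deployment, so I still need to get familiar with the relevant concepts and logic.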

Posted 2023-12-18 | Updated 2026-03-11 | Artificial Intelligence | 22 minutes read (About 3340 words)

LLM Model

Introduction

Foundation Models (One4All): General pre-training model

LLM path, generative-ai-for-beginners

Leaderboards:

Posted 2023-12-18 | Updated 2026-03-11 | Artificial Intelligence | 14 minutes read (About 2139 words)

LLM Model Basic

Introduction

LLM concepts such as prefill, decode, and KV cache.

Posted 2023-12-18 | Updated 2026-03-11 | Artificial Intelligence | an hour read (About 8696 words)

Classical AI Models

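Introduction

Machine learning and AI models and algorithms started out imitating neurons, and are now tailored to the task or built on naive ideas (for example adversarial training, receptive fields, and attention mechanisms). Model design changes by the day, and designs differ radically from one another. From a high-performance computing perspective, however, it all still comes down to differentiation, matrix operations, and activation function computation. What remains worth considering is finding the greatest common divisor of the compute operations that make up current or future models and designing specialized software and hardware to accelerate it, or simply adapting and accelerating existing models.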

Posted 2023-12-17 | Updated 2026-03-11 | Artificial Intelligence | 4 minutes read (About 544 words)

Deploy OpenLLM to one A100

Introduction

Practice is the best teacher in learning.