<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Training on Jeanphilo Blog</title>
    <link>https://shio-chan-dev.github.io/jeanblog/zh/tags/training/</link>
    <description>Recent content in Training on Jeanphilo Blog</description>
    <generator>Hugo -- 0.159.2</generator>
    <language>zh-cn</language>
    <lastBuildDate>Sat, 24 Jan 2026 16:28:18 +0800</lastBuildDate>
    <atom:link href="https://shio-chan-dev.github.io/jeanblog/zh/tags/training/index.xml" rel="self" type="application/rss+xml"/>
    <item>
      <title>The Momentum Optimization Process: From Intuition to Formula</title>
      <link>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/momentum-optimizer-process/</link>
      <pubDate>Sat, 24 Jan 2026 16:28:18 +0800</pubDate>
      <guid>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/momentum-optimizer-process/</guid>
      <description>Explains the update process, intuition, and engineering trade-offs of momentum optimization, with a minimal PyTorch example.</description>
    </item>
    <item>
      <title>Understanding Optimizers: Engineering Trade-offs from SGD to Adam</title>
      <link>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/optimizer-overview/</link>
      <pubDate>Sat, 24 Jan 2026 16:27:20 +0800</pubDate>
      <guid>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/optimizer-overview/</guid>
      <description>A systematic walkthrough of common optimizer principles and engineering trade-offs, with a minimal PyTorch example and practical recommendations.</description>
    </item>
    <item>
      <title>BN vs. Dropout: Key Differences Between Training and Inference</title>
      <link>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/bn-vs-dropout-train-infer/</link>
      <pubDate>Sat, 24 Jan 2026 16:24:44 +0800</pubDate>
      <guid>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/bn-vs-dropout-train-infer/</guid>
      <description>A systematic comparison of how BatchNorm and Dropout behave at training versus inference time, with a minimal PyTorch example.</description>
    </item>
    <item>
      <title>Can BatchNorm Be Used in Transformers?</title>
      <link>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/batchnorm-in-transformer/</link>
      <pubDate>Sat, 24 Jan 2026 16:24:03 +0800</pubDate>
      <guid>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/batchnorm-in-transformer/</guid>
      <description>Discusses the feasibility, limitations, and engineering trade-offs of using BatchNorm in Transformers, with a minimal example.</description>
    </item>
    <item>
      <title>The Role of Residual Connections: Why Deep Networks Cannot Do Without Them</title>
      <link>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/residual-connection-role/</link>
      <pubDate>Sat, 24 Jan 2026 16:22:22 +0800</pubDate>
      <guid>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/residual-connection-role/</guid>
      <description>Explains the role and underlying principles of residual connections in deep networks, with a minimal runnable example.</description>
    </item>
    <item>
      <title>SGD vs Adam: Optimizer Principles and Engineering Trade-offs</title>
      <link>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/sgd-vs-adam-optimizer/</link>
      <pubDate>Sat, 24 Jan 2026 16:12:12 +0800</pubDate>
      <guid>https://shio-chan-dev.github.io/jeanblog/zh/ai/llm/sgd-vs-adam-optimizer/</guid>
      <description>Compares the principles, convergence behavior, and use cases of SGD and Adam, with a minimal PyTorch example.</description>
    </item>
    <item>
      <title>CLIP Series (2/3): A Fully Reproducible PyTorch Walkthrough, from Data to the Training Loop</title>
      <link>https://shio-chan-dev.github.io/jeanblog/zh/ai/clip/2-clip-pytorch-reproducible-implementation/</link>
      <pubDate>Sat, 24 Jan 2026 12:46:49 +0800</pubDate>
      <guid>https://shio-chan-dev.github.io/jeanblog/zh/ai/clip/2-clip-pytorch-reproducible-implementation/</guid>
      <description>Builds a minimal CLIP training loop from CIFAR-10 and text prompts, with a complete, reproducible PyTorch script.</description>
    </item>
  </channel>
</rss>