Prerequisites
- Data Structures
- Linear Algebra
- Probability and Statistics
- Advanced Mathematics (calculus, Taylor expansion, analytic geometry, Lagrange multipliers)
Rules
Learn the underlying principles so that you command machine learning techniques, rather than being tied down by their dazzling variety.
Roadmap
When Can Machines Learn?
- The Learning Problem: A takes D and H to get g;
- Learning to Answer Yes or No: PLA takes linearly separable D and perceptrons to get hypothesis g (see the sketch after this list);
- Types of Learning: Binary classification or regression from a batch of supervised data with concrete features;
- Feasibility of Learning: learning is PAC-possible if enough statistical data and finite |H|;
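A minimal sketch of the PLA bullet above, assuming NumPy and labels y in {-1, +1} (the function name `pla` and the iteration cap are illustrative, not from the notes): pick any misclassified point and update w ← w + y_n x_n; on linearly separable data this is guaranteed to halt.

```python
import numpy as np

def pla(X, y, max_iters=10_000):
    """Perceptron Learning Algorithm on linearly separable data:
    repeatedly correct one misclassified point until none remain."""
    X = np.hstack([np.ones((X.shape[0], 1)), X])  # fold the bias into w via x_0 = 1
    w = np.zeros(X.shape[1])
    for _ in range(max_iters):
        mistakes = np.flatnonzero(np.sign(X @ w) != y)
        if mistakes.size == 0:
            return w                 # g(x) = sign(w . x) now separates all of D
        n = mistakes[0]
        w += y[n] * X[n]             # the PLA update: w <- w + y_n * x_n
    raise RuntimeError("no convergence: data may not be linearly separable")
```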
Why Can Machines Learn?
- Training versus Testing: effective price of choice in training: growth function m_H(N) with a break point;
- Theory of Generalization: E_out ≈ E_in possible if m_H(N) breaks somewhere and N large enough;
- The VC Dimension: learning happens with finite d_VC, large N, and low E_in (see the bound below);
- Noise and Error: learning can happen with target distribution P(y|x) and low E_in with respect to err;
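For reference, the VC generalization bound that these bullets summarize, in its standard form (δ is the confidence parameter; the polynomial bound on the growth function holds for N ≥ 2 and d_VC ≥ 2):

```latex
% With probability at least 1 - \delta, for every g learned from H:
E_{\text{out}}(g) \le E_{\text{in}}(g)
  + \sqrt{\frac{8}{N}\ln\frac{4\,m_{\mathcal{H}}(2N)}{\delta}},
\qquad m_{\mathcal{H}}(N) \le N^{d_{\text{VC}}} + 1 .
```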
How Can Machines Learn?
- Linear Regression: analytic solution with linear regression hypotheses and squared error (see the first sketch after this list);
- Logistic Regression: gradient descent on cross-entropy error to get a good logistic hypothesis (see the second sketch after this list);
- Linear Models for Classification: binary classification via (logistic) regression; multiclass via OVA/OVO decomposition;
- Nonlinear Transformation: nonlinear hypotheses via nonlinear feature transform Φ plus a linear model, at the price of model complexity;
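A minimal sketch of the linear regression bullet, combined with the nonlinear-transform idea, assuming NumPy (the helper names `phi` and `linreg` are illustrative): the squared-error minimizer has the closed form w = Z†y, and running it on transformed features Φ(x) yields a hypothesis that is nonlinear in the original input space.

```python
import numpy as np

def phi(X):
    """Illustrative 2nd-order polynomial transform for 2-D inputs:
    (x1, x2) -> (1, x1, x2, x1*x2, x1^2, x2^2)."""
    x1, x2 = X[:, 0], X[:, 1]
    return np.column_stack([np.ones(len(X)), x1, x2, x1 * x2, x1**2, x2**2])

def linreg(Z, y):
    """Analytic linear regression: w = pseudo-inverse(Z) @ y
    minimizes the in-sample squared error."""
    return np.linalg.pinv(Z) @ y

# Usage: one linear solve yields a quadratic hypothesis in x-space.
# w = linreg(phi(X), y); predict with phi(X_new) @ w.
```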
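And a corresponding sketch of the logistic regression bullet (the fixed step size `eta` and iteration count are illustrative assumptions): batch gradient descent on the cross-entropy error E_in(w) = (1/N) Σ ln(1 + exp(−y_n wᵀx_n)), with y in {-1, +1}.

```python
import numpy as np

def logreg_gd(X, y, eta=0.1, iters=1000):
    """Gradient descent on cross-entropy error; y must be in {-1, +1}."""
    X = np.hstack([np.ones((X.shape[0], 1)), X])  # fold the bias into w
    w = np.zeros(X.shape[1])
    for _ in range(iters):
        # gradient = (1/N) * sum_n theta(-y_n w.x_n) * (-y_n x_n),
        # where theta(s) = 1 / (1 + exp(-s)) is the logistic function
        s = -y * (X @ w)
        theta = 1.0 / (1.0 + np.exp(-s))
        w -= eta * np.mean(theta[:, None] * (-y[:, None] * X), axis=0)
    return w

# The learned hypothesis estimates P(y = +1 | x) as theta(w . x).
```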
How Can Machines Learn Better?
- Hazard of Overfitting: overfitting happens with excessive power, stochastic/deterministic noise, and limited data;
- Regularization: minimizes augmented error, where the added regularizer effectively limits model complexity (see the sketch after this list);
- Validation: reserve validation data (possibly in cross-validation folds) to simulate the testing procedure for model selection;
- Three Learning Principles: Occam’s Razor, Sampling Bias and Data Snooping.
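A closing sketch tying the regularization and validation bullets together, assuming an L2 (ridge-style) regularizer and NumPy (the names `reg_linreg`, `select_lambda`, and the candidate λ grid are illustrative): minimizing the augmented error E_aug(w) = E_in(w) + (λ/N) wᵀw has the closed form below, and λ itself is chosen by validation error.

```python
import numpy as np

def reg_linreg(Z, y, lam):
    """Regularized linear regression: minimizing the augmented error
    E_in(w) + (lam/N) * w.w gives w = (Z'Z + lam*I)^-1 Z'y."""
    return np.linalg.solve(Z.T @ Z + lam * np.eye(Z.shape[1]), Z.T @ y)

def select_lambda(Z_tr, y_tr, Z_val, y_val,
                  lambdas=(0.0, 0.01, 0.1, 1.0, 10.0)):
    """Model selection: fit each candidate lambda on the training split,
    return the one with the lowest squared error on the held-out split."""
    def val_err(lam):
        w = reg_linreg(Z_tr, y_tr, lam)
        return np.mean((Z_val @ w - y_val) ** 2)
    return min(lambdas, key=val_err)
```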
Postscript
If you spot any problems while reading these notes, or have different views and ideas, you are welcome to email me: [email protected]
Also, notes for the companion Machine Learning Techniques series are being updated: ML-Techniques-Index