Bias vs. Variance


  • Symptom (High Bias)
    $ J_{cv}(\theta) $ and $ J_{train}(\theta) $ are both high.
  • Prescription
  1. Getting additional features
  2. Adding polynomial features ($x_{1}^{2}, x_{2}^{2}, x_{1}x_{2}$, etc.)
  3. Decreasing $\lambda $


  • Symptom (High Variance)
    $ J_{cv}(\theta) \gg J_{train}(\theta) $ and $ J_{train}(\theta) $ is low.

  • Prescription

    1. Getting more training samples
    2. Getting rid of some features
    3. Increasing $\lambda$
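The two symptom/prescription pairs above can be summarized in a small diagnostic helper. This is an illustrative sketch, not part of the original notes; the `high` and `gap` thresholds are arbitrary assumptions chosen for the example.

```python
def diagnose(j_train, j_cv, high=1.0, gap=0.5):
    """Classify a model's problem from its training and CV errors.

    `high` and `gap` are illustrative thresholds, not part of the notes.
    """
    if j_train > high and abs(j_cv - j_train) < gap:
        return "high bias"      # both errors high and close together
    if j_train <= high and j_cv - j_train >= gap:
        return "high variance"  # low training error, much higher CV error
    return "ok"

print(diagnose(2.0, 2.2))  # high bias: both errors high
print(diagnose(0.1, 1.5))  # high variance: J_cv >> J_train
print(diagnose(0.1, 0.2))  # ok
```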


Very large $\lambda \rightarrow$ high bias (underfitting)
Very small $\lambda \rightarrow$ high variance (overfitting)

$\lambda$ selection: using the same training set, train a model for each candidate $\lambda$, select the $\lambda$ that gives the smallest cross-validation error, and then check the test error.
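A minimal NumPy sketch of this procedure, using closed-form ridge regression on synthetic data. The data generator, the degree-7 polynomial features, and the $\lambda$ grid are all assumptions made for illustration (note the regularized fit below also penalizes the intercept term, which the course formulation excludes).

```python
import numpy as np

rng = np.random.default_rng(0)

def make_data(n):
    # Hypothetical 1-D regression problem: cubic signal plus noise
    x = rng.uniform(-1, 1, n)
    y = x ** 3 + 0.1 * rng.normal(size=n)
    X = np.vander(x, 8)  # degree-7 polynomial features (prone to overfitting)
    return X, y

X_train, y_train = make_data(60)
X_cv, y_cv = make_data(60)
X_test, y_test = make_data(60)

def ridge_fit(X, y, lam):
    # Regularized normal equation: theta = (X^T X + lam * I)^{-1} X^T y
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

def cost(X, y, theta):
    # Unregularized squared-error cost J(theta) = (1/2m) * ||X theta - y||^2
    m = len(y)
    return np.sum((X @ theta - y) ** 2) / (2 * m)

# Train with each candidate lambda on the SAME training set,
# pick the lambda with the smallest CV error, then check test error.
lambdas = [0.0, 0.01, 0.1, 1.0, 10.0, 100.0]
cv_errors = [cost(X_cv, y_cv, ridge_fit(X_train, y_train, lam))
             for lam in lambdas]
best_lam = lambdas[int(np.argmin(cv_errors))]
theta = ridge_fit(X_train, y_train, best_lam)
print("best lambda:", best_lam)
print("test error:", cost(X_test, y_test, theta))
```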

Learning Curve

  • High Bias
    $J_{train}(\theta) $ is high and close to $J_{cv}(\theta) $.

Getting more data is useless!

  • High Variance
    There is a large gap between $J_{train}(\theta) $ and $J_{cv}(\theta) $: the training error is low while the cross-validation error is high.

Getting more data may give a better result.
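The learning curves above can be reproduced with a short sketch: fit on the first $m$ training examples, then compare the error on those $m$ examples with the error on a held-out set. The linear data and model here are illustrative assumptions; since the model matches the data, the gap shrinks as $m$ grows (the high-variance picture would show a persistent gap).

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical linear data with noise; first column is the intercept term
X_all = np.column_stack([np.ones(200), rng.uniform(-1, 1, 200)])
y_all = 2 * X_all[:, 1] + 0.2 * rng.normal(size=200)
X_train, y_train = X_all[:100], y_all[:100]
X_cv, y_cv = X_all[100:], y_all[100:]

def fit(X, y):
    # Ordinary least-squares fit
    theta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return theta

def cost(X, y, theta):
    m = len(y)
    return np.sum((X @ theta - y) ** 2) / (2 * m)

# Learning curve: train on the first m examples, evaluate J_train on
# those m examples and J_cv on the full cross-validation set.
gaps = {}
for m in (5, 20, 50, 100):
    theta = fit(X_train[:m], y_train[:m])
    j_train = cost(X_train[:m], y_train[:m], theta)
    j_cv = cost(X_cv, y_cv, theta)
    gaps[m] = j_cv - j_train
    print(f"m={m:4d}  J_train={j_train:.4f}  J_cv={j_cv:.4f}")
```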

Neural networks and overfitting

  • Using a “large” neural network with good regularization to address overfitting is usually better than using a “small” neural network, but the computational cost is higher.

Reprint policy

《Bias vs. Variance》 by 卢宁 is licensed under a Creative Commons Attribution 4.0 International License