The Bagging Algorithm

当base learner不稳定 (large variance, low bias) 时Bagging带来的提升很多

base learner需要能够响应细微的变化,因此有时over-fit是允许的

A good choice for base learner: Decision Tree

Untitled

Random Forest:

Use decision tree as basic unit in bagging

Untitled

RF incorporate randomized feature selection at each split step

Untitled

3 Rules:

  1. Random forest need basic learner aware the little change, sometimes overfit is allowable.
  2. Each time, basic learner doesn’t learn from all data, but from Random bootstrap sampled data.
  3. Basic learner doesn’t use all features, but Random select some features.

有两层随机性:数据随机、特征随机

Boosting

Untitled