OOB Errors for Random Forests
  • References/Python/scikit-learn/Examples/Ensemble methods

The RandomForestClassifier is trained using bootstrap aggregation, where each new tree is fit from a bootstrap sample of the training observations

2025-01-10 15:47:30
Label Propagation digits active learning
  • References/Python/scikit-learn/Examples/Semi Supervised Classification

Demonstrates an active learning technique to learn handwritten digits using label propagation. We start by training a label propagation model

2025-01-10 15:47:30
Underfitting vs.
  • References/Python/scikit-learn/Examples/Model Selection

This example demonstrates the problems of underfitting and overfitting and how we can use linear regression with polynomial features to approximate nonlinear functions

2025-01-10 15:47:30
SVM with custom kernel
  • References/Python/scikit-learn/Examples/Support Vector Machines

Simple usage of Support Vector Machines to classify a sample. It will plot the decision surface and the support vectors.

2025-01-10 15:47:30
Plot randomly generated classification dataset
  • References/Python/scikit-learn/Examples/Dataset examples

Plot several randomly generated 2D classification datasets. This example illustrates the datasets.make_classification datasets

2025-01-10 15:47:30
Pipelining
  • References/Python/scikit-learn/Examples/General examples

The PCA does an unsupervised dimensionality reduction, while the logistic regression does the prediction. We use a GridSearchCV to

2025-01-10 15:47:30
Robust linear model estimation using RANSAC
  • References/Python/scikit-learn/Examples/Generalized Linear Models

In this example we see how to robustly fit a linear model to faulty data using the RANSAC algorithm.

2025-01-10 15:47:30
The Digit Dataset
  • References/Python/scikit-learn/Examples/Dataset examples

This dataset is made up of 1797 8x8 images. Each image, like the one shown below, is of a hand-written digit. In order to utilize an 8x8 figure like this, we?d have to first

2025-01-10 15:47:30
Pipeline Anova SVM
  • References/Python/scikit-learn/Examples/Feature Selection

Simple usage of Pipeline that runs successively a univariate feature selection with anova and then a C-SVM of the selected features.

2025-01-10 15:47:30
Model Complexity Influence
  • References/Python/scikit-learn/Examples/Examples based on real world datasets

Demonstrate how model complexity influences both prediction accuracy and computational performance. The dataset is the Boston Housing dataset (resp. 20 Newsgroups)

2025-01-10 15:47:30