The Digit Dataset
  • References/Python/scikit-learn/Examples/Dataset examples

This dataset is made up of 1797 8x8 images. Each image, like the one shown below, is of a hand-written digit. In order to utilize an 8x8 figure like this, we?d have to first

2025-01-10 15:47:30
Feature importances with forests of trees
  • References/Python/scikit-learn/Examples/Ensemble methods

This examples shows the use of forests of trees to evaluate the importance of features on an artificial classification task. The red bars are the feature

2025-01-10 15:47:30
Feature agglomeration vs. univariate selection
  • References/Python/scikit-learn/Examples/Clustering

This example compares 2 dimensionality reduction strategies: univariate feature selection with Anova feature

2025-01-10 15:47:30
FeatureHasher and DictVectorizer Comparison
  • References/Python/scikit-learn/Examples/Working with text documents

Compares FeatureHasher and DictVectorizer by using both to vectorize text documents. The example demonstrates syntax and speed only; it doesn

2025-01-10 15:47:30
SGD: convex loss functions
  • References/Python/scikit-learn/Examples/Generalized Linear Models

A plot that compares the various convex loss functions supported by

2025-01-10 15:47:30
Concentration Prior Type Analysis of Variation Bayesian Gaussian Mixture
  • References/Python/scikit-learn/Examples/Gaussian Mixture Models

This example plots the ellipsoids obtained from a toy dataset (mixture of three Gaussians) fitted by the Baye

2025-01-10 15:47:30
SVM: Weighted samples
  • References/Python/scikit-learn/Examples/Support Vector Machines

Plot decision function of a weighted dataset, where the size of points is proportional to its weight. The sample weighting rescales the C parameter, which means

2025-01-10 15:47:30
Discrete versus Real AdaBoost
  • References/Python/scikit-learn/Examples/Ensemble methods

This example is based on Figure 10.2 from Hastie et al 2009 [1] and illustrates the difference in performance between the discrete SAMME [2] boosting algorithm

2025-01-10 15:47:30
Robust Scaling on Toy Data
  • References/Python/scikit-learn/Examples/Preprocessing

Making sure that each Feature has approximately the same scale can be a crucial preprocessing step. However, when data contains outliers,

2025-01-10 15:47:30
SVM with custom kernel
  • References/Python/scikit-learn/Examples/Support Vector Machines

Simple usage of Support Vector Machines to classify a sample. It will plot the decision surface and the support vectors.

2025-01-10 15:47:30