Topic extraction with Non-negative Matrix Factorization and Latent Dirichlet Allocation
  • References/Python/scikit-learn/Examples/Examples based on real world datasets

This is an example of applying Non-negative Matrix Factorization and Latent Dirichlet Allocation on a

2025-01-10 15:47:30
Robust linear model estimation using RANSAC
  • References/Python/scikit-learn/Examples/Generalized Linear Models

In this example we see how to robustly fit a linear model to faulty data using the RANSAC algorithm.

2025-01-10 15:47:30
Cross-validation on Digits Dataset Exercise
  • References/Python/scikit-learn/Examples/Tutorial exercises

A tutorial exercise using Cross-validation with an SVM on the Digits dataset. This exercise is used in the

2025-01-10 15:47:30
Pipelining
  • References/Python/scikit-learn/Examples/General examples

The PCA does an unsupervised dimensionality reduction, while the logistic regression does the prediction. We use a GridSearchCV to

2025-01-10 15:47:30
Comparison of Manifold Learning methods
  • References/Python/scikit-learn/Examples/Manifold learning

An illustration of dimensionality reduction on the S-curve dataset with various manifold learning methods. For a discussion and comparison of

2025-01-10 15:47:30
Pipeline Anova SVM
  • References/Python/scikit-learn/Examples/Feature Selection

Simple usage of Pipeline that runs successively a univariate feature selection with anova and then a C-SVM of the selected features.

2025-01-10 15:47:30
Plot Ridge coefficients as a function of the L2 regularization
  • References/Python/scikit-learn/Examples/Generalized Linear Models

Ridge Regression is the estimator used in this example. Each color in the left plot represents one different dimension of the coefficient vector, and this is displayed as a function of the regularization parameter. The right plot shows how exact the solution is. This example illustrates how a well defined solution is found by Ridge regression and how regularization affects the coefficients and their values. The plot on the right shows how the difference of the coefficients from the estimator c

2025-01-10 15:47:30
FeatureHasher and DictVectorizer Comparison
  • References/Python/scikit-learn/Examples/Working with text documents

Compares FeatureHasher and DictVectorizer by using both to vectorize text documents. The example demonstrates syntax and speed only; it doesn

2025-01-10 15:47:30
Selecting the number of clusters with silhouette analysis on KMeans clustering
  • References/Python/scikit-learn/Examples/Clustering

Silhouette analysis can be used to study the separation distance between the resulting clusters. The silhouette

2025-01-10 15:47:30
Online learning of a dictionary of parts of faces
  • References/Python/scikit-learn/Examples/Clustering

This example uses a large dataset of faces to learn a set of 20 x 20 images patches that constitute faces. From the programming standpoint

2025-01-10 15:47:30