MILK: Machine Learning Toolkit

Last update: Dec 14, 2022

Related tags

Overview

MILK: MACHINE LEARNING TOOLKIT

Machine Learning in Python

Milk is a machine learning toolkit in Python.

Its focus is on supervised classification with several classifiers available: SVMs (based on libsvm), k-NN, random forests, decision trees. It also performs feature selection. These classifiers can be combined in many ways to form different classification systems.

For unsupervised learning, milk supports k-means clustering and affinity propagation.

Milk is flexible about its inputs. It optimised for numpy arrays, but can often handle anything (for example, for SVMs, you can use any dataype and any kernel and it does the right thing).

There is a strong emphasis on speed and low memory usage. Therefore, most of the performance sensitive code is in C++. This is behind Python-based interfaces for convenience.

To learn more, check the docs at http://packages.python.org/milk/ or the code demos included with the source at milk/demos/.

Examples

Here is how to test how well you can classify some features,labels data, measured by cross-validation:

import numpy as np
import milk
features = np.random.rand(100,10) # 2d array of features: 100 examples of 10 features each
labels = np.zeros(100)
features[50:] += .5
labels[50:] = 1
confusion_matrix, names = milk.nfoldcrossvalidation(features, labels)
print 'Accuracy:', confusion_matrix.trace()/float(confusion_matrix.sum())

If want to use a classifier, you instanciate a learner object and call its train() method:

import numpy as np
import milk
features = np.random.rand(100,10)
labels = np.zeros(100)
features[50:] += .5
labels[50:] = 1
learner = milk.defaultclassifier()
model = learner.train(features, labels)

# Now you can use the model on new examples:
example = np.random.rand(10)
print model.apply(example)
example2 = np.random.rand(10)
example2 += .5
print model.apply(example2)

There are several classification methods in the package, but they all use the same interface: train() returns a model object, which has an apply() method to execute on new instances.

Details

License: MIT

Author: Luis Pedro Coelho (with code from LibSVM and scikits.learn)

API Documentation: http://packages.python.org/milk/

Mailing List: http://groups.google.com/group/milk-users

Features

SVMs. Using the libsvm solver with a pythonesque wrapper around it.
LASSO
K-means using as little memory as possible. It can cluster millions of instances efficiently.
Random forests
Self organising maps
Stepwise Discriminant Analysis for feature selection.
Non-negative matrix factorisation
Affinity propagation

Recent History

The ChangeLog file contains a more complete history.

New in 0.6.1 (11 May 2015)

Fixed source distribution

New in 0.6 (27 Apr 2015)

Update for Python 3

New in 0.5.3 (19 Jun 2013)

Fix MDS for non-array inputs
Fix MDS bug
Add return_* arguments to kmeans
Extend zscore() to work on non-ndarrays
Add frac_precluster_learner
Work with older C++ compilers

New in 0.5.2 (7 Mar 2013)

Fix distribution of Eigen with source

New in 0.5.1 (11 Jan 2013)

Add subspace projection kNN
Export pdist in milk namespace
Add Eigen to source distribution
Add measures.curves.roc
Add mds_dists function
Add verbose argument to milk.tests.run

New in 0.5 (05 Nov 2012)

Add coordinate-descent based LASSO
Add unsupervised.center function
Make zscore work with NaNs (by ignoring them)
Propagate apply_many calls through transformers
Much faster SVM classification with means a much faster defaultlearner() [measured 2.5x speedup on yeast dataset!]

For older versions, see ChangeLog file

MILK: Machine Learning Toolkit

Related tags

Overview

MILK: MACHINE LEARNING TOOLKIT

Machine Learning in Python

Examples

Details

Features

Recent History

New in 0.6.1 (11 May 2015)

New in 0.6 (27 Apr 2015)

New in 0.5.3 (19 Jun 2013)

New in 0.5.2 (7 Mar 2013)

New in 0.5.1 (11 Jan 2013)

New in 0.5 (05 Nov 2012)

Owner

Luis Pedro Coelho

CDTrans: Cross-domain Transformer for Unsupervised Domain Adaptation

Event sourced bank - A wide-and-shallow example using the Python event sourcing library

Visual Adversarial Imitation Learning using Variational Models (VMAIL)

PyTorch Implementation of Sparse DETR

Edison AT is software Depression Assistant personal.

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Supervised & unsupervised machine-learning techniques are applied to the database of weighted P4s which admit Calabi-Yau hypersurfaces.

dualFace: Two-Stage Drawing Guidance for Freehand Portrait Sketching (CVMJ)

A PyTorch implementation of the Relational Graph Convolutional Network (RGCN).

This is a simple backtesting framework to help you test your crypto currency trading. It includes a way to download and store historical crypto data and to execute a trading strategy.

A Pytorch Implementation of Source Data-free Domain Adaptation for a Faster R-CNN

OrienMask: Real-time Instance Segmentation with Discriminative Orientation Maps

:boar: :bear: Deep Learning based Python Library for Stock Market Prediction and Modelling

nnFormer: Interleaved Transformer for Volumetric Segmentation Code for paper "nnFormer: Interleaved Transformer for Volumetric Segmentation "

MultiLexNorm 2021 competition system from ÚFAL

Minimal fastai code needed for working with pytorch

Finetune SSL models for MOS prediction

GeneGAN: Learning Object Transfiguration and Attribute Subspace from Unpaired Data

bespoke tooling for offensive security's Windows Usermode Exploit Dev course (OSED)

Identifying Stroke Indicators Using Rough Sets