Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

The roadmap below can provide an indication of the algorithms we aim to implement in the coming
months. If you are interested in helping out, please check our [contribution guide](contribution_guide.html).
Items in **in bold** have already been implemented:

  • Pipelines of transformers and learners
  • Data pre-processing
    •  Feature scaling
    • Polynomial feature base mapper
    • Feature hashing
    • Feature extraction for text
    • Dimensionality reduction
  • Model selection and performance evaluation
    • Model evaluation using a variety of scoring functions
    • Cross-validation for model selection and evaluation
    • Hyper-parameter optimization
  • Supervised learning
    • Optimization framework
      • Stochastic Gradient Descent
      • L-BFGS
    • Generalized Linear Models
      • Multiple linear regression
      • LASSO, Ridge regression
      • Multi-class Logistic regression
      • Random forests
      • Support Vector Machines
    • Decision trees
  • Unsupervised learning
    • Clustering
      • K-means clustering
    • Principal Components Analysis
  • Recommendation
    • ALS
  • Text analytics
    • LDA
  • Statistical estimation tools
  • Distributed linear algebra
  • Streaming ML

...