Home » » Top 20 Python Machine Learning Open Source Projects

Top 20 Python Machine Learning Open Source Projects

We examine top Python Machine learning open source projects on Github, both in terms of contributors and commits, and identify most popular and most active ones.
By Bhavya Geethika Peddibhotla. 

We analyze Top 20 Python Machine learning projects on GitHub and find that scikit-Learn, PyLearn2 and NuPic are the most actively contributed projects. Explore these popular projects on Github! 

top-python-machine-learning-projects
Fig. 1: Python Machine learning projects on GitHub, with color corresponding to commits/contributors. Bob, Iepy, Nilearn, and NuPIC have the highest such value. 

  1. scikit-learn, 18845 commits, 404 contributors, 
    www.github.com/scikit-learn/scikit-learn 
    scikit-learn is a Python module for machine learning built on top of SciPy.It features various classification, regression and clustering algorithms including support vector machines, logistic regression, naive Bayes, random forests, gradient boosting, k-means and DBSCAN, and is designed to interoperate with the Python numerical and scientific libraries NumPy and SciPy.
  2. Pylearn2, 7027 commits, 117 contributors, 
    www.github.com/lisa-lab/pylearn2 
    Pylearn2 is a library designed to make machine learning research easy. Its a library based on Theano
  3. NuPIC, 4392 commits, 60 contributors, 
    www.github.com/numenta/nupic 
    The Numenta Platform for Intelligent Computing (NuPIC) is a machine intelligence platform that implements the HTM learning algorithms. HTM is a detailed computational theory of the neocortex. At the core of HTM are time-based continuous learning algorithms that store and recall spatial and temporal patterns. NuPIC is suited to a variety of problems, particularly anomaly detection and prediction of streaming data sources.
  4. Nilearn, 2742 commits, 28 contributors, 
    www.github.com/nilearn/nilearn 
    Nilearn is a Python module for fast and easy statistical learning on NeuroImaging data. It leverages the scikit-learn Python toolbox for multivariate statistics with applications such as predictive modeling, classification, decoding, or connectivity analysis.
  5. PyBrain, 969 commits, 27 contributors, 
    www.github.com/pybrain/pybrain 
    PyBrain is short for Python-Based Reinforcement Learning, Artificial Intelligence and Neural Network Library. Its goal is to offer flexible, easy-to-use yet still powerful algorithms for Machine Learning Tasks and a variety of predefined environments to test and compare your algorithms.
  6. Pattern, 943 commits, 20 contributors, 
    www.github.com/clips/pattern 
    Pattern is a web mining module for Python. It has tools for Data Mining, Natural Language Processing, Network Analysis and Machine Learning. It supports vector space model, clustering, classification using KNN, SVM, Perceptron
  7. Fuel, 497 commits, 12 contributors, 
    www.github.com/mila-udem/fuel 
    Fuel provides your machine learning models with the data they need to learn. it has interfaces to common datasets such as MNIST, CIFAR-10 (image datasets), Google's One Billion Words (text). It gives you the ability to iterate over your data in a variety of ways, such as in minibatches with shuffled/sequential examples
  8. Bob, 5080 commits, 11 contributors, 
    www.github.com/idiap/bob 
    Bob is a free signal-processing and machine learning toolbox The toolbox is written in a mix of Python and C++ and is designed to be both efficient and reduce development time. It is composed of a reasonably large number of packages that implement tools for image, audio & video processing, machine learning and pattern recognition
  9. skdata, 441 commits, 10 contributors, 
    www.github.com/jaberg/skdata 
    Skdata is a library of data sets for machine learning and statistics. This module provides standardized Python access to toy problems as well as popular computer vision and natural language processing data sets.
  10. MILK, 687 commits, 9 contributors, 
    www.github.com/luispedro/milk 
    Milk is a machine learning toolkit in Python. Its focus is on supervised classification with several classifiers available: SVMs, k-NN, random forests, decision trees. It also performs feature selection. These classifiers can be combined in many ways to form different classification systems.For unsupervised learning, milk supports k-means clustering and affinity propagation.
  11. IEPY, 1758 commits, 9 contributors, 
    www.github.com/machinalis/iepy 
    IEPY is an open source tool for Information Extraction focused on Relation Extraction 
    It's aimed at users needing to perform Information Extraction on a large dataset. scientists wanting to experiment with new IE algorithms.
  12. Quepy, 131 commits, 9 contributors, 
    www.github.com/machinalis/quepy 
    Quepy is a python framework to transform natural language questions to queries in a database query language. It can be easily customized to different kinds of questions in natural language and database queries. So, with little coding you can build your own system for natural language access to your database. 
    Currently Quepy provides support for Sparql and MQL query languages, with plans to extended it to other database query languages.
  13. Hebel, 244 commits, 5 contributors, 
    www.github.com/hannes-brt/hebel 
    Hebel is a library for deep learning with neural networks in Python using GPU acceleration with CUDA through PyCUDA. It implements the most important types of neural network models and offers a variety of different activation functions and training methods such as momentum, Nesterov momentum, dropout, and early stopping.
  14. mlxtend, 135 commits, 5 contributors, 
    www.github.com/rasbt/mlxtend 
    Its a library consisting of useful tools and extensions for the day-to-day data science tasks.
  15. nolearn, 192 commits, 4 contributors, 
    www.github.com/dnouri/nolearn 
    This package contains a number of utility modules that are helpful with machine learning tasks. Most of the modules work together with scikit-learn, others are more generally useful.
  16. Ramp, 179 commits, 4 contributors, 
    www.github.com/kvh/ramp 
    Ramp is a python library for rapid prototyping of machine learning solutions. It's a light-weight pandas-based machine learning framework pluggable with existing python machine learning and statistics tools (scikit-learn, rpy2, etc.). Ramp provides a simple, declarative syntax for exploring features, algorithms and transformations quickly and efficiently.
  17. Feature Forge, 219 commits, 3 contributors, 
    www.github.com/machinalis/featureforge 
    A set of tools for creating and testing machine learning features, with a scikit-learn compatible API. 
    This library provides a set of tools that can be useful in many machine learning applications (classification, clustering, regression, etc.), and particularly helpful if you use scikit-learn (although this can work if you have a different algorithm).
  18. REP, 50 commits, 3 contributors, 
    www.github.com/yandex/rep 
    REP is environment for conducting data-driven research in a consistent and reproducible way. It has a unified classifiers wrapper for variety of implementations like TMVA, Sklearn, XGBoost, uBoost. It can train classifiers in parallel on a cluster. It supports interactive plots
  19. Python Machine Learning Samples, 15 commits, 3 contributors, 
    www.github.com/awslabs/machine-learning-samples 
    A collection of sample applications built using Amazon Machine Learning.
  20. Python-ELM, 17 commits, 1 contributor, 
    www.github.com/dclambert/Python-ELM 
    This is an implementation of the Extreme Learning Machine in Python, based on scikit-learn.
This post used some content from www.pansop.com/1039/ 

251 Comments:

«Oldest   ‹Older   201 – 251 of 251   Newer›   Newest»
massage.blue said...

What an excellent you are. Your presentation was so good. 스포츠마사지

gunma.top said...

"It’s amazing, your posts are really impressive, I hope to see more of this wonderful article from you 출장마사지"

michal said...


I am working as a SEO Analyst.My blog gives information about latest
technologies and IT Courses.
it courses online

visa cost Turkey said...

Great post I would like to thank you for the effort you put into writing this interesting and informative article, no matter what is the purpose of your trip to Turkey, you need to pay the visa cost Turkey before submitting the visa application .Depending on your desired option, the e Visa Turkey cost are calculated.

magosucomwep said...

It has fully emerged to crown Singapore's southern shores and undoubtedly placed her on the global map of residential landmarks. I still scored the more points than I ever have in a season for GS. I think you would be hard pressed to find somebody with the same consistency I have had over the years so I am happy with that.
As I am looking at your writing, I regret being unable to do outdoor activities due to Corona 19, and I miss my old daily life. If you also miss the daily life of those days, would you please visit my site once? My site is a site where I post about photos and daily life when I was free
경마사이트

magosucowep

Innomatics said...

Become a Data Science Expert with Innomatics. we provide Classroom training on Data Science course in Hyderabad for the individuals who believe hand-held training. We teach as per the Indian Standard Time (IST) with In-depth practical Knowledge on each topic in classroom training, 80 – 90 Hrs of Real-time practical training classes. There are different slots available on weekends or weekdays according to your choices.

data science said...

I was just examining through the web looking for certain information and ran over your blog.It shows how well you understand this subject. Bookmarked this page, will return for extra. data science course in vadodara

SCARLET BROWN said...

I think this is the best I’ve seen till now. You can certainly visit your expertise inside the article you write.

성인야설

ophunter.net said...

Just when I couldn’t love you more. You posted this pic and my jaw dropped to the floor.

휴게텔

KUPZ said...

An outstanding share! I have just forwarded this onto a co-worker who was doing a little homework on this. And he actually bought me dinner simply because I discovered it for him… lol. So let me reword this…. Thanks for the meal!! But yeah, thanks for spending some time to talk about this topic here on your web site.

마사지

WAMOS said...

I really like your blog site.. excellent shades & style. Do a person pattern this excellent website oneself or even have people hire an attorney to make it happen available for you? Please answer while I!|m seeking to style and design my very own blog as well as would wish to learn where u obtained this specific out of thanks a lot 스포츠마사지

Judy A. Walters said...



Very informative post I really like it. Happy to hear about it more. riverdale southside serpents jacket

Majortotosite Pro said...

You are truly a just right webmaster. The site loading speed is incredible. It seems that you are doing any unique trick. Also visit my site: 토토사이트

Racesite Pro said...

Hi my friend! I want to say that this post is awesome, great written and include approximately all important infos.
I would like to look extra posts like this .
온라인경마
경마사이트

Oncasinosite Net said...

What a nice comment! Nice to meet you. I live in a different country from you. Your writing will be of great help to me and to many other people living in our country. I was looking for a post like this, but I finally found 바카라

Totopick Pro said...

That is a very good tip particularly to those new to
the blogosphere. Brief but very accurate information… Thanks for sharing
this one. A must read post! 토토

Dettifoss IT Solutions said...

I was just examining through the web looking for certain information and ran over your blog.It shows how well you understand this subject.
servicenow training in hyderabad

DataScienceBOX said...

I am a new user of this site, so here I saw several articles and posts published on this site, I am more interested in some of them, will provide more information on these topics in future articles.
data science course in london

jbmglobal said...

It- Such a nice blog and I appreciate all your efforts about your thoughts. It's really good work. well done.
Website - Best CBSE School In Noida

jbmsmartstart said...

I just stumbled upon your blog and it was great to be here! Definitely, this is a great post with great content
Website - Best Preschool In Noida

timestrbute said...

I would like to thank you for the efforts you have made in writing this article. Hope to see some great stuff in the future as well.
Hindustan Times obituary

sathish said...

Nice article with valuable information. Thanks for sharing.

Python Online Training

Artificial Intelligence Online Training

Data Science Online Training

Machine Learning Online Training

AWS Online Training

UiPath Online Training

Anonymous said...

For financial markets, I would like to introduce you to some sites that have helped me a lot and teach in the field of digital currencies.
benzinga
click here
Source

tech science said...

Great to become visiting your weblog once more, it has been a very long time for me. Pleasantly this article i've been sat tight for such a long time. I will require this post to add up to my task in the school, and it has identical subject along with your review. Much appreciated, great offer. data science course in nagpur

sathish said...

Thanks for posting the best information and the blog is very helpful.

Artificial Intelligence Training in Bangalore | Artificial Intelligence Online Training

Python Training in Bangalore | Python Online Training

Data Science Training in Bangalore | Data Science Online Training

Machine Learning Training in Bangalore | Machine Learning Online Training

AWS Training in bangalore | AWS Online Training

UiPath Training in Bangalore | UiPath Online Training

Bhuvankumar said...

It is a very helpful and very informative blog. I really learned a lot from it thanks for sharing.
Data Analytics Course

Artificial Intelligence Training in BLR said...

A good blog always contains new and exciting information, and reading it I feel like this blog really has all of these qualities that make it a blog.

Artificial Intelligence Training in Bangalore

Unknown said...

This site seems to inspire me a lot. Thank you so much for organizing and providing this quality information in an easy to understand way. I think that a healthy era of big data can be maintained only when such high-quality information is continuously produced. And I, too, are working hard to organize and provide such high-quality information. It would be nice to come in once and get information.

Also visit my site:파워볼


Digital Marketing Training in BLR said...

It is late to find this act. At least one should be familiar with the fact that such events exist. I agree with your blog and will come back to inspect it further in the future, so keep your performance going.

Digital Marketing Training in Bangalore

Arnold DK said...

Nice post. I'm impressed! Extremely useful information. Thank you and keep up the good work. whatsapp mod

frzn said...

Bitcoin is built on a digitally distributed record called blockchain. As the name implies, a blockchain is a collection of linked https://econews.ir/fa/content/3174564 data consisting of units called blocks that contain information about each transaction, including date and time, total value, buyer and seller, and a unique identification code for each It is an exchange

Maneesha said...

I finally found great post here.I will get back here. I just added your blog to my bookmark sites. thanks.Quality posts is the crucial to invite the visitors to visit the web page, that's what this web page is providing.
data scientist training in hyderabad

Maneesha said...

Extremely overall quite fascinating post. I was searching for this sort of data and delighted in perusing this one. Continue posting. A debt of gratitude is in order for sharing.aws certification cost hyderabad

Professional Course said...

Really impressed! Everything is a very open and very clear clarification of the issues. It contains true facts. Your website is very valuable. Thanks for sharing.

Best Data Analytics Courses in Bangalore

tech science said...


Such a very useful article. Very interesting to read this article.I would like to thank you for the efforts you had made for writing this awesome article. data science institute in nagpur

360digitmg said...

This is also a very good post which I really enjoyed reading. It is not every day that I have the possibility to see something like this.. data science training in gurgaon

jowsaaa said...

Hello there! Quick question that’s completely off topic.
Do you know how to make your site mobile friendly? My website looks weird when viewing from my iphone.
I’m trying to find a template or plugin that might
be able to resolve this issue. If you have any recommendations, please share.
Thank you!

website:토토

sportstoto365 said...

Damn, Ive been looking for information like this for a while now. Thanks! 스포츠토토

bacarasite said...

Many thanks for the article, I have a lot of spray lining knowledge but always learn something new. Keep up the good work and thank you again. 바카라사이트

powerballsite said...

Youre so right. Im there with you. Your weblog is definitely worth a read if anyone comes throughout it. Im lucky I did because now Ive received a whole new view of this. 파워볼

casinositelink said...

Wohh exactly what I was looking for, appreciate it for posting . I truly appreciate your post. Great site! 카지노사이트

360digitmg said...

Thankyou for this wondrous post, I am happy I watched this site on yippee. cloud computing training in noida

Tutorials said...

Nice article with valuable information. Thanks for sharing.

AWS Training in Chennai | AWS Training institute in Chennai

360digitmg said...

Very nice job... Thanks for sharing this amazing and educative blog post!
artificial intelligence training institute in noida

frzn said...

The total number of tokens in this digital currency is 1,000 trillion ELON tokens. 50% of the total was sent to Vitalik Butrin (Creating Atrium Digital Currency) and the remaining 50% of the tokens were exchanged for ions. https://resalat-news.com/?p=55036 Fifty percent of the tokens sent to Vitalik as a symbol of peace were burned

360digitmg said...

Thank a lot. You have done excellent job. I enjoyed your blog . Nice efforts data science training in noida

Dev Kumar said...

We are really grateful for your blog post. You will find a lot of approaches after visiting your post. Great work thank you.
Cloud Computing Training in Bangalore

Professional Course said...

I am more curious to take an interest in some of them. I hope you will provide more information on these topics in your next articles.

Cyber Security Training in Bangalore

Professional Course said...

Happy to chat on your blog, I feel like I can't wait to read more reliable posts and think we all want to thank many blog posts to share with us.

Cyber Security Training in Bangalore

Anonymous said...

I want to leave a little comment to support and wish you the best of luck.we wish you the best of luck in all your blogging enedevors
data science course in noida

Technical Knowledge said...

I really enjoyed reading this post and keep up the good work and let me know when you can post more articles or where I can find out more on the topic.
Data Science Online Course

«Oldest ‹Older   201 – 251 of 251   Newer› Newest»

Popular Posts