In 2017, we compared nearly 8,800 open source machine learning projects and selected the top 30 (0.3% chance).

This is a very competitive ranking, and I have handpicked the best open source machine learning libraries, datasets and applications released between January and December 2017. Mybridge AI assesses the quality of these entries by taking into account metrics such as popularity, engagement and recency. To give you an idea of the quality, the average number of GitHub stars is 3,558.

Open source projects are very useful for data scientists. You can learn by reading the source code and then build your own on top of existing projects. Now let’s get easy on the machine learning projects you might have missed over the past year.

List of recommended study books:

A) Neural networks

 

Deep Learning A-Z: Hands-on Artificial Neural Networks

[68,745 recommendations, 4.5/5 stars]

B) TensorFlow

 

The Complete Guide to TensorFlow for Deep Learning with Python

[17,834 recommendations, 4.6/5 stars]

 

Open Source Project Leaderboards

First place

FastText: FastText representation/classification library.

[11,786 stars on Github] courtesy of Facebook Research

 

[Muse: Multilingual unsupervised or supervised word embedding based on fast Text. 695 stars on Github]

second

Deep-photo-style Transfer: Code and data for the paper “Deep Photo Style Transfer”.

[9747 stars on Github] courtesy of Dr. Fujun Luan, Cornell University

 

Third,

Face Recognition: The simplest Python command-line face recognition API.

[8672 stars on Github] courtesy of Adam Geitgey

 

4,

Magenta: Machine Intelligent music and art generator.

[There are 8,113 stars on Github]

 

fifth

Sonnet: A neural network library based on TensorFlow.

[5731 stars on Github]. Courtesy of Deepmind’s Malcolm Reynolds

 

6

Deeplearn.js: a hardware-accelerated machine intelligence library for the Web.

[5,462 stars on Github]. Courtesy Nikhil Thorat of Google Brain

 

seventh

Fast Style Transfer: Fast Style Transfer for TensorFlow.

[4843 stars on Github] courtesy of Logan Engstrom, MIT

 

eighth

Pysc2: StarCraft II learning environment.

[3,683 stars on Github] courtesy of DeepMind’s Timo Ewalds

 

The ninth

AirSim: open source simulator based on Unreal Engine for Microsoft AI&Research autonomous vehicles.

[3,861 stars on Github]. Provided by Shital Shah of Microsoft

 

10th

Facets: Machine learning data set visualization tool.

[3371 stars on Github] courtesy of Google Brain

 

11th

Style2Paints: AI image colorization.

[There are 3,310 stars on Github]

 

12th

Tensor2Tensor: A library of tools for generalized sequence-sequence models

[3087 stars on Github] courtesy of Ryan Sepassi of Google Brain

 

13th

CycleGAN and Pix2pix in PyTorch: Image-to-image conversion tools based on PyTorch (e.g. Horse2zebra, Edges2Cats, etc.).

[2847 stars on Github] contributed by Dr. Jun-yan Zhu, Berkeley

 

14th

Faiss: library for efficient similarity search and clustering of dense vectors.

[2,629 stars on Github]. Courtesy of Facebook Research

 

15

Fashion-mnist: A Fashion product data set similar to MNIST.

[2,780 stars on Github] courtesy of Zalando Tech research scientist Han Xiao

 

16

ParlAI: A framework for training and evaluating artificial intelligence models on a variety of publicly available conversational data sets.

[2,578 stars on Github]. Here’s Alexander Miller from Facebook Research

 

17

Fairseq: Facebook AI Research Sequence – a toolkit for sequences.

[There are 2,571 stars on Github]

 

18th

Pyro: Deep general-purpose probabilistic programming using Python and PyTorch.

[2387 stars on Github] courtesy of Uber Engineering

 

19 in

IGAN: Interactive image generator supported by GAN.

[There are 2,369 stars on Github]

 

20

Deep-image-prior: the neural network is used for image restoration without learning the neural network.

[2188 stars on Github] by Dmitry Ulyanov, Ph.D. Skoltech

 

21,

Face_classification: Real-time face detection and emotion/gender classification using fer2013/ IMDB datasets with KerAS CNN model and openCV.

[1967 star on Github]

 

22nd

Speech-to-text-wavenet: End-to-end sentence-level English Speech recognition using DeepMind’s WaveNet and TensorFlow.

[1961 star on Github] courtesy of Namju Kim of Kakao Brain

 

23

StarGAN: Unified generative adversarial network for multi-domain image-image transformations.

[1954 stars on Github] by Yunjey Choi, Korea University

 

24th

Ml-agents: Unity machine learning agent.

Deep Learning at Unity3D by Unity Juliani

 

25th

DeepVideoAnalytics: Distributed visual search and visual data analysis platform.

[1494 stars on Github] by Akshay Bhat, PhD, Cornell University

 

26

OpenNMT: Open source neural machine translation on Torch.

[There are 1,490 stars on Github]

 

27,

Pix2pixHD: Use conditional GANs to synthesize and process 2048×1024 resolution images.

[1283 stars on Github] courtesy of Nvidia AI Research scientist Ming-Yu Liu

 

28th

Horovod: Distributed training framework for TensorFlow.

[1188 stars on Github] courtesy of Uber Engineering

 

29

Ai-blocks: A powerful and intuitive WYSIWYG interface that allows anyone to create machine learning models.

[899 stars on Github]

 

30th

Voice Conversion with non-Parallel Data: Deep neural network speech Conversion (or speech style Conversion) based on Tensorflow.

[845 stars on Github] contributed by Dabi Ahn of AI Kakao Brain

 

 

This article is recommended by Beijing Post @ Love coco – Love life teacher, translated by Ali Yunqi Community organization.

30 Amazing Machine Learning Projects for the Past Year (V. 2018)

Author: Mybridge

Translator: Altman, edited by Yuan Hu.

The article is a brief translation. For more details, please refer to the original text