Projects that are tagged with big data.

Logo SparklingGraph 0.0.6

by riomus - June 17, 2016, 14:49:46 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 3545 views, 691 downloads, 3 subscriptions

About: Large scale, distributed graph processing made easy.


Bug fixes, Graph generators

Logo OpenNN 3.0

by Sergiointelnics - February 11, 2016, 16:55:37 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 7231 views, 1261 downloads, 4 subscriptions

About: OpenNN is an open source class library written in C++ programming language which implements neural networks, a main area of deep learning research. The library has been designed to learn from both data sets and mathematical models.


New algorithms, correction of bugs, model selection algorithms.

Logo JMLR BudgetedSVM v1.1

by nemanja - February 12, 2014, 20:53:45 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4269 views, 801 downloads, 1 subscription

About: BudgetedSVM is an open-source C++ toolbox for scalable non-linear classification. The toolbox can be seen as a missing link between LibLinear and LibSVM, combining the efficiency of linear with the accuracy of kernel SVM. We provide an Application Programming Interface for efficient training and testing of non-linear classifiers, supported by data structures designed for handling data which cannot fit in memory. We also provide command-line and Matlab interfaces, providing users with an efficient, easy-to-use tool for large-scale non-linear classification.


Changed license from LGPL v3 to Modified BSD.

Logo MLlib 0.8

by atalwalkar - October 10, 2013, 00:56:25 CET [ Project Homepage BibTeX Download ] 4013 views, 757 downloads, 1 subscription

About: MLlib provides a distributed machine learning (ML) library to address the growing need for scalable ML. MLlib is developed in Spark (, a cluster computing system designed for iterative computation. Moreover, it is a component of a larger system called MLbase ( that aims to provide user-friendly distributed ML functionality both for ML researchers and domain experts. MLlib currently consists of scalable implementations of algorithms for classification, regression, collaborative filtering and clustering.


Initial Announcement on

Logo ClowdFlows 0.9

by janezkranjc - October 8, 2013, 02:57:49 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4014 views, 770 downloads, 1 subscription

About: ClowdFlows is a web based platform for service oriented data mining publicly available at . A web based interface allows users to construct data mining workflows that are hosted on the web and can be (if allowed by the author) accessed by anyone by following a URL of the workflow.


Initial Announcement on