Projects that are tagged with cuda.


Somoclu 1.6.2

by peterwittek - August 9, 2016, 14:30:34 CET. 18352 views, 3444 downloads, 3 subscriptions

About: Somoclu is a massively parallel implementation of self-organizing maps. It relies on OpenMP for multicore execution, MPI for distributing the workload, and it can be accelerated by CUDA on a GPU cluster. A sparse kernel is also included, which is useful for training maps on vector spaces generated in text mining processes. Apart from a command line interface, Python, R, and MATLAB are supported.

Changes:
  • Changed: In-place codebook updates when compiled without MPI. This improves update speed and substantially cuts memory use.
  • Changed: Compatible with Visual Studio 15.
  • Fixed: The BMUs returned after training were from before the last epoch. Now another round of BMU search is done.
  • Fixed: Training can continue on the same data in the Python wrapper.
  • Fixed: GPU memory allocation problem on Windows.

Theano 0.8.1

by jaberg - April 1, 2016, 19:22:01 CET. 26597 views, 4536 downloads, 3 subscriptions

About: A Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Dynamically generates CPU and GPU modules for good performance. Deep Learning Tutorials illustrate deep learning with Theano.
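
The define, optimize, evaluate workflow described above can be illustrated with a toy symbolic graph. This sketch is plain Python and deliberately does not use Theano's API: the classes `Var`, `Const`, `Add`, `Mul` and the helpers `optimize`, `count_nodes`, and `function` are hypothetical names invented here, and the only optimizations shown are constant folding and removal of `+ 0` and `* 1`.

```python
class Expr:
    """Base class: operators build a graph instead of computing a value."""

    def __add__(self, other):
        return Add(self, wrap(other))

    def __mul__(self, other):
        return Mul(self, wrap(other))


class Var(Expr):
    def __init__(self, name):
        self.name = name


class Const(Expr):
    def __init__(self, value):
        self.value = value


class Add(Expr):
    def __init__(self, left, right):
        self.left, self.right = left, right


class Mul(Expr):
    def __init__(self, left, right):
        self.left, self.right = left, right


def wrap(value):
    """Turn plain numbers into Const nodes."""
    return value if isinstance(value, Expr) else Const(value)


def optimize(e):
    """Fold constants and drop identity operations (x + 0, x * 1)."""
    if isinstance(e, (Var, Const)):
        return e
    left, right = optimize(e.left), optimize(e.right)
    is_add = isinstance(e, Add)
    if isinstance(left, Const) and isinstance(right, Const):
        value = left.value + right.value if is_add else left.value * right.value
        return Const(value)
    identity = 0 if is_add else 1
    if isinstance(left, Const) and left.value == identity:
        return right
    if isinstance(right, Const) and right.value == identity:
        return left
    return type(e)(left, right)


def count_nodes(e):
    """Number of nodes in the expression graph."""
    if isinstance(e, (Var, Const)):
        return 1
    return 1 + count_nodes(e.left) + count_nodes(e.right)


def function(inputs, output):
    """Optimize the graph once, then return a callable that evaluates it."""
    graph = optimize(output)
    names = [v.name for v in inputs]

    def run(node, env):
        if isinstance(node, Var):
            return env[node.name]
        if isinstance(node, Const):
            return node.value
        a, b = run(node.left, env), run(node.right, env)
        return a + b if isinstance(node, Add) else a * b

    return lambda *args: run(graph, dict(zip(names, args)))


x, y = Var("x"), Var("y")
expr = (x * 1 + 0) * (Const(2) + 3) + y   # define
f = function([x, y], expr)                # optimize once
print(count_nodes(expr), count_nodes(optimize(expr)), f(3, 4))  # evaluate
```

The point of the design is that the expression is a data structure first, so it can be rewritten before any numbers flow through it. A real library would go further, for example by generating compiled CPU or GPU code from the optimized graph.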

Changes:

Theano 0.8.1 (29th of March, 2016)

* Fix compilation on Mac with CLT 7.3

Theano 0.8 (21st of March, 2016)

We recommend that everyone upgrade to this version.

Highlights:

* Python 2 and 3 support with the same code base
* Faster optimization
* Integration of cuDNN for better GPU performance
* Many Scan improvements (execution speedup, ...)
* optimizer=fast_compile moves computation to the GPU.
* Better convolution on CPU and GPU (CorrMM, cuDNN, 3D convolution, more parameters)
* Interactive visualization of graphs with d3viz
* cnmem (better memory management on GPU)
* BreakpointOp
* Multi-GPU for data parallelism via Platoon (https://github.com/mila-udem/platoon/)
* More pooling parameters supported
* Bilinear interpolation of images
* New GPU back-end:

    * Float16 support (requires CUDA 7.5)
    * Multi dtypes
    * Multi-GPU support in the same process

deepdetect 0.1

by beniz - June 2, 2015, 09:25:28 CET. 1801 views, 477 downloads, 3 subscriptions

About: A Deep Learning API and server

Changes:

Initial Announcement on mloss.org.


MShadow 1.0

by antinucleon - April 10, 2014, 02:57:54 CET. 2479 views, 719 downloads, 1 subscription

About: Lightweight CPU/GPU matrix/tensor template library in C++/CUDA. It supports high-performance expansion of element-wise expressions. Code once, run smoothly on both GPU and CPU.

Changes:

Initial Announcement on mloss.org.


CXXNET 0.1

by antinucleon - April 10, 2014, 02:47:08 CET. 3191 views, 748 downloads, 1 subscription

About: CXXNET (pronounced "C plus plus net") is a neural network toolkit built on mshadow (https://github.com/tqchen/mshadow). It is yet another implementation of (convolutional) neural networks. It is written in C++, with about 1000 lines of network layer implementations, is easily configured via a config file, and can achieve state-of-the-art performance.

Changes:

Initial Announcement on mloss.org.