Projects that are tagged with cuda.


Logo Somoclu 1.4

by peterwittek - September 5, 2014, 13:01:14 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 4494 views, 847 downloads, 2 subscriptions

About: Somoclu is a massively parallel implementation of self-organizing maps. It relies on OpenMP for multicore execution, MPI for distributing the workload, and it can be accelerated by CUDA on a GPU cluster. A sparse kernel is also included, which is useful for training maps on vector spaces generated in text mining processes.

Changes:
  • Better Windows support.
  • Completed CUDA support for Python and R interfaces.
  • Faster compilation by removing unnecessary flags for nvcc
  • Support for CUDA 6.5.
  • Bug fixes: R version no longer needs separate code.

Logo MShadow 1.0

by antinucleon - April 10, 2014, 02:57:54 CET [ Project Homepage BibTeX Download ] 789 views, 220 downloads, 1 subscription

About: Lightweight CPU/GPU Matrix/Tensor Template Library in C++/CUDA. Support element-wise expression expand in high performance. Code once, run smoothly on both GPU and CPU

Changes:

Initial Announcement on mloss.org.


Logo CXXNET 0.1

by antinucleon - April 10, 2014, 02:47:08 CET [ Project Homepage BibTeX Download ] 892 views, 217 downloads, 1 subscription

About: CXXNET (spelled as: C plus plus net) is a neural network toolkit build on mshadow(https://github.com/tqchen/mshadow). It is yet another implementation of (convolutional) neural network. It is in C++, with about 1000 lines of network layer implementations, easily configuration via config file, and can get the state of art performance.

Changes:

Initial Announcement on mloss.org.


Logo Theano 0.6

by jaberg - December 3, 2013, 20:32:02 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 13216 views, 2467 downloads, 1 subscription

About: A Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Dynamically generates CPU and GPU modules for good performance. Deep Learning Tutorials illustrate deep learning with Theano.

Changes:

Theano 0.6 (December 3th, 2013)

Highlight:

* Last release with support for Python 2.4 and 2.5.
* We will try to release more frequently.
* Fix crash/installation problems.
* Use less memory for conv3d2d.

0.6rc4 skipped for a technical reason.

Highlights (since 0.6rc3):

* Python 3.3 compatibility with buildbot test for it.
* Full advanced indexing support.
* Better Windows 64 bit support.
* New profiler.
* Better error messages that help debugging.
* Better support for newer NumPy versions (remove useless warning/crash).
* Faster optimization/compilation for big graph.
* Move in Theano the Conv3d2d implementation.
* Better SymPy/Theano bridge: Make an Theano op from SymPy expression and use SymPy c code generator.
* Bug fixes.

Too much changes in 0.6rc1, 0.6rc2 and 0.6rc3 to list here. See https://github.com/Theano/Theano/blob/master/NEWS.txt for details.