Project details for Somoclu

Somoclu 1.2

by peterwittek - December 17, 2013, 04:31:05 CET



Somoclu is a C++ tool for training self-organizing maps on large data sets on a high-performance cluster. It uses MPI to distribute the workload across the nodes of the cluster, and it can accelerate training with CUDA if graphics processing units are available. A sparse kernel is included, which is useful for high-dimensional but sparse data, such as the vector spaces common in text mining workflows. The code is released under the GNU GPLv3 licence.
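To illustrate why a sparse kernel helps with high-dimensional sparse data, here is a minimal sketch of a best-matching-unit search for one sparse input vector. It is not Somoclu's actual code: the `SparseEntry` type, the `best_matching_unit` function, and the row-major codebook layout are assumptions made for this example. The sketch expands the squared Euclidean distance as ||w||² - 2·w·x + ||x||², so the dot product only touches the nonzero entries of the input.

```cpp
#include <cassert>
#include <cstddef>
#include <limits>
#include <vector>

// One nonzero entry of a sparse input vector (hypothetical type for this sketch).
struct SparseEntry {
    std::size_t index;  // feature index, must be < dim
    double value;       // feature value
};

// Returns the index of the neuron whose weight vector has the smallest
// squared Euclidean distance to the sparse input x.
// Assumed layout: codebook is row-major, neuron n occupies [n*dim, (n+1)*dim).
std::size_t best_matching_unit(const std::vector<double>& codebook,
                               std::size_t dim,
                               const std::vector<SparseEntry>& x) {
    assert(dim > 0 && codebook.size() % dim == 0);
    const std::size_t n_neurons = codebook.size() / dim;

    // ||x||^2 needs only the nonzero entries.
    double x_norm2 = 0.0;
    for (const SparseEntry& e : x) {
        assert(e.index < dim);
        x_norm2 += e.value * e.value;
    }

    std::size_t best = 0;
    double best_dist = std::numeric_limits<double>::infinity();
    for (std::size_t n = 0; n < n_neurons; ++n) {
        const double* w = &codebook[n * dim];

        // ||w||^2 is a dense pass here; it does not depend on x, so a real
        // implementation could compute it once per neuron and reuse it.
        double w_norm2 = 0.0;
        for (std::size_t d = 0; d < dim; ++d) w_norm2 += w[d] * w[d];

        // w . x touches only the nonzero entries of x.
        double dot = 0.0;
        for (const SparseEntry& e : x) dot += w[e.index] * e.value;

        const double dist = w_norm2 - 2.0 * dot + x_norm2;
        if (dist < best_dist) {
            best_dist = dist;
            best = n;
        }
    }
    return best;
}
```

With a vocabulary of tens of thousands of terms and only a few hundred nonzero entries per document, the per-input cost of the dot product scales with the number of nonzeros rather than with the full dimension.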

Key features:
  • Fast execution by parallelization: OpenMP, MPI, and CUDA are supported.
  • Planar and toroid maps.
  • Both dense and sparse input data are supported.
  • Large emergent maps of several hundred thousand neurons are feasible.
  • Integration with Databionic ESOM Tools.
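The difference between planar and toroid maps comes down to how the distance between two neurons on the grid is measured. The following is a minimal sketch under stated assumptions, not Somoclu's implementation: the function name `grid_distance` and its parameters are hypothetical, and a rectangular grid with Euclidean grid distance is assumed. On a toroid map the grid wraps around in both directions, so each axis uses the shorter of the direct and the wrapped offset.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>

// Euclidean distance between neurons (x1, y1) and (x2, y2) on a
// cols x rows rectangular grid. With toroid == true the grid wraps
// around at the edges, so opposite borders are neighbours.
double grid_distance(std::size_t x1, std::size_t y1,
                     std::size_t x2, std::size_t y2,
                     std::size_t cols, std::size_t rows,
                     bool toroid) {
    assert(x1 < cols && x2 < cols && y1 < rows && y2 < rows);

    // Absolute offsets, computed without unsigned underflow.
    std::size_t dx = x1 > x2 ? x1 - x2 : x2 - x1;
    std::size_t dy = y1 > y2 ? y1 - y2 : y2 - y1;

    if (toroid) {
        // Take the shorter way around each axis.
        dx = std::min(dx, cols - dx);
        dy = std::min(dy, rows - dy);
    }
    return std::hypot(static_cast<double>(dx), static_cast<double>(dy));
}
```

On a 10 x 10 grid, for example, neurons (0, 0) and (9, 0) are 9 cells apart on a planar map but direct neighbours on a toroid map, which removes the border effects that planar maps exhibit.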

Changes to previous version:
  • Massive improvements in OpenMP parallelization.
  • MPI libraries are no longer mandatory.
  • Best matching units are saved.
  • Option for specifying an initial codebook for the map.
  • ESOM .lrn input format added.
  • Parsing of white-space characters corrected.
  • Long-named command line switches for specifying SOM dimensions.
  • Fine-grained control of which interim files to save across epochs.
  • Option in Makefile for building shared library.
Supported Operating Systems: Linux, OS X
Data Formats: ASCII, LIBSVM, ESOM
Tags: CUDA, Self-Organizing Maps, MPI

