Projects that are tagged with random projections.


Logo gensim 0.8.6

by Radim - December 9, 2012, 13:15:16 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ] 61672 views, 14024 downloads, 0 subscriptions

About: Python Framework for Vector Space Modelling that can handle unlimited datasets (streamed input, online algorithms work incrementally in constant memory).

Changes:
  • added the "hashing trick" (by Homer Strong)
  • support for adding target classes in SVMlight format (by Corrado Monti)
  • fixed problems with global lemmatizer object when running in parallel on Windows
  • parallelization of Wikipedia processing + added script version that lemmatizes the input documents
  • added class method to initialize Dictionary from an existing corpus (by Marko Burjek)