mloss.org Loomhttp://mloss.orgUpdates and additions to LoomenThu, 19 Mar 2015 19:22:03 -0000Loom 0.2.10http://mloss.org/software/view/599/<html><p>Loom is a streaming inference and query engine for the Cross-Categorization model. </p> <p>Data Types </p> <p>Loom learns models of sparse heterogeneous tabular data, with hundreds of features and millions of rows. Loom currently supports the following feature types and models: </p> <ul> <li> boolean fields as Beta-Bernoulli </li> <li> categorical fields with up to 256 values as Dirichlet-Discrete </li> <li> unbounded categorical fields as Dirichlet-Process-Discrete </li> <li> count fields as Gamma-Poisson </li> <li> real fields as Normal-Inverse-Chi-Squared-Normal </li> <li> sparse real fields as mixture of degenerate and dense real </li> <li> text and keyword fields as booleans for word absence/presence </li> <li> date fields as a combination of absolute, relative, and cyclic parts </li> <li> optional fields as a boolean plus one of the above feature models </li> </ul> <p>Data Scale </p> <p>Loom targets tabular datasets of sizes 100-1000 columns x 10^3-10^9 rows. To handle large datasets, loom implements subsample annealing with an accelerating annealing schedule and adaptively turns off ineffective inference strategies. Loom's annealing schedule is tuned to learn 10^8 cell datasets in under an hour and 10^10 cell datasets in under a day (depending on feature type and sparsity). </p></html>Fritz Obermeyer, Jonathan Glidden, Eric JonasThu, 19 Mar 2015 19:22:03 -0000http://mloss.org/software/rss/comments/599http://mloss.org/software/view/599/mcmcbayesiannonparametric