Hivemall
http://mloss.org
Updates and additions to Hivemall
en
Fri, 13 Mar 2015 17:08:22 -0000

Hivemall 0.3
<html><p><a href="">Hivemall</a> provides machine learning and feature engineering functions as Hive UDFs, UDAFs, and UDTFs. It is designed to scale with both the number of training instances and the number of training features. </p> <p>We believe Hivemall is much easier to use and more scalable than Mahout for classification and regression tasks, but please judge for yourself. If you already have a Hive environment, you can evaluate Hivemall in about five minutes. </p> <p>Hivemall is easy to use because every machine learning step is expressed in HiveQL. </p> <p>-- Installation takes two statements:<br /> <code>add jar /tmp/hivemall.jar; source /tmp/define-all.hive;</code> </p> <p>-- Logistic regression is performed by a single query:<br /> <code>SELECT feature, avg(weight) as weight FROM ( SELECT logress(features,label) as (feature,weight) FROM training_features ) t GROUP BY feature;</code> </p> <p>You can find detailed examples on <a href="">our wiki pages</a>. </p></html>
Makoto Yui
Fri, 13 Mar 2015 17:08:22 -0000
Tags: learning, matrix factorization, logistic regression, multiclass classification, recommendation, hadoop, hive, passive aggressive, confidence weighted, adagrad
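
The shape of the logistic regression query above (an inner `logress(...)` call that emits `(feature, weight)` rows, wrapped in `avg(weight) ... GROUP BY feature`) can be read as train-per-task, then average. The Python sketch below is only an illustration of that reading, not Hivemall's implementation: it assumes each task runs plain SGD over its own slice of `training_features`, and the names `train_partition`, `average_by_feature`, the learning rate `eta`, and the toy data are all hypothetical.

```python
import math
from collections import defaultdict


def train_partition(rows, eta=0.1):
    """Run one SGD pass of logistic regression over a single partition.

    rows is a list of (features, label) pairs, where features maps a
    feature name to its value and label is 0 or 1. Returns the learned
    (feature, weight) pairs, mimicking the rows the inner query emits.
    """
    weights = defaultdict(float)
    for features, label in rows:
        z = sum(weights[f] * v for f, v in features.items())
        p = 1.0 / (1.0 + math.exp(-z))   # predicted probability
        error = label - p                # gradient of the log-likelihood
        for f, v in features.items():
            weights[f] += eta * error * v
    return list(weights.items())


def average_by_feature(emitted):
    """Equivalent of: SELECT feature, avg(weight) ... GROUP BY feature."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for feature, weight in emitted:
        totals[feature] += weight
        counts[feature] += 1
    return {f: totals[f] / counts[f] for f in totals}


# Two hypothetical partitions holding the same rows in a different order,
# so each task learns slightly different weights.
partitions = [
    [({"x1": 1.0, "bias": 1.0}, 1), ({"x2": 1.0, "bias": 1.0}, 0)],
    [({"x2": 1.0, "bias": 1.0}, 0), ({"x1": 1.0, "bias": 1.0}, 1)],
]

emitted = [pair for part in partitions for pair in train_partition(part)]
model = average_by_feature(emitted)
```

Here `model` plays the role of the query's result set: one averaged weight per feature, merged from the weights learned independently on each partition.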