BenchMarking Via Wekahttp://mloss.orgUpdates and additions to BenchMarking Via WekaenThu, 04 Dec 2008 01:15:15 -0000BenchMarking Via Weka 0.0.4<html><p>BenchMarking Via Weka is a client-server architecture that supports interoperability between different machine learning systems. Machine learning systems need to provide mechanisms for processing data and evaluating generated models. In our system, the server hosts all the data and performs all the statistical analyses, while the client performs all the pre-processing and model building. This separation of tasks opens up the possibility of offering a cross-platform and cross-language framework. By performing statistical analyses on the host, we avoid unnecessary exchange and conversion of generated results, and provide a simple mechanism for channelling results into an experiment database, such as [1]. </p> <p>We realize this architecture employing Weka [2] as the backend on the server-side. The clients are completely independent of the Weka machine learning workbench. This makes it easy to develop clients for any programming language, one only needs to implement the client-server communication protocol. So far, Java and Python clients have been implemented, offering command-line and GUI-based interfaces. GUI-based interfaces are important because it is a well-known shortcoming of many open-source systems that they suffer from usability issues [3]. </p> <p>At the moment, classification and regression analyses are supported within the system. Other open-source frameworks, like Weka through the Java client and mlpy [4] through the Python client, can be used. </p> <p>[1] Experiment Databases For Machine Learning, see </p> <p>[2] Ian H. Witten and Eibe Frank (2005) "Data Mining: Practical machine learning tools and techniques", 2nd Edition, Morgan Kaufmann, San Francisco, 2005. </p> <p>[3] The usability of open source software: analysis and prospects (2006), D.M. Nichols &amp; M.B. Twidale, in Open Source Software in Business - Issues and Perspectives, (ed.) Jain, R.K., Hyderabad, India: ICFAI University Press. 167-188. </p> <p>[4] Machine Learning Py (mlpy) is a high-performance Python/NumPy based package for machine learning, see </p></html>Peter Reutemann, Geoff HolmesThu, 04 Dec 2008 01:15:15 -0000 learningnips2008