|
About: The SHOGUN machine learning toolbox's focus is on large scale learning methods with focus on Support Vector Machines (SVM), providing interfaces to python, octave, matlab, r and the command line. Changes:This release also contains several enhancements, cleanups and bugfixes: Features
Bugfixes
Cleanup and API Changes
|
|
About: FABIA is a biclustering algorithm that clusters rows and columns of a matrix simultaneously. Consequently, members of a row cluster are similar to each other on a subset of columns and, analogously, members of a column cluster are similar to each other on a subset of rows. Biclusters are found by factor analysis where both the factors and the loading matrix are sparse. FABIA is a multiplicative model that extracts linear dependencies between samples and feature patterns. Applications include detection of transcriptional modules in gene expression data and identification of haplotypes/>identity by descent< consisting of rare variants obtained by next generation sequencing. Changes:CHANGES IN VERSION 2.4.0
CHANGES IN VERSION 2.3.1NEW FEATURES 2.0.0:
1.4.0:
|
|
About: A Java framework for statistical analysis and classification of biological sequences Changes:February 2, 2012: Jstacs 2.0 released Jstacs 2.0 changes many names and the structure of several packages. It is not code-compatible with Jstacs 1.5 and earlier RESTRUCTURING and RENAMING: former ScoringFunction, NormalizableScoringFunction, Model
Parameters and Results
performance measures
further changes
NEW FUNCTIONALITY:
BUGFIXES/IMPROVEMENTS:
DOCUMENTATION:
MISC:
|
|
About: MATLAB toolbox for advanced Brain-Computer Interface (BCI) research. Changes:Initial Announcement on mloss.org.
|
|
About: Tools for functional network analysis. Changes:Initial Announcement on mloss.org.
|
|
About: Epistatic miniarray profiles (E-MAPs) are a high-throughput approach capable of quantifying aggravating or alleviating genetic interactions between gene pairs. The datasets resulting from E-MAP experiments typically take the form of a symmetric pairwise matrix of interaction scores. These datasets have a significant number of missing values - up to 35% - that can reduce the effectiveness of some data analysis techniques and prevent the use of others. This project contains nearest neighbor based tools for the imputation and prediction of these missing values. The code is implemented in Python and uses a nearest neighbor based approach. Two variants are used - a simple weighted nearest neighbors, and a local least squares based regression. Changes:Initial Announcement on mloss.org.
|
|
About: Implementation of LSTM for biological sequence analysis (classification, regression, motif discovery, remote homology detection). Additionally a LSTM as logistic regression with spectrum kernel is included. Changes:Spectrum LSTM package included
|
|
About: Accurate splice site predictor for a variety of genomes. Changes:Asp now supports three formats: -g fname for gff format -s fname for spf format -b dir for a binary format compatible with mGene. And a new switch -t which switches on a sigmoid-based transformation of the svm scores to get scores between 0 and 1.
|
|
About: Investigation of dependencies between multiple data sources allows the discovery of regularities and interactions that are not seen in individual data sets. The demand for such methods is increasing with the availability and size of co-occurring observations in computational biology, open data initiatives, and in other domains. We provide practical, open access implementations of general-purpose algorithms that help to realize the full potential of these information sources. Changes:Three independent modules (drCCA, pint, MultiWayCCA) have been added.
|
|
About: BACKGROUND:Over the last decade several prediction methods have been developed for determining the structural and functional properties of individual protein residues using sequence and sequence-derived information. Most of these methods are based on support vector machines as they provide accurate and generalizable prediction models. RESULTS:We present a general purpose protein residue annotation toolkit (svmPRAT) to allow biologists to formulate residue-wise prediction problems. svmPRAT formulates the annotation problem as a classification or regression problem using support vector machines. One of the key features of svmPRAT is its ease of use in incorporating any user-provided information in the form of feature matrices. For every residue svmPRAT captures local information around the reside to create fixed length feature vectors. svmPRAT implements accurate and fast kernel functions, and also introduces a flexible window-based encoding scheme that accurately captures signals and pattern for training eective predictive models. CONCLUSIONS:In this work we evaluate svmPRAT on several classification and regression problems including disorder prediction, residue-wise contact order estimation, DNA-binding site prediction, and local structure alphabet prediction. svmPRAT has also been used for the development of state-of-the-art transmembrane helix prediction method called TOPTMH, and secondary structure prediction method called YASSPP. This toolkit developed provides practitioners an efficient and easy-to-use tool for a wide variety of annotation problems. Availability: http://www.cs.gmu.edu/~mlbio/svmprat/ Changes:Initial Announcement on mloss.org.
|
|
About: SeqAn is an open source C++ library of efficient algorithms and data structures for the analysis of sequences with the focus on biological data. Changes:
|
|
About: The Easysvm package provides a set of tools based on the Shogun toolbox allowing to train and test SVMs in a simple way. Changes:Fixes for shogun 0.7.3.
|
|
About: ARTS is an accurate predictor for Transcription Start Sites (TSS). Changes:Initial Announcement on mloss.org.
|
|
About: *binsdfc* is a command line implementation of the algorithm described in [Endres,Oram,Schindelin,Foldiak:*Bayesian binning beats approximate alternatives: estimating peri-stimulus time histograms*, [...] Changes:Changed build system from automake to cmake. Moved download page to www.compsens.uni-tuebingen.de
|
|
About: Nested Effects Models (NEMs) are a class of directed graphical models originally introduced to analyze the effects of gene perturbation screens with high-dimensional phenotypes. In contrast to other [...] Changes:Initial Announcement on mloss.org.
|
|
About: The Sleipnir C++ library implements a variety of machine learning and data manipulation algorithms focusing on heterogeneous data integration and efficiency for large biological data collections. Changes:Initial Announcement on mloss.org.
|
|
About: For modern biology, precise genome annotations are of prime importance as they allow the accurate definition of genic regions. We employ state of the art machine learning methods to assay and [...] Changes:Initial Announcement on mloss.org.
|
|
About: Local alignment kernels measure the similarity between two sequences by summing up scores obtained from local alignments with gaps of the sequences. Changes:Initial Announcement on mloss.org.
|
|
About: PALMA computes the optimal spliced alignment of a mRNA sequence to a genomic sequence. The main python script takes two FASTA files containing the target (e.g. a DNA sequence, part of the genome) [...] Changes:Initial Announcement on mloss.org.
|


