Project details for Jstacs

Logo JMLR Jstacs 2.2

by keili - February 17, 2016, 11:57:56 CET [ Project Homepage BibTeX BibTeX for corresponding Paper Download ]

view ( today), download ( today ), 0 subscriptions

Description:

Sequence analysis is one of the major subjects of bioinformatics. Several existing libraries combine the representation of biological sequences with exact and approximate pattern matching as well as alignment algorithms. We present Jstacs, an open source Java library, which focuses on the statistical analysis of biological sequences instead. Jstacs comprises an efficient representation of sequence data and provides implementations of many statistical models with generative and discriminative approaches for parameter learning. Using Jstacs, classifiers can be assessed and compared on test datasets or by cross-validation experiments evaluating several performance measures. Due to its strictly object-oriented design Jstacs is easy to use and readily extensible.

Changes to previous version:

New classes and packages:

  • CorreationCoefficient: PerformanceMeasure
  • de.jstacs.clustering: package with classes for hierarchical clustering
  • DeBruijnGraphSequenceGenerator and DeBruijnSequenceGenerator for generating De Buijn sequences
  • CyclicSequenceAdaptor for representing cyclic sequences
  • PlotGeneratorResult for representing results that plot images to a Graphics2D object
  • TextResult for results that may be stored as text files
  • package de.jstacs.results.savers for generic classes that store results to disk
  • LimitedSparseLocalInhomogeneousMixtureDiffSM_higherOrder for sparse local inhomogeneous mixture (Slim) models
  • PFMWrapperTrainSM for representing position frequency matrices and position weight matrices from databases
  • package de.jstacs.tools with classes for generic Jstacs tools that may be used in different user interfaces (command line, Galaxy, JavaFX)
  • Compression for ZIP compression of Strings
  • package de.jstacs.utils.graphics with generic GraphicsAdaptor using Apache XML commons
  • projects: Dimont, GeMoMa, Slim, TALEN, motif comparison

New features and improvements:

  • Major restructuring of Alignment for better efficiency
  • Alignment Costs and StringAlignment now Storable
  • New constructor of DataSet allowing a specified percentage of sequences to mismatch the given alphabet
  • BioJavaAdapter ported to BioJava 1.9
  • XMLParser now also allows for storing Sequences
  • New method for parsing HMMer profile HMMs in HMMFactory
  • Several minor improvements and bugfixes in many classes
  • Improvements of documentation of several classes
BibTeX Entry: Download
Corresponding Paper BibTeX Entry: Download
Supported Operating Systems: Cygwin, Linux, Macosx, Windows, Unix, Agnostic, Solaris, Freebsd, Platform Independent
Data Formats: Plain Ascii, Fasta
Tags: Bioinformatics, R, Classification, Machine Learning, Bayesian Networks, Markov Random Fields, Supervised Learning, Em, Mixture Models, Java, Learning Principles, Probabilistic Models, Motif Discovery
Archive: download here

Comments

No one has posted any comments yet. Perhaps you'd like to be the first?

Leave a comment

You must be logged in to post comments.