MetaAnnotator
MetaAnnotator is a novel software for binning and annotating short paired-end reads.
(1) Use a novel annotating method;
(2) Incorporate different reference databases;
(2) Annotate more accurately, comparing to existing tools e.g MEGAN5, TANGO and MetaCluster-TA;
(4) Cost much less time.
MetaCluster-TA
MetaCluster-TA is a new software for binning and annotating short paired-end reads.
(1) improved clustering/binning performance, comparing to MetaCluster5.0;
(2) more annotated reads and more accurate annotation, comparing to MEGAN4;
*Usage information can be found in README file in the package(see in download tab).
MetaCluster 5.0
MetaCluster5.0 is an unsupervised binning method that can
(1) samples with low-abundance species, or
(2) samples (even with high-abundance) with many extremely-low-abundance species.
Release Package Information
The version 5.0.1 release
package contains the C++ source code of "MetaCluster 5.0 version". Download from
here.
Runtime Environment
This software is suitable for all unix-like 64-bit system with gcc installed.
It is not necessary but highly recommanded to install openmp for muilt-cpu machine.
Compile
Code
1.Extract the package.
2.Use the command :
$ ./configure
$ make
3.After compiling, the tools 'MetaCluster5_1' & 'MetaCluster5_2' are in the directory: ./bin
Command Format
Please refer to the README file in the package.
MetaCluster 4.0.6 version
MetaCluster4.0 is an unsupervised binning method that can
(1) deal with short reads(75bp),
(2) deal with species of similar abundance ratios as well as extreme ratios,
(3) determine the number of species automatically, and
(4) have acceptable performance even when the species number is much larger than 10.
Release Package Information
The version 4.0.6 release
package contains the C++ source code of "MetaCluster 4.0.6". Download from
here.
Runtime Environment
This software is suitable for all unix-like 64-bit system with gcc installed.
It is not necessary but highly recommanded to install openmp for muilt-cpu machine.
Compile
Code
1.Extract the package.
2.Use the command :
$ ./configure
$ make
3.After compiling, the tool 'MetaCluster4' is in the directory: ./bin
Command Format
The typical command format of MetaCluster 4 is:
MetaCluster4 input-file [options]
For the details of [options], please refer to the README file in the package.
Input & Output
The input of MetaCluster 4 should be in fasta format.
Every odd-number read and its next read are supposed to be pair-end reads.
The default format output file is FATSA file with binning-label, e.g.:
Cluster : 0
>read2_35/1
>read2_35/2
>read2_96/1
>read2_96/2
Cluster : 1
>read1_10/1
>read1_10/2
>read1_55/1
>read1_55/2
MetaCluster 3.0
MetaCluster 3.0 is an unsupervised binning method for
metagenomic sequences.Existing binning methods based on sequence similarity
and sequence composition markers rely heavily on the reference genomes of
known microorganisms and phylogenetic markers. While MetaCluster 3.0 is an
integrated binning method based on the unsupervised top-down separation and
bottom-up merging strategy. It can bin metagenomic sequencing datasets with
mixed complex species abundance ratios from the exactly equal situation to
the extremely unbalance situation with consistently higher accuracy when
compared with other recently reported methods.
Release Package Information
The version 3.0 release
package contains the C++ source code of "MetaCluster 3.0". Download from
here.
Runtime Environment
64-bit processor and 1G or
more memory is highly recommended for better performance or solving large
dataset. MetaCluster 3.0 is developed and tested under Ubuntu 10.04 64bit
version and Debian 6.0 64bit version and should be suitable for all Linux
environments.
Compile
Code
1.Decompress the package.
2.Use the command :
#make
So that
files will be compiled.
3.After compiling, the tools 'metaCluster-3.0' is
in the directory: /bin
Command Format
The typical command format of MetaCluster 3.0 is:
metacluster input-file
[--threshold]
input-file:
The input file you want to bin. This is the compulsory parameter.
[--threshold]:
The threshold for
the bottom-up merge step. The default value for t is 0.9.
Input & Output
The input of MetaCluster 3.0 should be a sequencing dataset, say test.fa, contains several DNA fragments with FASTA format, e.g.
>read0001
TACGCGGACTTCGATGCG
GCTAGGCTGACG
>read0002
GCCTGCGAAGCGCGATTT
AGCTGAGGCGTC
The default format output files only contain the FATSA comments, e.g.:
>read0001
>read0002
Samples
metaCluster-3.0
test.fa
metaCluster-3.0 test.fa --threshold 0.83
Copyright 2010-2014, Department of Computer Science, The University of Hong Kong. & Mr. WANG YI
MetaAnnotator beta version is released!
If you have any questions, please contact ywang@cs.hku.hk