Software Community: Data Analysis Resources for Scientists
& Developers
Are you a life scientist interested in applications to analyze and manage the vast amounts of data generated by the SOLiD™ System? Are you an independent software vendor or academic researcher interested in developing software for next-generation sequencing? In this site, you will find resources to help, including:
- Academic and open-source software
- Commercial software
- Sample data sets
- Technical documentation
- Data conversion tools
- Become a member of the SOLiD™ Software Development Community and receive regular updates by registering here.
Data Analysis Software, Data Sets, and More
Application-Specific Software
Chromatin Immunoprecipitation Sequencing (ChIP-Seq)
- ChIP-Seq File Formating Tool
Enables you to convert result files generated by aligning SOLiD reads to reference sequences using SOLiD software into a format compatible with most available ChIP-seq analysis tools. Available as source code.
- MACS
Enables you to convert result files generated by aligning SOLiD reads to reference sequences using SOLiD software into a format compatible with most available ChIP-seq analysis tools. Available as source code.
- QuEST
A Kernel Density Estimator-based package for analyzing massively parallel sequencing data from chromatin immunoprecipitations (ChIP-Seq or ChIPseq).
Gene Expression
- SOLiD™ SAGE™ Analysis Software
A tool for taking the raw sequence data files from SOLiD™ SAGE™ reads and matching them to known sequences in your reference database of choice.
De Novo Sequencing
- SOLiD™ Accuracy Enhancement Tool (SAET)
A tool that uses raw data generated by the SOLiD™ system to correct miscalls within reads either prior to mapping or contig assembly.
- Velvet
De novo of short reads (EMBL-EBI)
- ADIR
ADiR is an assembler for AB SOLiD™ colorspace reads that makes use of color transition properties in resolving overlap and consensus.
Resequencing
- SOLiD™ System Analysis Pipeline Tool
Allows you to map color space reads to large or small genomes, to place and annotate paired reads, and to call SNPs. Available as source code.
- AB Small InDel Tool
This stand-alone tool enables you to find small indels in SOLiD™ data.
- AB Inversion Tool
This stand-alone tool enables you to detect inversions in SOLiD™ data.
- AB Large InDel Tool
This stand-alone tool enables you to find large indels in SOLiD™ data.
- AB CNV Tool
This tool is designed to detect copy number variation using SOLiD™ data from a single human sample mapped to the human reference sequence.
- AB SNP Finder Tool (diBayes)
The diBayes package is the tool to identify SNPs from mapped and processed SOLiD™ System color space reads.
- CNV-seq
CNV-seq is a new method to detect DNA copy number variation (CNV) using high-throughput sequencing. This method was published in BMC Bioinformatics 2009, 10:80.
Small RNA Analysis
- Small RNA Analysis Pipeline Tool
A small RNA SOLiD™ System Analysis Pipeline for human whole genome alignment of small RNA sequencing data with tag counting features. Available as source code.
Whole Transcriptome Analysis
- AB WT Analysis Pipeline
Allows you to align transcriptome reads to a reference genome, tag counting for exons and genes, and output data in base space.
- RNA-MATE
A computational pipeline which provides automatic and integrated way to align color- space sequencing data, collate this information and generate files for examining gene-expression data in a genomic context.
General Software (for Multiple Applications)
Data Analysis Tools
- Galaxy
Galaxy allows you to analyze multiple alignments, compare genomic annotations, profile metagenomic samples, and much much more—without the need to install or download anything. For developers, it is an open-source, scalable framework for tool and data integration.
Data Conversion Tools
- General Feature Format (GFF) Conversion Tool
Enables you to consolidate reference-aligned SOLiD™ data into a standard format (General Feature Format). Available as source code.
- Sequence Read Format Conversion Tool
Enables you to convert native SOLiD™ data into a community-driven standard (Short Read Format) for NCBI submission. Available as source code.
- SOLiD™ BaseQV Tool
Tool for the conversion of SOLiD™ output files to base sequences data with associated quality values.
Genome Browsers
- SOLiD™ Alignment Browser
A genome annotation viewer and editor which is based on the Apollo Genome Annotation Curation Tool.
- UCSC Genome Broswer
The Genome Browser zooms and scrolls over chromosomes, showing the work of annotators worldwide.
Mapping Tools
- Color Space Mapping Tool
Enables you to map SOLiD™ color space reads to whole human genomes. Available as source code.
- SHRiMP
The SHort Read Mapping Package.
- BFAST
Enables you to map short reads to reference sequences quickly and accurately.
- SOCS
This software uses iterative mismatch tolerances to speed up high tolerance mapping. It finds optimal alignments for each read across one or more reference files. It also provides maps of sequence census and isolated mismatches.
NOTICE TO USER | TERMS AND CONDITIONS | DISCLAIMERS
The Applied Biosystems resources provided here are all unsupported software development tools. They are not validated products and are provided "as is" and without warranty. Without limiting the foregoing, your use of such resources is subject to any applicable license or terms of use.
Links to third-party Web sites and any resources or products that they may provide are intended for convenience only and do not imply endorsement or approval by Applied Biosystems and we neither support nor make any warranty for such resources. Applied Biosystems is not responsible for their contents. All queries and end-user support requests regarding third-party resources and products should be addressed to the relevant third party. Applied Biosystems is not responsible for providing any end-user support.
The use of all the resources provided herein and any associated materials is at user's own risk, without recourse to Applied Biosystems. Your use of any resources provided herein, including the software tools, is conditioned on your acceptance of the above terms, including the applicable license terms, if any, and your use of any of the resources herein indicate your understanding and acceptance of these terms.





