= Satellite meeting for Sequence analysis tools from the next generation sequencer = == Topics == To share our knowledges about sequence analysis tools. Please write what you know about the tools. Also see [http://www.oxfordjournals.org/our_journals/bioinformatics/nextgenerationsequencing.html Bioinformatics for Next Generation Sequencing virtual issue] -- yaskaz@ddbj (thanks to ichan) and [http://www.nature.com/nbt/journal/v26/n10/fig_tab/nbt1486_T3.html Table 3 of Jay Shendure & Hanlee Ji., Next-generation DNA sequencing, Nature Biotech.] (thanks to hinaichigo) == Mapping tools == Maq[[BR]] ELAND[[BR]] [http://bowtie-bio.sourceforge.net/index.shtml BOWTIE] Ultra-short reads, Burrows-Wheeler index. [[BR]] Lasergene[[BR]] [http://www.454.com/products-solutions/analysis-tools/gs-reference-mapper.asp GSMapper] * Map reads to any reference genome and generate a consensus sequence. * Easily view all differences compared to the reference sequence with automatic output to separate files: Insertions (blocks up to 50 bases), Deletions (blocks up to 50 bases), SNPs * Quickly identify high confidence difference compared to the reference genome, which are singled out in a separate file. * Compare large, complex genomes of any size including: resequencing of whole genomes from humans, plants, yeasts, bacteria, fungi, viruses, YACs, BACs, fosmids * Data outputs: fna.file (sequence of contigs, FASTA format), qual.file (corresponding Phred equivalent quality score), ace.file (consensus alignment of the reads against a given reference sequence) [[BR]] [http://compbio.cs.toronto.edu/shrimp/ SHRiMP] supports letter-space (454/Solexa), color-space (Solid) and Helicos 2-pass space. Spaced-seeds followed by S-W. [[BR]] [http://bx.psu.edu/miller_lab/dist/README.lastz-1.01.50/README.lastz-1.01.50.html Lastz] Blast(z) variant. Has parameter settings tuned for 454 and Solexa mapping for both reference based assembly and variant discovery. [[BR]] EMBOSS == Assemble tools == [http://www.454.com/products-solutions/analysis-tools/gs-de-novo-assembler.asp GSAssembly] * Performs whole genome shotgun assembly of genomes with or without paired-end data. * Order contigs into scaffolds using supported paired-end reads. * Assemble larger, more complex genomes up to 400 megabases in size with a 64-bit assembler or up to 20 megabases with a 32-bit assembler. * Co-assemble with Sanger Sequencing reads. * Select the read files you want to assemble by browsing in the GUI. * Data outputs: fna.file (sequence of contigs, FASTA format), qual.file (corresponding Phred equivalent quality score), ace.file (alignment of the reads to contig sequence) [[BR]] Velvet[[BR]] EMBOSS == Date == == Room == == Presentations == == Notes == Every new software must handle this "huge" amount of data. Some veriations SNP, deletion, insertion can by identified during mapping/assembly process, other things more "new biology" needs more work by bionformaticians. Sopport from biologists is required in order to discover more new things. == Results == == TODOs ==