FastGroupII: A Web-based Bioinformatics Platform for Analyses of Large 16S rDNA Libraries

Yanan Yu, Mya Breitbart, Pat McNairnie, Forest Rohwer

Research output: Contribution to journalArticlepeer-review

Abstract

Background: High-throughput sequencing makes it possible to rapidly obtain thousands of 16S rDNA sequences from environmental samples. Bioinformatic tools for the analyses of large 16S rDNA sequence databases are needed to comprehensively describe and compare these datasets.

Results: FastGroupII is a web-based bioinformatics platform to dereplicate large 16S rDNA libraries. FastGroupII provides users with the option of four different dereplication methods, performs rarefaction analysis, and automatically calculates the Shannon-Wiener Index and Chao1. FastGroupII was tested on a set of 16S rDNA sequences from coral-associated Bacteria . The different grouping algorithms produced similar, but not identical, results. This suggests that 16S rDNA datasets need to be analyzed in multiple ways when being used for community ecology studies.

Conclusion: FastGroupII is an effective bioinformatics tool for the trimming and dereplication of 16S rDNA sequences. Several standard diversity indices are calculated, and the raw sequences are prepared for downstream analyses.

Original languageAmerican English
JournalBMC Bioinformatics
Volume7
DOIs
StatePublished - Jan 1 2006

Keywords

  • Test Dataset
  • Query Sequence
  • Representative Sequence
  • Percentage Sequence Identity
  • Single Base Insertion

Disciplines

  • Life Sciences

Cite this