phyloseq: A Bioconductor Package for Handling and Analysis of High-Throughput Phylogenetic Sequence Data

Paul J. McMurdie and Susan Holmes

Statistics Department, Stanford University, Stanford, CA 94305, USA

Pacific Symposium on Biocomputing 17:235-246 (2012)


We present a detailed description of a new Bioconductor package, phyloseq, for the integrated handling and analysis of taxonomically clustered phylogenetic sequencing data in conjunction with related data types. The phyloseq package integrates abundance data, phylogenetic information, and covariates so that exploratory transformations and plots, confirmatory testing, and diagnostic plots can be carried out seamlessly. The package is built on the S4 object-oriented framework of the R language, so that once the data have been imported the user can easily transform, plot, and analyze them. We present examples that highlight these methods and the ease with which existing packages can be leveraged.

