library(microbiomeDataSets)
#> Loading required package: SummarizedExperiment
#> Loading required package: MatrixGenerics
#> Loading required package: matrixStats
#> 
#> Attaching package: 'MatrixGenerics'
#> The following objects are masked from 'package:matrixStats':
#> 
#>     colAlls, colAnyNAs, colAnys, colAvgsPerRowSet, colCollapse,
#>     colCounts, colCummaxs, colCummins, colCumprods, colCumsums,
#>     colDiffs, colIQRDiffs, colIQRs, colLogSumExps, colMadDiffs,
#>     colMads, colMaxs, colMeans2, colMedians, colMins, colOrderStats,
#>     colProds, colQuantiles, colRanges, colRanks, colSdDiffs, colSds,
#>     colSums2, colTabulates, colVarDiffs, colVars, colWeightedMads,
#>     colWeightedMeans, colWeightedMedians, colWeightedSds,
#>     colWeightedVars, rowAlls, rowAnyNAs, rowAnys, rowAvgsPerColSet,
#>     rowCollapse, rowCounts, rowCummaxs, rowCummins, rowCumprods,
#>     rowCumsums, rowDiffs, rowIQRDiffs, rowIQRs, rowLogSumExps,
#>     rowMadDiffs, rowMads, rowMaxs, rowMeans2, rowMedians, rowMins,
#>     rowOrderStats, rowProds, rowQuantiles, rowRanges, rowRanks,
#>     rowSdDiffs, rowSds, rowSums2, rowTabulates, rowVarDiffs, rowVars,
#>     rowWeightedMads, rowWeightedMeans, rowWeightedMedians,
#>     rowWeightedSds, rowWeightedVars
#> Loading required package: GenomicRanges
#> Loading required package: stats4
#> Loading required package: BiocGenerics
#> 
#> Attaching package: 'BiocGenerics'
#> The following objects are masked from 'package:stats':
#> 
#>     IQR, mad, sd, var, xtabs
#> The following objects are masked from 'package:base':
#> 
#>     Filter, Find, Map, Position, Reduce, anyDuplicated, aperm, append,
#>     as.data.frame, basename, cbind, colnames, dirname, do.call,
#>     duplicated, eval, evalq, get, grep, grepl, intersect, is.unsorted,
#>     lapply, mapply, match, mget, order, paste, pmax, pmax.int, pmin,
#>     pmin.int, rank, rbind, rownames, sapply, setdiff, sort, table,
#>     tapply, union, unique, unsplit, which.max, which.min
#> Loading required package: S4Vectors
#> 
#> Attaching package: 'S4Vectors'
#> The following object is masked from 'package:utils':
#> 
#>     findMatches
#> The following objects are masked from 'package:base':
#> 
#>     I, expand.grid, unname
#> Loading required package: IRanges
#> Loading required package: GenomeInfoDb
#> Loading required package: Biobase
#> Welcome to Bioconductor
#> 
#>     Vignettes contain introductory material; view with
#>     'browseVignettes()'. To cite Bioconductor, see
#>     'citation("Biobase")', and for packages 'citation("pkgname")'.
#> 
#> Attaching package: 'Biobase'
#> The following object is masked from 'package:MatrixGenerics':
#> 
#>     rowMedians
#> The following objects are masked from 'package:matrixStats':
#> 
#>     anyMissing, rowMedians
#> Loading required package: TreeSummarizedExperiment
#> Loading required package: SingleCellExperiment
#> Loading required package: Biostrings
#> Loading required package: XVector
#> 
#> Attaching package: 'Biostrings'
#> The following object is masked from 'package:base':
#> 
#>     strsplit
#> Loading required package: MultiAssayExperiment

1 Microbiome example data sets

Each data set is named primarily after the first author of the associated publication, followed by a descriptive suffix. Aliases are provided for some data sets.

The availableDataSets function returns a table of the available data sets.

availableDataSets()
#>             Dataset
#> 1  GrieneisenTSData
#> 2    HintikkaXOData
#> 3       LahtiMLData
#> 4        LahtiMData
#> 5       LahtiWAData
#> 6      OKeefeDSData
#> 7 SilvermanAGutData
#> 8        SongQAData
#> 9   SprockettTHData

All data are downloaded from ExperimentHub and cached for local reuse. See the man page of each function for detailed documentation of the data contents and the original source.
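As a minimal sketch of how a data set might be loaded by name, the helper below checks a name against the Dataset column and then calls the function of the same name. The helper load_data_set is hypothetical and not part of microbiomeDataSets; it assumes that every entry in the Dataset column is also the name of an exported loader function that can be called without arguments.

```r
# Hypothetical helper, not part of microbiomeDataSets: call the loader
# function whose name matches an entry of the Dataset column.
load_data_set <- function(name, available) {
  # Fail early on anything that is not a single name from the table.
  if (!is.character(name) || length(name) != 1L || !(name %in% available)) {
    stop("Unknown data set: ", paste(name, collapse = ", "), call. = FALSE)
  }
  # Assumption: the package exports a function with the same name.
  loader <- getExportedValue("microbiomeDataSets", name)
  loader()
}

# Example call (needs the package installed and network access on first use;
# later calls should be served from the local ExperimentHub cache):
# library(microbiomeDataSets)
# tse <- load_data_set("LahtiWAData", availableDataSets()$Dataset)
```

Calling the loader directly, for example LahtiWAData(), is the simpler route in interactive use; the lookup only helps when the name arrives as a string, such as in a loop over the table.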

The microbiome data are usually loaded as a TreeSummarizedExperiment. If other associated data tables (metabolomic, biomarker, etc.) are available, the integrated data collection is provided as a MultiAssayExperiment.
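Because a loader may return either container, downstream code sometimes needs to branch on the class. The sketch below shows one way to do that; container_kind is a hypothetical helper, not part of any package, and it tests only the two class names mentioned above.

```r
# Hypothetical helper, not part of any package: report which of the two
# container classes an object belongs to.
container_kind <- function(x) {
  if (methods::is(x, "MultiAssayExperiment")) {
    "MultiAssayExperiment"
  } else if (methods::is(x, "TreeSummarizedExperiment")) {
    "TreeSummarizedExperiment"
  } else {
    "other"
  }
}

# Example use with a loaded data set `x` (not run here):
# if (container_kind(x) == "MultiAssayExperiment") {
#   names(MultiAssayExperiment::experiments(x))  # one entry per data table
# } else {
#   SummarizedExperiment::assayNames(x)          # assays of a single table
# }
```

The MultiAssayExperiment check comes first only by convention; the two classes are unrelated, so the order of the tests does not change the result.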

For more information on how to use these objects, please refer to the vignettes of those packages.

sessionInfo()
#> R version 4.3.1 (2023-06-16)
#> Platform: x86_64-pc-linux-gnu (64-bit)
#> Running under: Ubuntu 22.04.3 LTS
#> 
#> Matrix products: default
#> BLAS:   /home/biocbuild/bbs-3.18-bioc/R/lib/libRblas.so 
#> LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.10.0
#> 
#> locale:
#>  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
#>  [3] LC_TIME=en_GB              LC_COLLATE=C              
#>  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
#>  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
#>  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
#> [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
#> 
#> time zone: America/New_York
#> tzcode source: system (glibc)
#> 
#> attached base packages:
#> [1] stats4    stats     graphics  grDevices utils     datasets  methods  
#> [8] base     
#> 
#> other attached packages:
#>  [1] microbiomeDataSets_1.10.0       MultiAssayExperiment_1.28.0    
#>  [3] TreeSummarizedExperiment_2.10.0 Biostrings_2.70.1              
#>  [5] XVector_0.42.0                  SingleCellExperiment_1.24.0    
#>  [7] SummarizedExperiment_1.32.0     Biobase_2.62.0                 
#>  [9] GenomicRanges_1.54.0            GenomeInfoDb_1.38.0            
#> [11] IRanges_2.36.0                  S4Vectors_0.40.0               
#> [13] BiocGenerics_0.48.0             MatrixGenerics_1.14.0          
#> [15] matrixStats_1.0.0               BiocStyle_2.30.0               
#> 
#> loaded via a namespace (and not attached):
#>  [1] tidyselect_1.2.0              dplyr_1.1.3                  
#>  [3] blob_1.2.4                    filelock_1.0.2               
#>  [5] bitops_1.0-7                  fastmap_1.1.1                
#>  [7] lazyeval_0.2.2                RCurl_1.98-1.12              
#>  [9] BiocFileCache_2.10.0          promises_1.2.1               
#> [11] digest_0.6.33                 mime_0.12                    
#> [13] lifecycle_1.0.3               ellipsis_0.3.2               
#> [15] KEGGREST_1.42.0               interactiveDisplayBase_1.40.0
#> [17] tidytree_0.4.5                RSQLite_2.3.1                
#> [19] magrittr_2.0.3                compiler_4.3.1               
#> [21] rlang_1.1.1                   sass_0.4.7                   
#> [23] tools_4.3.1                   utf8_1.2.4                   
#> [25] yaml_2.3.7                    knitr_1.44                   
#> [27] S4Arrays_1.2.0                bit_4.0.5                    
#> [29] curl_5.1.0                    DelayedArray_0.28.0          
#> [31] abind_1.4-5                   BiocParallel_1.36.0          
#> [33] purrr_1.0.2                   grid_4.3.1                   
#> [35] fansi_1.0.5                   ExperimentHub_2.10.0         
#> [37] xtable_1.8-4                  cli_3.6.1                    
#> [39] rmarkdown_2.25                crayon_1.5.2                 
#> [41] treeio_1.26.0                 generics_0.1.3               
#> [43] httr_1.4.7                    DBI_1.1.3                    
#> [45] ape_5.7-1                     cachem_1.0.8                 
#> [47] zlibbioc_1.48.0               parallel_4.3.1               
#> [49] AnnotationDbi_1.64.0          BiocManager_1.30.22          
#> [51] vctrs_0.6.4                   yulab.utils_0.1.0            
#> [53] Matrix_1.6-1.1                jsonlite_1.8.7               
#> [55] bookdown_0.36                 bit64_4.0.5                  
#> [57] jquerylib_0.1.4               tidyr_1.3.0                  
#> [59] glue_1.6.2                    codetools_0.2-19             
#> [61] later_1.3.1                   BiocVersion_3.18.0           
#> [63] tibble_3.2.1                  pillar_1.9.0                 
#> [65] rappdirs_0.3.3                htmltools_0.5.6.1            
#> [67] GenomeInfoDbData_1.2.11       R6_2.5.1                     
#> [69] dbplyr_2.3.4                  shiny_1.7.5.1                
#> [71] evaluate_0.22                 lattice_0.22-5               
#> [73] AnnotationHub_3.10.0          png_0.1-8                    
#> [75] memoise_2.0.1                 httpuv_1.6.12                
#> [77] bslib_0.5.1                   Rcpp_1.0.11                  
#> [79] SparseArray_1.2.0             nlme_3.1-163                 
#> [81] xfun_0.40                     fs_1.6.3                     
#> [83] pkgconfig_2.0.3