bodymapRat 1.2.0
The bodymapRat
package contains gene expression
data on 652 RNA-Seq samples from a comprehensive rat
transcriptomic BodyMap study. These samples include
the sequence identifier information provided in the
header of the FASTQ files which can be used as a
surrogate for batch. These samples have not been
normalized or pre-processed.
The data are provided in a SummarizedExperiment
. The
phenotypic information can be extracted using the
colData()
function and a description of the phenotypic
data is listed in the table below:
Title | Description |
---|---|
sraExperiment | SRA Experiment ID |
title | Title of sample provided by the authors |
geoAccession | GEO Accession ID |
BioSample | BioSample ID |
avgLength | Average read length |
instrument | Machine identifier (from FASTQ header) |
runID | Run ID (from FASTQ header) |
fcID | Flow cell ID (from FASTQ header) |
fcLane | Flow cell lane (from FASTQ header) |
tile | Tile (from FASTQ header) |
xtile | xtile (from FASTQ header) |
ytile | ytile (from FASTQ header) |
organ | Body organ |
sex | Gender |
stage | Stage |
techRep | Technical replicate number |
colOrgan | Column of colors to help with plotting |
rnaRIN | RIN number |
barcode | barcode number |
The data can be accessed as follows:
library(SummarizedExperiment)
library(bodymapRat)
We use the bodymapRat()
function to download the
relevant files from Bioconductor’s
ExperimentHub web
resource. Running this function will download a
SummarizedExperiment
object, which contains read counts, as
well as the metadata on the rows (genes) and columns (cells).
bm_rat <- bodymapRat()
# Get the expression data
counts = assay(bm_rat)
dim(counts)
## [1] 32637 652
counts[1:5, 1:5]
## SRR1169893 SRR1169894 SRR1169895 SRR1169896 SRR1169897
## ENSRNOG00000000001 1 0 0 1 4
## ENSRNOG00000000007 1 1 0 3 0
## ENSRNOG00000000008 7 4 2 3 7
## ENSRNOG00000000009 0 0 0 0 1
## ENSRNOG00000000010 0 1 0 0 0
# Get the meta data along columns
head(colData(bm_rat))
## DataFrame with 6 rows and 22 columns
## sraExperiment sraRun title geoAccession sraSample
## <character> <character> <character> <factor> <factor>
## SRR1169893 SRX471368 SRR1169893 Adr_F_002_1 GSM1328469 SRS558114
## SRR1169894 SRX471368 SRR1169894 Adr_F_002_1 GSM1328469 SRS558114
## SRR1169895 SRX471369 SRR1169895 Adr_F_002_2 GSM1328470 SRS558115
## SRR1169896 SRX471369 SRR1169896 Adr_F_002_2 GSM1328470 SRS558115
## SRR1169897 SRX471370 SRR1169897 Adr_F_002_3 GSM1328471 SRS558116
## SRR1169898 SRX471370 SRR1169898 Adr_F_002_3 GSM1328471 SRS558116
## BioSample avgLength organ sex stage
## <factor> <integer> <character> <character> <numeric>
## SRR1169893 SAMN02642886 50 Adrenal F 2
## SRR1169894 SAMN02642886 50 Adrenal F 2
## SRR1169895 SAMN02642867 50 Adrenal F 2
## SRR1169896 SAMN02642867 50 Adrenal F 2
## SRR1169897 SAMN02642894 50 Adrenal F 2
## SRR1169898 SAMN02642894 50 Adrenal F 2
## techRep colOrgan mix rnaRIN barcode
## <integer> <character> <character> <character> <character>
## SRR1169893 1 brown M1 9.3 11
## SRR1169894 2 brown M1 9.3 11
## SRR1169895 1 brown M1 9.1 5
## SRR1169896 2 brown M1 9.1 5
## SRR1169897 1 brown M1 9.5 3
## SRR1169898 2 brown M1 9.5 3
## instrument runID fcID fcLane tile
## <character> <character> <character> <character> <character>
## SRR1169893 HWI-ST845 120326 D0VTJACXX 2 1101
## SRR1169894 HWI-ST845 120525 D10G7ACXX 2 1101
## SRR1169895 HWI-ST845 120326 D0VTJACXX 5 1101
## SRR1169896 HWI-ST845 120525 D10G7ACXX 5 1101
## SRR1169897 HWI-ST1131 120424 C0P4UACXX 4 1101
## SRR1169898 HWI-ST1195 120525 C0TDUACXX 4 1101
## xtile ytile
## <character> <character>
## SRR1169893 1506 2000
## SRR1169894 1394 2133
## SRR1169895 1170 2029
## SRR1169896 1650 2126
## SRR1169897 1675 2216
## SRR1169898 1138 2067
The data in this package are used as an example
data set in the
qsmooth
Bioconductor package.
sessionInfo()
## R version 3.6.1 (2019-07-05)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 18.04.3 LTS
##
## Matrix products: default
## BLAS: /home/biocbuild/bbs-3.10-bioc/R/lib/libRblas.so
## LAPACK: /home/biocbuild/bbs-3.10-bioc/R/lib/libRlapack.so
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## attached base packages:
## [1] parallel stats4 stats graphics grDevices utils datasets
## [8] methods base
##
## other attached packages:
## [1] bodymapRat_1.2.0 ExperimentHub_1.12.0
## [3] AnnotationHub_2.18.0 BiocFileCache_1.10.0
## [5] dbplyr_1.4.2 SummarizedExperiment_1.16.0
## [7] DelayedArray_0.12.0 BiocParallel_1.20.0
## [9] matrixStats_0.55.0 Biobase_2.46.0
## [11] GenomicRanges_1.38.0 GenomeInfoDb_1.22.0
## [13] IRanges_2.20.0 S4Vectors_0.24.0
## [15] BiocGenerics_0.32.0 knitr_1.25
## [17] BiocStyle_2.14.0
##
## loaded via a namespace (and not attached):
## [1] Rcpp_1.0.2 lattice_0.20-38
## [3] assertthat_0.2.1 zeallot_0.1.0
## [5] digest_0.6.22 mime_0.7
## [7] R6_2.4.0 backports_1.1.5
## [9] RSQLite_2.1.2 evaluate_0.14
## [11] httr_1.4.1 pillar_1.4.2
## [13] zlibbioc_1.32.0 rlang_0.4.1
## [15] curl_4.2 blob_1.2.0
## [17] Matrix_1.2-17 rmarkdown_1.16
## [19] stringr_1.4.0 RCurl_1.95-4.12
## [21] bit_1.1-14 shiny_1.4.0
## [23] compiler_3.6.1 httpuv_1.5.2
## [25] xfun_0.10 pkgconfig_2.0.3
## [27] htmltools_0.4.0 tidyselect_0.2.5
## [29] tibble_2.1.3 GenomeInfoDbData_1.2.2
## [31] interactiveDisplayBase_1.24.0 bookdown_0.14
## [33] crayon_1.3.4 dplyr_0.8.3
## [35] later_1.0.0 bitops_1.0-6
## [37] rappdirs_0.3.1 grid_3.6.1
## [39] xtable_1.8-4 DBI_1.0.0
## [41] magrittr_1.5 stringi_1.4.3
## [43] XVector_0.26.0 promises_1.1.0
## [45] vctrs_0.2.0 tools_3.6.1
## [47] bit64_0.9-7 glue_1.3.1
## [49] purrr_0.3.3 BiocVersion_3.10.1
## [51] fastmap_1.0.1 yaml_2.2.0
## [53] AnnotationDbi_1.48.0 BiocManager_1.30.9
## [55] memoise_1.1.0