bodymapRat 1.4.0
The bodymapRat
package contains gene expression
data on 652 RNA-Seq samples from a comprehensive rat
transcriptomic BodyMap study. These samples include
the sequence identifier information provided in the
header of the FASTQ files which can be used as a
surrogate for batch. These samples have not been
normalized or pre-processed.
The data are provided in a SummarizedExperiment
. The
phenotypic information can be extracted using the
colData()
function and a description of the phenotypic
data is listed in the table below:
Title | Description |
---|---|
sraExperiment | SRA Experiment ID |
title | Title of sample provided by the authors |
geoAccession | GEO Accession ID |
BioSample | BioSample ID |
avgLength | Average read length |
instrument | Machine identifier (from FASTQ header) |
runID | Run ID (from FASTQ header) |
fcID | Flow cell ID (from FASTQ header) |
fcLane | Flow cell lane (from FASTQ header) |
tile | Tile (from FASTQ header) |
xtile | xtile (from FASTQ header) |
ytile | ytile (from FASTQ header) |
organ | Body organ |
sex | Gender |
stage | Stage |
techRep | Technical replicate number |
colOrgan | Column of colors to help with plotting |
rnaRIN | RIN number |
barcode | barcode number |
The data can be accessed as follows:
library(SummarizedExperiment)
library(bodymapRat)
We use the bodymapRat()
function to download the
relevant files from Bioconductor’s
ExperimentHub web
resource. Running this function will download a
SummarizedExperiment
object, which contains read counts, as
well as the metadata on the rows (genes) and columns (cells).
bm_rat <- bodymapRat()
# Get the expression data
counts = assay(bm_rat)
dim(counts)
## [1] 32637 652
counts[1:5, 1:5]
## SRR1169893 SRR1169894 SRR1169895 SRR1169896 SRR1169897
## ENSRNOG00000000001 1 0 0 1 4
## ENSRNOG00000000007 1 1 0 3 0
## ENSRNOG00000000008 7 4 2 3 7
## ENSRNOG00000000009 0 0 0 0 1
## ENSRNOG00000000010 0 1 0 0 0
# Get the meta data along columns
head(colData(bm_rat))
## DataFrame with 6 rows and 22 columns
## sraExperiment sraRun title geoAccession sraSample
## <character> <character> <character> <factor> <factor>
## SRR1169893 SRX471368 SRR1169893 Adr_F_002_1 GSM1328469 SRS558114
## SRR1169894 SRX471368 SRR1169894 Adr_F_002_1 GSM1328469 SRS558114
## SRR1169895 SRX471369 SRR1169895 Adr_F_002_2 GSM1328470 SRS558115
## SRR1169896 SRX471369 SRR1169896 Adr_F_002_2 GSM1328470 SRS558115
## SRR1169897 SRX471370 SRR1169897 Adr_F_002_3 GSM1328471 SRS558116
## SRR1169898 SRX471370 SRR1169898 Adr_F_002_3 GSM1328471 SRS558116
## BioSample avgLength organ sex stage techRep
## <factor> <integer> <character> <character> <numeric> <integer>
## SRR1169893 SAMN02642886 50 Adrenal F 2 1
## SRR1169894 SAMN02642886 50 Adrenal F 2 2
## SRR1169895 SAMN02642867 50 Adrenal F 2 1
## SRR1169896 SAMN02642867 50 Adrenal F 2 2
## SRR1169897 SAMN02642894 50 Adrenal F 2 1
## SRR1169898 SAMN02642894 50 Adrenal F 2 2
## colOrgan mix rnaRIN barcode instrument
## <character> <character> <character> <character> <character>
## SRR1169893 brown M1 9.3 11 HWI-ST845
## SRR1169894 brown M1 9.3 11 HWI-ST845
## SRR1169895 brown M1 9.1 5 HWI-ST845
## SRR1169896 brown M1 9.1 5 HWI-ST845
## SRR1169897 brown M1 9.5 3 HWI-ST1131
## SRR1169898 brown M1 9.5 3 HWI-ST1195
## runID fcID fcLane tile xtile
## <character> <character> <character> <character> <character>
## SRR1169893 120326 D0VTJACXX 2 1101 1506
## SRR1169894 120525 D10G7ACXX 2 1101 1394
## SRR1169895 120326 D0VTJACXX 5 1101 1170
## SRR1169896 120525 D10G7ACXX 5 1101 1650
## SRR1169897 120424 C0P4UACXX 4 1101 1675
## SRR1169898 120525 C0TDUACXX 4 1101 1138
## ytile
## <character>
## SRR1169893 2000
## SRR1169894 2133
## SRR1169895 2029
## SRR1169896 2126
## SRR1169897 2216
## SRR1169898 2067
The data in this package are used as an example
data set in the
qsmooth
Bioconductor package.
sessionInfo()
## R version 4.0.0 (2020-04-24)
## Platform: x86_64-pc-linux-gnu (64-bit)
## Running under: Ubuntu 18.04.4 LTS
##
## Matrix products: default
## BLAS: /home/biocbuild/bbs-3.11-bioc/R/lib/libRblas.so
## LAPACK: /home/biocbuild/bbs-3.11-bioc/R/lib/libRlapack.so
##
## locale:
## [1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
## [3] LC_TIME=en_US.UTF-8 LC_COLLATE=C
## [5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
## [7] LC_PAPER=en_US.UTF-8 LC_NAME=C
## [9] LC_ADDRESS=C LC_TELEPHONE=C
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
##
## attached base packages:
## [1] parallel stats4 stats graphics grDevices utils datasets
## [8] methods base
##
## other attached packages:
## [1] bodymapRat_1.4.0 ExperimentHub_1.14.0
## [3] AnnotationHub_2.20.0 BiocFileCache_1.12.0
## [5] dbplyr_1.4.3 SummarizedExperiment_1.18.1
## [7] DelayedArray_0.14.0 matrixStats_0.56.0
## [9] Biobase_2.48.0 GenomicRanges_1.40.0
## [11] GenomeInfoDb_1.24.0 IRanges_2.22.1
## [13] S4Vectors_0.26.0 BiocGenerics_0.34.0
## [15] knitr_1.28 BiocStyle_2.16.0
##
## loaded via a namespace (and not attached):
## [1] Rcpp_1.0.4.6 lattice_0.20-41
## [3] assertthat_0.2.1 digest_0.6.25
## [5] mime_0.9 R6_2.4.1
## [7] RSQLite_2.2.0 evaluate_0.14
## [9] httr_1.4.1 pillar_1.4.4
## [11] zlibbioc_1.34.0 rlang_0.4.6
## [13] curl_4.3 blob_1.2.1
## [15] Matrix_1.2-18 rmarkdown_2.1
## [17] stringr_1.4.0 RCurl_1.98-1.2
## [19] bit_1.1-15.2 shiny_1.4.0.2
## [21] compiler_4.0.0 httpuv_1.5.2
## [23] xfun_0.13 pkgconfig_2.0.3
## [25] htmltools_0.4.0 tidyselect_1.0.0
## [27] tibble_3.0.1 GenomeInfoDbData_1.2.3
## [29] interactiveDisplayBase_1.26.0 bookdown_0.18
## [31] crayon_1.3.4 dplyr_0.8.5
## [33] later_1.0.0 bitops_1.0-6
## [35] rappdirs_0.3.1 grid_4.0.0
## [37] xtable_1.8-4 lifecycle_0.2.0
## [39] DBI_1.1.0 magrittr_1.5
## [41] stringi_1.4.6 XVector_0.28.0
## [43] promises_1.1.0 ellipsis_0.3.0
## [45] vctrs_0.2.4 tools_4.0.0
## [47] bit64_0.9-7 glue_1.4.0
## [49] purrr_0.3.4 BiocVersion_3.11.1
## [51] fastmap_1.0.1 yaml_2.2.1
## [53] AnnotationDbi_1.50.0 BiocManager_1.30.10
## [55] memoise_1.1.0