Last updated: 2020-04-26
Checks: 7 0
Knit directory: Bgee/
This reproducible R Markdown analysis was created with workflowr (version 1.6.1). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.
Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.
Great job! The global environment was empty. Objects defined in the global environment can affect the analysis in your R Markdown file in unknown ways. For reproduciblity it’s best to always run the code in an empty environment.
The command set.seed(20200417)
was run prior to running the code in the R Markdown file. Setting a seed ensures that any results that rely on randomness, e.g. subsampling or permutations, are reproducible.
Great job! Recording the operating system, R version, and package versions is critical for reproducibility.
Nice! There were no cached chunks for this analysis, so you can be confident that you successfully produced the results during this run.
Great job! Using relative paths to the files within your workflowr project makes it easier to run your code on other machines.
Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility.
The results in this page were generated with repository version c287d01. See the Past versions tab to see a history of the changes made to the R Markdown and HTML files.
Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish
or wflow_git_commit
). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:
Ignored files:
Ignored: .Rhistory
Untracked files:
Untracked: analysis/.here
Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.
These are the previous versions of the repository in which changes were made to the R Markdown (analysis/downloaddata.Rmd
) and HTML (docs/downloaddata.html
) files. If you’ve configured a remote Git repository (see ?wflow_git_remote
), click on the hyperlinks in the table below to view the files as they were in that past version.
File | Version | Author | Date | Message |
---|---|---|---|---|
Rmd | c287d01 | SFonsecaCosta | 2020-04-26 | Update |
html | ae29961 | SFonsecaCosta | 2020-04-22 | Build site. |
Rmd | 9907294 | SFonsecaCosta | 2020-04-22 | clean text |
html | 8d9e88f | SFonsecaCosta | 2020-04-20 | Build site. |
Rmd | f18beb5 | SFonsecaCosta | 2020-04-20 | Minor things |
html | 3118c12 | SFonsecaCosta | 2020-04-20 | Build site. |
Rmd | 4b9ee87 | SFonsecaCosta | 2020-04-20 | cache=FALSE |
html | 8d821e2 | SFonsecaCosta | 2020-04-20 | Build site. |
Rmd | 9073f83 | SFonsecaCosta | 2020-04-20 | add analysis |
If you do not want to use filters, as showed in the previous section (Extract Information) where you target a particular experiment to download, you can always download all the data of reference species using the getData()
function. Note that in the getData() you should specify which data type you would like to retrieve from Bgee by specifying that in the argument dataType
.
DrosMelRNASeq <- Bgee$new(species = "Drosophila_melanogaster", dataType = "rna_seq")
Querying Bgee to get release information...
Building URL to query species in Bgee release 14_1...
Submitting URL to Bgee webservice... (https://r.bgee.org/bgee14_1/?page=r_package&action=get_all_species&display_type=tsv&source=BgeeDB_R_package&source_version=2.12.1)
Query to Bgee webservice successful!
API key built: 342617eeb6e728567eeaf07855efc4fa274e6ad21dde329ecce7bd3668c4efa9f23c364808e45b7b772594a592dfe6855ff75027e7983a848ff43a25416ebe6f
dataBgee_DM <- getData(DrosMelRNASeq)
The experiment is not defined. Hence taking all rna_seq experiments available for Drosophila_melanogaster.
Downloading expression data...
Saved expression data file in/Users/sfonseca1/Bgee/Drosophila_melanogaster_Bgee_14_1 folder. Now untar file...
Finished uncompress tar files
Saving all data in .rds file...
## Number of experiments that exist in BgeeDB
length(dataBgee_DM)
[1] 11
In case you are interested in downloading a particular experiment (as showed before), anatomical entity or developmental stage, you can just download this data by specifying that in the respective arguments: experimentId
, anatEntityId
and stageId
.
Taking your experimentId result from the filtering in previously section, download just this respective experiment by specifying that in the argument experimentId
in the getData()
function.
DrosMelRNASeq_SRP002072 <- getData(DrosMelRNASeq, experimentId = "SRP002072")
Downloading expression data for the experiment SRP002072...
Saved expression data file in /Users/sfonseca1/Bgee/Drosophila_melanogaster_Bgee_14_1 folder. Now untar file...
Finished uncompress tar files
Saving all data in .rds file...
head(DrosMelRNASeq_SRP002072)
Experiment.ID Library.ID Library.type Gene.ID Anatomical.entity.ID
1 SRP002072 SRX019645 paired FBgn0000003 UBERON:0000922
2 SRP002072 SRX019645 paired FBgn0000008 UBERON:0000922
3 SRP002072 SRX019645 paired FBgn0000014 UBERON:0000922
4 SRP002072 SRX019645 paired FBgn0000015 UBERON:0000922
5 SRP002072 SRX019645 paired FBgn0000017 UBERON:0000922
6 SRP002072 SRX019645 paired FBgn0000018 UBERON:0000922
Anatomical.entity.name Stage.ID Stage.name Sex
1 embryo FBdv:00005306 embryonic stage 4 (Drosophila) <NA>
2 embryo FBdv:00005306 embryonic stage 4 (Drosophila) <NA>
3 embryo FBdv:00005306 embryonic stage 4 (Drosophila) <NA>
4 embryo FBdv:00005306 embryonic stage 4 (Drosophila) <NA>
5 embryo FBdv:00005306 embryonic stage 4 (Drosophila) <NA>
6 embryo FBdv:00005306 embryonic stage 4 (Drosophila) <NA>
Strain Read.count TPM FPKM Detection.flag Detection.quality
1 NA 5071.0000 7931.226169 10518.520218 present high quality
2 NA 172.0002 5.951876 7.893474 present high quality
3 NA 16.0000 0.604627 0.801867 present high quality
4 NA 19.0000 0.793388 1.052205 present high quality
5 NA 841.1847 22.824416 30.270109 present high quality
6 NA 94.0000 9.765199 12.950765 present high quality
State.in.Bgee
1 Part of a call
2 Part of a call
3 Part of a call
4 Part of a call
5 Part of a call
6 Part of a call
Note that in the next sub-sections the arguments anatEntityId
and stageId
in the getData() function are implemented just in BgeeDB package in the Bioconductor release 3.11.
You are also able to retrieve data from the database by specifying the anatomic entity of interest for this particular experiment. Here we will target adult organism as an example.
DrosMelRNASeq_annEnt <- getData(DrosMelRNASeq, experimentId = "SRP002072", anatEntityId = "UBERON:0007023")
head(DrosMelRNASeq_annEnt)
Note you can specify more than one anatomical entity in the anatEntityId
argument.
DrosMelRNASeq_annEnt_plus <- getData(DrosMelRNASeq, experimentId = "SRP002072", anatEntityId = c("UBERON:0007023","UBERON:0000922","UBERON:0002548"))
head(DrosMelRNASeq_annEnt_plus)
As demonstrated before, for experiment and anatomical entities, you also can download data by specifying your target developmental stage of interest in the stageId
argument.
DrosMelRNASeq_SRP002072_stg <- getData(DrosMelRNASeq, stageId = c("UBERON:0000068","FBdv:00005341"))
head(DrosMelRNASeq_SRP002072_stg)
sessionInfo()
R version 3.6.0 (2019-04-26)
Platform: x86_64-apple-darwin15.6.0 (64-bit)
Running under: macOS High Sierra 10.13.4
Matrix products: default
BLAS: /Library/Frameworks/R.framework/Versions/3.6/Resources/lib/libRblas.0.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/3.6/Resources/lib/libRlapack.dylib
locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
attached base packages:
[1] stats4 parallel stats graphics grDevices utils datasets
[8] methods base
other attached packages:
[1] BgeeDB_2.12.1 tidyr_1.0.2 topGO_2.38.1
[4] SparseM_1.78 GO.db_3.10.0 AnnotationDbi_1.48.0
[7] IRanges_2.20.2 S4Vectors_0.24.4 Biobase_2.46.0
[10] graph_1.64.0 BiocGenerics_0.32.0 workflowr_1.6.1
loaded via a namespace (and not attached):
[1] tidyselect_1.0.0 xfun_0.13 purrr_0.3.3 lattice_0.20-41
[5] vctrs_0.2.4 htmltools_0.4.0 yaml_2.2.1 blob_1.2.1
[9] rlang_0.4.5 later_1.0.0 pillar_1.4.3 glue_1.4.0
[13] DBI_1.1.0 bit64_0.9-7 matrixStats_0.56.0 lifecycle_0.2.0
[17] stringr_1.4.0 memoise_1.1.0 evaluate_0.14 knitr_1.28
[21] httpuv_1.5.2 fansi_0.4.1 Rcpp_1.0.4.6 promises_1.1.0
[25] backports_1.1.6 fs_1.4.1 bit_1.1-15.2 digest_0.6.25
[29] stringi_1.4.6 dplyr_0.8.5 rprojroot_1.3-2 grid_3.6.0
[33] cli_2.0.2 tools_3.6.0 bitops_1.0-6 magrittr_1.5
[37] RCurl_1.98-1.1 RSQLite_2.2.0 tibble_3.0.0 crayon_1.3.4
[41] whisker_0.4 pkgconfig_2.0.3 ellipsis_0.3.0 data.table_1.12.8
[45] assertthat_0.2.1 rmarkdown_2.1 R6_2.4.1 git2r_0.26.1
[49] compiler_3.6.0