You’re going to be looking at the count matrix from a droplet-based scRNA-seq of human cells.
WT-1.dge.txt.gz
) and read it into R.Note: given the size of the matrix, you may have to do the analysis on the server, where you will not have access to RStudio, but to the console (just type
R
after logging in and switching to your designated folder). To obtain images for your Rmarkdown report, you could, for example, opt to write and compile the Rmd on your laptop. This would mean that you should set the chunk options toeval=FALSE
because you’re not going to actually execute the code on your machine. To include images this way, you would have to compute them on the server, download them viascp
and integrate them into the report via the common markdown syntax:
. For more info on code chunks, see here. ALTERNATIVELY, without changing the chunk options, you could compile the html on the server wherermarkdown::render("input.Rmd")
(within R) will carry out the rendering that happens when you click “Knit” in RStudio.
SingleCellExperiment
object. (This resource is useful in getting to know the capabilities of the bioconductor objects related to scRNA-seq). (1pt)SCE
object. (0.5pt)Note: You may find that the histograms are not as informative as you had hoped. You may find the combination of violin plots (
geom_violin()
) and beeswarm plots (ggbeeswarm::geom_quasirandom(alpha = 0.5)
) more helpful. More details on beeswarm plots can be found here.
scran
and scater
and judge whether the size factors calculated by computeSumFactors
show the expected behavior as shown in Figure 6 of the simpleSingleCell workflow. (1pt)
Seurat can be installed via the usual install.packages
routine.
Seurat::CreateSeuratObject
) (1pt)NormalizeData
). (.5pt)
Hint: It’s a fairly homogeneous population, i.e. all cells would probably be called the same cell name where cell name would be something like “skin cell”. Explain your reasoning!
The point is for your reasoning, there will be an extra-credit point if you identify the cell type correctly.