Sam’s Notebook: Data Received – C.bairdi RNAseq Day9-12-26 Infected-Uninfected

Previously, we “received” this data, but it turns out it was incomplete (see 20191003).

Today, we finally received all the RNAseq data (>50M reads per samples) back from NWGC that we submitted on 20190521!

The second round of data is in addition to the data we received on 20191003. So, to simplify some of the data management and downstream processing of these files, I decided to concatenate the two sets of file. Concatenation is documented in this Jupyter Notebook (GitHub):

Here’s a table with the library names and the FastQ naming schemes.

NWGC Sample ID Investigator Sample ID
329772 D9_infected
329773 D9_uninfected
329774 D12_infected
329775 D12_uninfected
329776 D26_infected
329777 D26_uninfected

The two samples with strikeouts above failed sequencing. See the previous post from 20191003 about data delivery for all the info on those two samples.