Previously, we “received” this data, but it turns out it was incomplete (see 20191003).
Today, we finally received all the RNAseq data (>50M reads per samples) back from NWGC that we submitted on 20190521!
The second round of data is in addition to the data we received on 20191003. So, to simplify some of the data management and downstream processing of these files, I decided to concatenate the two sets of file. Concatenation is documented in this Jupyter Notebook (GitHub):
Here’s a table with the library names and the FastQ naming schemes.
|NWGC Sample ID||Investigator Sample ID|
The two samples with strikeouts above failed sequencing. See the previous post from 20191003 about data delivery for all the info on those two samples.