This is the code I used:
The file (raw: is kind of crazy now. I want to figure out how to match the data within the spreadsheet between columns.
This file is all the samples that have:
- 3 samples per crab
- 20ng/µL RNA or higher
- Pam’s new data from qPCR (but only for the samples that have 3 samples/crab and 20ng/µL RNA or higher)
The file has three columns that I want to compare to see if the infection status of the samples has changed from what was originally determined by conventional PCR.
The initial column is “infection_status”. This column has “0” for uninfected and “1” for infected, based on conventional PCR.
A second column is “pos_neg_no_quant” and has “POS” for infected, and “NEG” for uninfected, and “0” as a placeholder for the empty cells.
A third column is “pos_neg” and has “pos” for infected, and “neg” for uninfected and “0” as a placeholder for empty cells.
I can’t really figure out based on Pam’s email responses to my questions what the difference between POS and pos and NEG and neg are… I’ll show Sam what she sent me and maybe we can figure it out together on Wednesday.
But, by the end of this week for sure, I will have a visual of all the good samples that I have within the different categories of treatments (infected, uninfected: cold, amb, and warm) and at all three sample dates so that I will know whether I need to isolate more samples, or if we can start preparing more seriously for sending them for sequencing.