My next step was to characterize the location of DMLs and DMRs. I did by revising code in a previously existing Jupyter notebook. I counted the number of DMLs and DMRs, as well as the number of overlaps between each of those features with exons, introns, and mRNA coding regions. I exported .csv files with the intersection of these features in this folder. One thing I noticed was that there were equal numbers of hypermethylated and hypomethylated DMLs. These DMLs were primarily in the mRNA coding regions, and more were in exons than introns. However, there were significantly more hypomethylated DMRs than hypermethylated DMRs, and more DMRs in introns than exons. This could hint at some function for alternative splicing.
For my presentation to the Lotterhos lab, I also cherry-picked some genes what were differentially methylated. Cilia and flagella genes, fatty acid desaturation genes, and multidrug resistance protein genes were hypermethylated, while heat shock protein and MAP kinase genes were hypomethylated.
One thing that came up while Steven and I were looking at my results is that Steven is getting way less DMLs and DMRs than I am. We quickly walked through our
methylKit code and found no discrepancies. This is something we will need to sort out!
- I will update the C. virginica gonad methylation paper with reproducible methods and preliminary results. Steven will use this information to reproduce my work and hopefully figure out why we’re getting different results.
- I need to figure out how to do a proper gene enrichment and flanking analysis
- Ready scripts for sample analysis when my Mox job finishes running