Shelly’s Notebook: Apr 15 – May 3, 2019 Geoduck Broodstock hemolymph WGBS

Bismark alignments

I ran this job on mox to align reads to Pgenerosa_v071.

  • Raw fastqs live here and FastQC generated files are here
  • Sam ran FastQC here
  • Sam mentioned Genewiz recommends “Trimming 10 bases from the beginning of both R1 and R2 following adapter trimming eliminates the majority of Adaptase tails.”


  • Instead of running Trimmomatic or following Genewiz’s recommendations, I tried a crude trim, removing the first 6 characters (which seemed to be lower quality) from each read before running the alignments. See the mox job for code.
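To make the crude trim concrete, here is a minimal Python sketch of the idea. It is not the code from the mox job (which is not reproduced in this entry); the function name `crude_trim` and the example record are hypothetical, and it assumes plain 4-line FASTQ records.

```python
from typing import Iterable, Iterator

TRIM = 6  # number of leading characters to drop from each read


def crude_trim(lines: Iterable[str], n: int = TRIM) -> Iterator[str]:
    """Yield FASTQ lines with the first n bases and quality scores removed.

    Assumes 4-line records: header, sequence, '+', quality.
    """
    for i, line in enumerate(lines):
        line = line.rstrip("\n")
        # The 2nd and 4th line of each record hold sequence and qualities;
        # both must be trimmed so their lengths stay equal.
        if i % 4 in (1, 3):
            line = line[n:]
        yield line


# Made-up record for illustration only.
record = ["@read1", "ACGTACGTAAGG", "+", "IIIIIIFFFFFF"]
trimmed = list(crude_trim(record))
print(trimmed)
```

Trimming the sequence and quality lines together matters: if only the sequence were shortened, the record would no longer be valid FASTQ.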

Running Bismark alignment and methylation extractor

Running cytosine coverage

  • Because I didn’t include the genome-wide cytosine coverage option when I initially ran Bismark, I ran coverage2cytosine afterward with this script on mox

Calculating coverage

I adapted Sam’s new coverage script to work on my data in this jupyter notebook: 20190503_Pgnr_comparison.ipynb.

It worked! And generated this plot:

Coverage of genome-wide cytosines is surprisingly good. Tank 2 got about 66M reads and Tank 3 got about 50M reads. The genome is about 2.4 Gb. If the genome were evenly fragmented at 150 bp and got even coverage, you would need 16M reads to cover the genome 1x. So it’s possible that the depth these libraries got was enough to cover most cytosines.
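The back-of-the-envelope numbers above can be checked with a short Python sketch. It assumes every read is a full 150 bp and maps uniquely and evenly, which real bisulfite data will not do, so the depths are upper bounds rather than measured values. The function names are illustrative only.

```python
GENOME_BP = 2.4e9  # approximate genome size from the text
READ_BP = 150      # assumed fragment/read length from the text


def reads_for_1x(genome_bp: float = GENOME_BP, read_bp: float = READ_BP) -> float:
    """Number of reads needed to tile the genome exactly once."""
    return genome_bp / read_bp


def expected_depth(n_reads: float, genome_bp: float = GENOME_BP,
                   read_bp: float = READ_BP) -> float:
    """Idealized mean depth if all reads map evenly across the genome."""
    return n_reads * read_bp / genome_bp


needed = reads_for_1x()            # 16 million reads for 1x
depth_tank2 = expected_depth(66e6)  # about 4.1x
depth_tank3 = expected_depth(50e6)  # about 3.1x
print(f"{needed:,.0f} reads for 1x")
print(f"Tank 2: {depth_tank2:.2f}x, Tank 3: {depth_tank3:.2f}x")
```

Under these idealized assumptions both libraries land at roughly 3-4x, which is consistent with most cytosines being covered at least once.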

Next steps:

Run MethylKit

  • As a first pass, run the untrimmed alignments through MethylKit to see what’s different

Trim properly and rerun alignments and coverage analysis

I should probably rerun the alignments with the recommended trimming and see what happens.

Check unmapped reads

I don’t think that Bismark outputs unmapped reads unless you specify it in your initial code:

But when I run the alignments again, I’ll specify that unmapped reads be output.

Shelly’s Notebook: May 2, 2019 Geoduck larvae at Pt. Whitney

Check on heath stack setup

  • food still dicey, sometimes the food is unevenly distributed.
    • ordered a new pump head today and they are sending some sample tubing to see if it will solve the problem (see below for more details).
    • animals looked good but there was some algal flocculation

Algal counts

  • Matt recommended setters should get a concentration of 100K/mL, but not more than that.
  • Inflow algal counts direct from the feeding tube were:
    • H2 (ambient): 4.73 x 10^6 cells/mL
    • H1 (ambient/low pH): 4.8 x 10^6 cells/mL
  • Outflow algal counts direct from the feeding tube were:
    • H2 (ambient): 5.3 x 10^5 cells/mL
    • H1 (ambient): 3.8 x 10^5 cells/mL
  • Flow rate of the feeding tube is 10 mL/13 seconds = 0.77 mL/second = 46.2 mL/minute = 66,528 mL/day, which at 5.3 x 10^5 cells/mL (the H2 count above) works out to 35,260M cells/day
  • FAO manual recommends 0.4 mg dry algae per mg spat per week – 10^6 Iso cells = 0.02 mg, so 20M cells/week or 2.857M cells/day per mg spat
  • Setter weight ~= 0.07 mm^3 = 0.07 mg? – H2 has ~52,000 animals = 3,640 mg spat – H1 ambient has ~60,000 animals = 4,200 mg spat


  • Ambient should be getting ~22,400M cells/day (2.857M cells/day per mg spat * 7,840 mg total spat)
  • They are getting about 1.6X as much (35,260M vs. 22,400M cells/day)
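The feeding arithmetic above is easy to get wrong by a factor of 1,000 in the units, so here is a minimal Python sketch that carries milligrams and cells explicitly. It assumes the 0.07 mg per setter guess is right and uses the 5.3 x 10^5 cells/mL count, which is the value that reproduces the 35,260M cells/day figure. Variable names are illustrative. Using the unrounded flow rate (10/13 mL/s) gives slightly smaller numbers than the rounded 0.77 mL/s used by hand.

```python
SECONDS_PER_DAY = 86_400

# Delivery: feeding-tube flow rate times algal concentration.
flow_ml_per_day = (10 / 13) * SECONDS_PER_DAY      # ~66,460 mL/day
cells_per_ml = 5.3e5                               # assumed count (see lead-in)
delivered_cells_per_day = flow_ml_per_day * cells_per_ml

# Requirement: 0.4 mg dry algae per mg spat per week (FAO manual figure),
# with 10^6 Iso cells weighing 0.02 mg.
cells_per_mg_algae = 1e6 / 0.02                    # 50M cells per mg dry algae
required_per_mg_spat_per_day = 0.4 * cells_per_mg_algae / 7

# Spat mass: animal counts times the assumed 0.07 mg per setter.
spat_mg = (52_000 + 60_000) * 0.07                 # 7,840 mg in total
required_cells_per_day = required_per_mg_spat_per_day * spat_mg

ratio = delivered_cells_per_day / required_cells_per_day
print(f"delivered: {delivered_cells_per_day / 1e6:,.0f}M cells/day")
print(f"required:  {required_cells_per_day / 1e6:,.0f}M cells/day")
print(f"ratio:     {ratio:.2f}x")
```

With these inputs the animals receive somewhat more than the recommended ration; the result scales directly with the per-setter weight, which is the least certain number here.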


Water chem

Kaitlyn and Eileen ran TA on H1 water samples. TA data here and discrete chem data here

  • Still need to run poisoned samples from H1T5-T8 from 4/19 and from 5/2. We can do this next week.

