Sam’s Notebook: RNA Isolation and Quantification – C.bairdi Hemocyte Pellets in RNAlater

Isolated RNA from the following hemolymph pellet samples:

  • 6202_337_12
  • 6142_474_26
  • 6144_477_26
  • 6122_489_26
  • 6176_490_26
  • 6136_491_26
  • 6137_500_26
  • 6125_503_26
  • 6145_505_26
  • 6131_507_26
  • 6159_508_26
  • 6206_509_26
  • 6150_511_26
  • 6161_512_26
  • 6115_514_26
  • 6179_515_26
  • 6172_517_26

Isolated RNA using the Quick DNA/RNA Microprep Kit (ZymoResearch; PDF) according to the manufacturer’s protocol for liquids/cells in RNAlater.

  • Used 35uL from each RNAlater/hemocyte slurry.
  • Mixed with equal volume of H2O (35uL).
  • Retained DNA on the Zymo-Spin IC-XM columns for isolation after RNA isolation.
  • Performed on-column DNase step.
  • RNA was eluted in 15uL H2O

RNA was quantified on the Roberts Lab Qubit 3.0 using the RNA High Sensitivity Assay (Invitrogen), using 2uL of each sample.

RNA was stored in the [-80o</sup>C freezer](http://b.link/srlab-80C) in Shellfish Box #8.

Sam’s Notebook: RNA Isolation and Quantification – C.bairdi Hemocyte Pellets in RNAlater

Isolated RNA from the following hemolymph pellet samples:

  • 6174_233_12
  • 6258_261_12
  • 6270_269_12
  • 6161_302_12
  • 6115_307_12
  • 6206_319_12
  • 6241_323_12
  • 6158_324_12
  • 6272_338_12
  • 6152_344_12
  • 6275_372_12
  • 6218_373_12
  • 6153_404_26
  • 6132_406_26
  • 6162_407_26
  • 6178_410_26
  • 6163_424_26
  • 6140_429_26
  • 6174_433_26
  • 6199_445_26
  • 6156_446_26
  • 6106_450_26
  • 6151_454_26
  • 6177_456_26
  • 6119_461_26
  • 6124_464_26
  • 6134_465_26
  • 6168_466_26
  • 6152_467_26
  • 6121_472_26
  • 6205_475_26
  • 6155_476_26
  • 6143_479_26
  • 6158_483_26
  • 6169_486_26

Isolated RNA using the Quick DNA/RNA Microprep Kit (ZymoResearch; PDF) according to the manufacturer’s protocol for liquids/cells in RNAlater.

  • Used 35uL from each RNAlater/hemocyte slurry.
  • Mixed with equal volume of H2O (35uL).
  • Retained DNA on the Zymo-Spin IC-XM columns for isolation after RNA isolation.
  • Performed on-column DNase step.
  • RNA was eluted in 15uL H2O

RNA was quantified on the Roberts Lab Qubit 3.0 using the RNA High Sensitivity Assay (Invitrogen), using 2uL of each sample.

RNA was stored in the [-80o</sup>C freezer](http://b.link/srlab-80C) in Shellfish RNA Box #7 and Shellfish Box #8.

Sam’s Notebook: Transdecoder – Hematodinium MEGAN6 Taxonomic-Specific Reads Assembly from 20200122

Ran Trinity to de novo assembly on the the C.bairdi MEGAN6 taxonomic-specific RNAseq data on 201200122 and now will begin annotating the transcriptome using TransDecoder on Mox.

SBATCH script (GitHub):

#!/bin/bash ## Job Name #SBATCH --job-name=transdecoder_hemat ## Allocation Definition #SBATCH --account=srlab #SBATCH --partition=srlab ## Resources ## Nodes #SBATCH --nodes=1 ## Walltime (days-hours:minutes:seconds format) #SBATCH --time=10-00:00:00 ## Memory per node #SBATCH --mem=120G ##turn on e-mail notification #SBATCH --mail-type=ALL #SBATCH --mail-user=samwhite@uw.edu ## Specify the working directory for this job #SBATCH --chdir=/gscratch/scrubbed/samwhite/outputs/20200123_hemat_transdecoder_megan # Exit script if a command fails set -e # Load Python Mox module for Python module availability module load intel-python3_2017 # Document programs in PATH (primarily for program version ID) { date echo "" echo "System PATH for $SLURM_JOB_ID" echo "" printf "%0.s-" {1..10} echo "${PATH}" | tr : \\n } >> system_path.log # Set workind directory as current directory wd="$(pwd)" # Capture date as YYYYMMDD timestamp=$(date +%Y%m%d) # Set input file locations and species designation trinity_fasta="/gscratch/srlab/sam/data/Hematodinium/transcriptomes/20200122.hemat.megan.Trinity.fasta" trinity_gene_map="/gscratch/srlab/sam/data/Hematodinium/transcriptomes/20200122.hemat.megan.Trinity.fasta.gene_trans_map" species="hemat" # Capture trinity file name trinity_fasta_name=${trinity_fasta##*/} # Paths to input/output files blastp_out_dir="${wd}/blastp_out" transdecoder_out_dir="${wd}/${trinity_fasta_name}.transdecoder_dir" pfam_out_dir="${wd}/pfam_out" blastp_out="${blastp_out_dir}/${timestamp}.${species}.blastp.outfmt6" pfam_out="${pfam_out_dir}/${timestamp}.${species}.pfam.domtblout" lORFs_pep="${transdecoder_out_dir}/longest_orfs.pep" pfam_db="/gscratch/srlab/programs/Trinotate-v3.1.1/admin/Pfam-A.hmm" sp_db="/gscratch/srlab/programs/Trinotate-v3.1.1/admin/uniprot_sprot.pep" # Paths to programs blast_dir="/gscratch/srlab/programs/ncbi-blast-2.8.1+/bin" blastp="${blast_dir}/blastp" hmmer_dir="/gscratch/srlab/programs/hmmer-3.2.1/src" hmmscan="${hmmer_dir}/hmmscan" transdecoder_dir="/gscratch/srlab/programs/TransDecoder-v5.5.0" transdecoder_lORFs="${transdecoder_dir}/TransDecoder.LongOrfs" transdecoder_predict="${transdecoder_dir}/TransDecoder.Predict" # Make output directories mkdir "${blastp_out_dir}" mkdir "${pfam_out_dir}" # Extract long open reading frames "${transdecoder_lORFs}" \ --gene_trans_map "${trinity_gene_map}" \ -t "${trinity_fasta}" # Run blastp on long ORFs "${blastp}" \ -query "${lORFs_pep}" \ -db "${sp_db}" \ -max_target_seqs 1 \ -outfmt 6 \ -evalue 1e-5 \ -num_threads 28 \ > "${blastp_out}" # Run pfam search "${hmmscan}" \ --cpu 28 \ --domtblout "${pfam_out}" \ "${pfam_db}" \ "${lORFs_pep}" # Run Transdecoder with blastp and Pfam results "${transdecoder_predict}" \ -t "${trinity_fasta}" \ --retain_pfam_hits "${pfam_out}" \ --retain_blastp_hits "${blastp_out}" 

Sam’s Notebook: Transdecoder – C.bairdi MEGAN6 Taxonomic-Specific Reads Assembly from 20200122

Ran Trinity to de novo assembly on the the Hematodinium MEGAN6 taxonomic-specific RNAseq data on 201200122 and now will begin annotating the transcriptome using TransDecoder on Mox.

SBATCH script (GitHub):

#!/bin/bash ## Job Name #SBATCH --job-name=transdecoder_cbai ## Allocation Definition #SBATCH --account=srlab #SBATCH --partition=srlab ## Resources ## Nodes #SBATCH --nodes=1 ## Walltime (days-hours:minutes:seconds format) #SBATCH --time=10-00:00:00 ## Memory per node #SBATCH --mem=120G ##turn on e-mail notification #SBATCH --mail-type=ALL #SBATCH --mail-user=samwhite@uw.edu ## Specify the working directory for this job #SBATCH --chdir=/gscratch/scrubbed/samwhite/outputs/20200123_cbai_transdecoder_megan # Exit script if a command fails set -e # Load Python Mox module for Python module availability module load intel-python3_2017 # Document programs in PATH (primarily for program version ID) { date echo "" echo "System PATH for $SLURM_JOB_ID" echo "" printf "%0.s-" {1..10} echo "${PATH}" | tr : \\n } >> system_path.log # Set workind directory as current directory wd="$(pwd)" # Capture date as YYYYMMDD timestamp=$(date +%Y%m%d) # Set input file locations and species designation trinity_fasta="/gscratch/srlab/sam/data/C_bairdi/transcriptomes/20200122.C_bairdi.megan.Trinity.fasta" trinity_gene_map="/gscratch/srlab/sam/data/C_bairdi/transcriptomes/20200122.C_bairdi.megan.Trinity.fasta.gene_trans_map" species="cbai" # Capture trinity file name trinity_fasta_name=${trinity_fasta##*/} # Paths to input/output files blastp_out_dir="${wd}/blastp_out" transdecoder_out_dir="${wd}/${trinity_fasta_name}.transdecoder_dir" pfam_out_dir="${wd}/pfam_out" blastp_out="${blastp_out_dir}/${timestamp}.${species}.blastp.outfmt6" pfam_out="${pfam_out_dir}/${timestamp}.${species}.pfam.domtblout" lORFs_pep="${transdecoder_out_dir}/longest_orfs.pep" pfam_db="/gscratch/srlab/programs/Trinotate-v3.1.1/admin/Pfam-A.hmm" sp_db="/gscratch/srlab/programs/Trinotate-v3.1.1/admin/uniprot_sprot.pep" # Paths to programs blast_dir="/gscratch/srlab/programs/ncbi-blast-2.8.1+/bin" blastp="${blast_dir}/blastp" hmmer_dir="/gscratch/srlab/programs/hmmer-3.2.1/src" hmmscan="${hmmer_dir}/hmmscan" transdecoder_dir="/gscratch/srlab/programs/TransDecoder-v5.5.0" transdecoder_lORFs="${transdecoder_dir}/TransDecoder.LongOrfs" transdecoder_predict="${transdecoder_dir}/TransDecoder.Predict" # Make output directories mkdir "${blastp_out_dir}" mkdir "${pfam_out_dir}" # Extract long open reading frames "${transdecoder_lORFs}" \ --gene_trans_map "${trinity_gene_map}" \ -t "${trinity_fasta}" # Run blastp on long ORFs "${blastp}" \ -query "${lORFs_pep}" \ -db "${sp_db}" \ -max_target_seqs 1 \ -outfmt 6 \ -evalue 1e-5 \ -num_threads 28 \ > "${blastp_out}" # Run pfam search "${hmmscan}" \ --cpu 28 \ --domtblout "${pfam_out}" \ "${pfam_db}" \ "${lORFs_pep}" # Run Transdecoder with blastp and Pfam results "${transdecoder_predict}" \ -t "${trinity_fasta}" \ --retain_pfam_hits "${pfam_out}" \ --retain_blastp_hits "${blastp_out}"