Gene Set Enrichment Analysis Workflow:
- Get Entrez Gene IDs
- Match IDs with GOterms
- Use both topGO and DAVID for enrichment
Problem:
- The gene IDs found in the C. virginica GFF files are not official, NCBI Entrez Gene IDs. Not sure what LOC{} is, but XM_{} are Genbank IDs. Genbank IDs from the GFF were not recognized by DAVID
Solution:
- blastx to get Uniprot accession codes and GOterms
- Use Uniprot and GOterms in DAVID
- Convert Uniprot accession codes to Entrez IDs
- Use Entrez IDs and GOterms in DAVID