Gene OSTLU_38791 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOSTLU_38791 
Symbol 
ID5002134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOstreococcus lucimarinus CCE9901 
KingdomEukaryota 
Replicon accessionNC_009359 
Strand
Start bp288023 
End bp289129 
Gene Length1107 bp 
Protein Length368 aa 
Translation table 
GC content60% 
IMG OID640417555 
Productpredicted protein 
Protein accessionXP_001417727 
Protein GI145346505 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0337] 3-dehydroquinate synthetase 
TIGRFAM ID[TIGR01357] 3-dehydroquinate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0409062 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCG TCGACGTCGT CAACGTCGAT CTGGGCGATC GATCGTACCC GATCTACGTC 
GGGACGGGGC TGCTGGACGA CGGGGACGCG CTGCGCGCGC ACGTCGCGGG GTCGACGGCG
CTCGTGGTGA CGAACGAAAC CATCGCGGGG CTTGGATACC TCGATCGCAC GGTGAAAGCG
CTCACGGCGA AGGATTCGAA ACTGCGCGTG GAGACGGTGG TGCTGCCGGA CGGAGAGGAG
CATAAGAATT TGGAGGTGCT GAACGCGGTG TACACGAGGG CGCTGGAGAC GCGACTCGAC
CGCGGGACGA CGTTCGTGGC GCTGGGGGGG GGCGTGATCG GTGATATGAC GGGATACGCC
GCGGCGTCGT ATCAGCGCGG GGTGAAGTTC GTGCAAATAC CGACGACGGT GATGGCGATG
GTGGATAGCT CGGTGGGGGG GAAGACCGGG GTGAACCACG CGCTCGGGAA GAATATGATC
GGGGCGTTTT ATCAGCCAGA GTGCGTTTTG ATCGATATCG ATTCGTTGAA GACGCTTCCC
GATCGAGAGT TCGCGAGCGG GATCGCAGAG GTGGTGAAAT ACGGTCTCAT TCGCGATGGG
CCGTTTTTCG AATGGCTCGA GGCGAACGTC GATAAGCTTC TCGCGCGCGA TACGCAAGCC
ATCGCGTACG CCGTCGAGCG ATCGTGCGTG AACAAGGCGG AAGTCGTCGC CGCGGATGAG
AGGGAGGGCG GCGTTCGAGC GACGCTGAAT CTTGGGCACA CGTTCGGTCA CGCGATAGAA
ACCGGTCTCG GCTACGGCGA GTGGTTGCAC GGCGAAGCGG TGAGCGCCGG TATGTGTATG
GCGGCGGATA TGTCTCTTCG ACTCGGTTGG ATCGACGCCT CGCTCAAGGA GCGCACGATC
GCCTTATTGA ACAAGTGCAA AACCCCGATC GACGTCCCTG AAAAGATGAC GGTTCAAATG
TTCATGGACT TGATGGCGGT GGATAAGAAG GCTGCGAATG GGAAATTGCG CTTGATTTTG
TTAAAGGGCG AGCTCGGCGA GTGCGTCTTC ACTGGGGACT TCGACCAAAG CAAGCTCCAG
GAAACCTTAG ACGCGTACGT CAAGTAA
 
Protein sequence
MDGVDVVNVD LGDRSYPIYV GTGLLDDGDA LRAHVAGSTA LVVTNETIAG LGYLDRTVKA 
LTAKDSKLRV ETVVLPDGEE HKNLEVLNAV YTRALETRLD RGTTFVALGG GVIGDMTGYA
AASYQRGVKF VQIPTTVMAM VDSSVGGKTG VNHALGKNMI GAFYQPECVL IDIDSLKTLP
DREFASGIAE VVKYGLIRDG PFFEWLEANV DKLLARDTQA IAYAVERSCV NKAEVVAADE
REGGVRATLN LGHTFGHAIE TGLGYGEWLH GEAVSAGMCM AADMSLRLGW IDASLKERTI
ALLNKCKTPI DVPEKMTVQM FMDLMAVDKK AANGKLRLIL LKGELGECVF TGDFDQSKLQ
ETLDAYVK