Gene OSTLU_37993 details


Gene Information

Locus tag: OSTLU_37993
Symbol:
ID: 5003948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ostreococcus lucimarinus CCE9901
Kingdom: Eukaryota
Replicon accession: NC_009364
Strand:
Start bp: 586719
End bp: 587744
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table:
GC content: 61%
IMG OID: 640419369
Product: predicted protein
Protein accession: XP_001420043
Protein GI: 145351349
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
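
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based, inclusive positions, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 586719
end_bp = 587744
reported_gene_length = 1026      # bp
reported_protein_length = 341    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is an unspliced CDS ending in a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1026 0 341
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length.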


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.144035
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGGTA AGCAATGCAA GGTTGCCATC AACGGTTTCG GCCGCATCGG CCGCAACTTC 
TTGCGATGCT GGCACGGACG CGCCAACACC ATGCTCGACA TCGTCGCCAT CAACGACTCG
GGCGGAGTGA AGCAAGCGAG CCACTTGGTC AAGTACGACT CCGTCCTCGG CACGTTCGAG
GCGGATGTCA AGATCATCGA CGACACGCAC ATCTCCATCG ACGGCAAGTC CATCGAGATT
GTGTCTTCCC GTGACCCGCT CCAGTTGCCG TGGAAGGCTC TCGGCGTCGA CATCGTCATT
GAAGGTACCG GCGTCTTCAT CGACACCCCG GGCGCCTCCA AGCACTTGAC CGCGGGCGCC
AAGAAGGTTG TCATCACGGC CCCGGCCAAG GGTGACGACA TCCCGACCTA CGTCCTCGGT
GTCAACGCCG ACCAGTACAA GAACACCGAC AAGATCGTCT CCAACGCGTC GTGCACGACC
AACGGCCTCG CGCCGTTCGT CAAGGTTCTC GACGACCGAT TCGGCATCGT CAAGGGTTTG
ATGACCACCA CGCACTCCTA CACCGGTGAC CAGCGCATTT TGGATGCGTC TCACCGTGAC
TTGCGCCGCG CTCGCGCCGC CGCCTTGAAC ATCGTGCCGA CCTCCACCGG CGCCGCCAAG
GCTGTCGCGC TCGTCTTGCC GCAACTCAAG GGCAAGCTCA ACGGCATCGC GCTCCGCGTC
CCGACGCCGA ACGTGTCCGT CGTCGATCTC GTCATCCAAA CCTCCAAGAA GGTCACCGCC
GACGAAGTCA ACGCCGCGTT CCGTGAAGAA GCCGCCGGCA AGCTCAAGGG TATCCTCGCC
GTCGCCGACG AGCCGCTCGT GTCTTGCGAT TTCAAGTGCT CCGACGTCTC CACGTCCATC
GACGCCGCGC TCACCATGGT CATGGGTGAC GACATGTTGA AGGTTGTCGC GTGGTATGAC
AACGAGTGGG GCTATTCGCA ACGCGTAGTG GACTTGGCGG AATTATGCGC AGCAAACTGG
GAATGA
 
Protein sequence
MKGKQCKVAI NGFGRIGRNF LRCWHGRANT MLDIVAINDS GGVKQASHLV KYDSVLGTFE 
ADVKIIDDTH ISIDGKSIEI VSSRDPLQLP WKALGVDIVI EGTGVFIDTP GASKHLTAGA
KKVVITAPAK GDDIPTYVLG VNADQYKNTD KIVSNASCTT NGLAPFVKVL DDRFGIVKGL
MTTTHSYTGD QRILDASHRD LRRARAAALN IVPTSTGAAK AVALVLPQLK GKLNGIALRV
PTPNVSVVDL VIQTSKKVTA DEVNAAFREE AAGKLKGILA VADEPLVSCD FKCSDVSTSI
DAALTMVMGD DMLKVVAWYD NEWGYSQRVV DLAELCAANW E
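
The relationship between the gene sequence and the protein sequence listed here can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce this record. It assumes the standard genetic code (NCBI translation table 1), since the Translation table field is empty, and it assumes the sequence is shown in the coding orientation. The helper names clean, gc_fraction and translate are hypothetical.

```python
from itertools import product

# Assumption: standard genetic code (NCBI table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spacing used in the 10-base display blocks."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three display blocks of the gene sequence.
first_blocks = "ATGAAGGGTA AGCAATGCAA GGTTGCCATC"
print(translate(first_blocks))  # prints: MKGKQCKVAI
```

Passing the full gene sequence to gc_fraction and translate would allow the GC content and Protein Length fields to be compared against the sequence itself.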