Gene P9211_07121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_07121 
SymbolispE 
ID5731093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp623928 
End bp624881 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content40% 
IMG OID641285075 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_001550597 
Protein GI159903253 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.143811 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAGG TCTCTTCCTC TATAGCAGAG TCTTTGAAAG TTTATGCATC AGCAAAGATA 
AATTTGCATT TAGAAGTTTT GGGATTACGT AAAGATGGCT TTCATGAGTT AGCTATGGTT
ATGCAAAGCA TTGATTTAGT TGATGAGATT GAGATAACTA AGACAAATGA TGAACTGATT
AGCCTTAATT CAGATAATCC AGAGTTAGAC AATGGAGATG GAAATTTGAT TATTAAGGCT
GCAAAGCTAA TTCGAAGTCG ATCAGGATTA AGGGATTTAG GGGCCTTAAT TTATTTAAGA
AAAAAAATTC CTATTGGCGC AGGCTTGGCT GGAGGTTCGA GTGACGGAGC AGCAACTTTG
GTAGGTCTTA ACTCTCTTTG GGGACTAAAT TTCTCTAATA ATCAGTTGGA AGATATGGCG
GCTGAGCTTG GTTCAGATGT TCCATTTTGC ATCTCAGGGG GAGCTCAATT GTGCTTTGGT
CGAGGTGAAT GCCTGGAACC TTTAGATAAA TCAGATCCAA CTTTGGCAAT AGTTCTTGTA
AAAGATCCAT CTGTATCTGT GTCCACTCCA TGGGCCTATT CAAGGTACAA GCAATTAAAT
GAGAGTACTT ACTTAAGCAA AGAAATTGAT TTCCAAGAGA AGCGAATGGC TCTTAGGAAA
GCCTCTTGGT TAAGACCACT TAATGCATCA AACCCTCCTC CTTTGATTAA TGACCTTCAA
GAGGTTGTTG CACCAGCCAC TCCAGCGGTT GAGAAAGCTT TGCAATTCCT TCGCTCATTA
AAAGGTGTTC TTTCGGTAGC AATGAGTGGA TCAGGCCCAA GCTGCTTTGC AATTTTCTCT
GATTTGGATC AGGCTAGAAT TGCTCTTGAG GAGAATCAAG AGGAGCTTCG AAAACAATGC
TTAGAAGGCT GGTGTTGTGC TCTTAATTCG AAAGGAGTGA GGTTCGCGAA GTGA
 
Protein sequence
MNKVSSSIAE SLKVYASAKI NLHLEVLGLR KDGFHELAMV MQSIDLVDEI EITKTNDELI 
SLNSDNPELD NGDGNLIIKA AKLIRSRSGL RDLGALIYLR KKIPIGAGLA GGSSDGAATL
VGLNSLWGLN FSNNQLEDMA AELGSDVPFC ISGGAQLCFG RGECLEPLDK SDPTLAIVLV
KDPSVSVSTP WAYSRYKQLN ESTYLSKEID FQEKRMALRK ASWLRPLNAS NPPPLINDLQ
EVVAPATPAV EKALQFLRSL KGVLSVAMSG SGPSCFAIFS DLDQARIALE ENQEELRKQC
LEGWCCALNS KGVRFAK