Gene Information |
Locus tag | P9211_07121 |
Symbol | ispE |
ID | 5731093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 623928 |
End bp | 624881 |
Gene Length | 954 bp |
Protein Length | 317 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285075 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001550597 |
Protein GI | 159903253 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
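
The coordinate and length fields in this record are related by simple arithmetic. A minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of 3 and that the last codon
    is a stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(623928, 624881)
print(length, protein_length(length))  # 954 317
```

The printed values match the Gene Length (954 bp) and Protein Length (317 aa) fields of the record.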
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.143811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATAAGG TCTCTTCCTC TATAGCAGAG TCTTTGAAAG TTTATGCATC AGCAAAGATA AATTTGCATT TAGAAGTTTT GGGATTACGT AAAGATGGCT TTCATGAGTT AGCTATGGTT ATGCAAAGCA TTGATTTAGT TGATGAGATT GAGATAACTA AGACAAATGA TGAACTGATT AGCCTTAATT CAGATAATCC AGAGTTAGAC AATGGAGATG GAAATTTGAT TATTAAGGCT GCAAAGCTAA TTCGAAGTCG ATCAGGATTA AGGGATTTAG GGGCCTTAAT TTATTTAAGA AAAAAAATTC CTATTGGCGC AGGCTTGGCT GGAGGTTCGA GTGACGGAGC AGCAACTTTG GTAGGTCTTA ACTCTCTTTG GGGACTAAAT TTCTCTAATA ATCAGTTGGA AGATATGGCG GCTGAGCTTG GTTCAGATGT TCCATTTTGC ATCTCAGGGG GAGCTCAATT GTGCTTTGGT CGAGGTGAAT GCCTGGAACC TTTAGATAAA TCAGATCCAA CTTTGGCAAT AGTTCTTGTA AAAGATCCAT CTGTATCTGT GTCCACTCCA TGGGCCTATT CAAGGTACAA GCAATTAAAT GAGAGTACTT ACTTAAGCAA AGAAATTGAT TTCCAAGAGA AGCGAATGGC TCTTAGGAAA GCCTCTTGGT TAAGACCACT TAATGCATCA AACCCTCCTC CTTTGATTAA TGACCTTCAA GAGGTTGTTG CACCAGCCAC TCCAGCGGTT GAGAAAGCTT TGCAATTCCT TCGCTCATTA AAAGGTGTTC TTTCGGTAGC AATGAGTGGA TCAGGCCCAA GCTGCTTTGC AATTTTCTCT GATTTGGATC AGGCTAGAAT TGCTCTTGAG GAGAATCAAG AGGAGCTTCG AAAACAATGC TTAGAAGGCT GGTGTTGTGC TCTTAATTCG AAAGGAGTGA GGTTCGCGAA GTGA
Protein sequence | MNKVSSSIAE SLKVYASAKI NLHLEVLGLR KDGFHELAMV MQSIDLVDEI EITKTNDELI SLNSDNPELD NGDGNLIIKA AKLIRSRSGL RDLGALIYLR KKIPIGAGLA GGSSDGAATL VGLNSLWGLN FSNNQLEDMA AELGSDVPFC ISGGAQLCFG RGECLEPLDK SDPTLAIVLV KDPSVSVSTP WAYSRYKQLN ESTYLSKEID FQEKRMALRK ASWLRPLNAS NPPPLINDLQ EVVAPATPAV EKALQFLRSL KGVLSVAMSG SGPSCFAIFS DLDQARIALE ENQEELRKQC LEGWCCALNS KGVRFAK
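
The sequences in this record are printed in space-separated blocks of ten characters. The sketch below, in plain Python with no external libraries, shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering; the amino-acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Any trailing partial codon is ignored.
    """
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
fragment = clean_sequence("ATGAATAAGG TCTCTTCCTC TATAGCAGAG")
print(translate(fragment))  # MNKVSSSIAE
```

The translated fragment matches the first block of the protein sequence in the record. Applying `gc_fraction` to the full cleaned gene sequence would give the value summarized in the GC content field.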