Gene Information |
Locus tag | P9301_09261 |
Symbol | ispE |
ID | 4912314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | + |
Start bp | 796714 |
End bp | 797649 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640160509 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001091150 |
Protein GI | 126696264 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
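The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one trailing stop codon. Both are assumptions about how this record counts positions, not something the record states. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length(796714, 797649)
print(length)                  # 936
print(protein_length(length))  # 311
```

Under these assumptions the computed values agree with the Gene Length (936 bp) and Protein Length (311 aa) fields.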
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.857167 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGATT TAGCTAAACC GAAAATTAAG ATAAAATCTC CTGCCAAAAT AAATTTGCAC CTTGAAGTTA TTGGTAAAAG AGAGGATGGA TTTCATGAGT TAGCAATGAT TATGCAAAAT ATCGATCTTT CTGATTATTT AGAATTTGAA ATAAATAATG AAGGTTTAAT TAAACTTGAG TCTGATTGTA ATGATTTAAG CTTGTCTGAT GATAACTTAA TTGTTAAATC GGCAAATCTA TTAAGAAAAA ATTCAAATAT AAATTACGGT GCGAATATAT TTTTAAGAAA AAATATTCCA ATTGGCGCAG GATTAGCTGG TGGATCCAGT AATGCAGCAG CAACATTAAT TGGTCTTAAT AAGTTATGGA ATTTGGACTT AGATCATGGA ACATTATGTT CATTAGCATC AACTTTAGGA TCTGATATTC CCTTTTTTAT AAATGGTGGC ATTCAGTTAT GTTTTGGAAG AGGAGAAATT TTGGAGAAAT TAGATTCAAA CTTTGAATAT GGAGTAATTC TTTTAAAAAA TCCAAATGTA TCAGTATCTA CAGCTGAAAC TTATAAAAAA TATAGTAATA GATTTTGTGA TAATCATCTT AATGATAGAA AAATGATTGA GAACATAAGA AAAAATTTAA GGGATAATGG TTTGAATAAA TTAAATTTTG ATAATCAACA TTTATTTATT AAAAATGATT TGCAGTTAGT TGTTGAAAAT GAAAATGATT CTGTAAAGCA GGCATTATAT TTACTTTCTA AACTAGAAAA TTGTCTCACA TTTTCAATGA GTGGATCAGG ACCTACATGC TTTGCACTCT TTAAAGATAT AGAGACTGCA AAAAAAGAAT TAACTGCTAA TTCTAAATTT TTTAAAGATA AAGGCTATGA TTCATGGGTT TGCACTTTCC TTGAAAAGGG AATAACATTC ATTTAA
|
Protein sequence | MQDLAKPKIK IKSPAKINLH LEVIGKREDG FHELAMIMQN IDLSDYLEFE INNEGLIKLE SDCNDLSLSD DNLIVKSANL LRKNSNINYG ANIFLRKNIP IGAGLAGGSS NAAATLIGLN KLWNLDLDHG TLCSLASTLG SDIPFFINGG IQLCFGRGEI LEKLDSNFEY GVILLKNPNV SVSTAETYKK YSNRFCDNHL NDRKMIENIR KNLRDNGLNK LNFDNQHLFI KNDLQLVVEN ENDSVKQALY LLSKLENCLT FSMSGSGPTC FALFKDIETA KKELTANSKF FKDKGYDSWV CTFLEKGITF I
|
| |
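As a rough illustration of how the gene and protein sequences relate, the sketch below strips the 10-base spacing, computes a GC fraction, and translates codons. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons). Only the first 30 bases of the gene sequence are used, to keep the example short. The helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
prefix = "ATGCAAGATT TAGCTAAACC GAAAATTAAG"
print(translate(prefix))   # MQDLAKPKIK
print(gc_content(prefix))  # GC fraction of these 30 bases only
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_content` on the full sequence can be compared with the GC content field.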