Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_09281 |
Symbol | ispE |
ID | 4717635 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | + |
Start bp | 798259 |
End bp | 799194 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640078641 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_001009319 |
Protein GI | 123968461 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGATT TTGCTAAAAA GAAAATTAAT ATAAAATCTC CTGCCAAAAT AAATTTGCAC CTTGAAGTGA TTGGTAAAAG AGAGGATGGA TTTCACGAGT TAGCAATGAT TATGCAAAAT ATCGATCTTG CTGATTATTT AGAATTTGAA ATTAATAATG AAGGTTTAAT TAAACTTGAG TCTGATTGTA ATGATTTAAG CCTATCTGAT GATAACTTAA TTGTTAAATC GGCAAACCTA TTAAGGAAAA AATCAAATAT AGATTACGGT GCGAATATAT TTTTAAGAAA AAATATCCCA ATTGGTGCAG GATTAGCTGG TGGATCCAGT AATGCAGCAG CAACATTAAT TGGTCTTAAT AATTTATGGG ATTTGAAATT AGATCAAGAA ACTTTATGTT CATTAGCATC AACTTTAGGA TCTGATATTC CCTTTTTTAT AAATGGTGGT ATTCAATTAT GTTTTGGAAG AGGCGAAATT TTGGAGAAAT TAGATTCAAC CCTTGAATAT GGAGCAATTC TTTTAAAAAA TCCTAATGTA TCAGTATCCA CAGCTGAAAC TTATAAAAAA TATAGTAATA GATTTTGTGA TCAATATCTT ACTGATAGAG AAATGATTGA GAACATAAGA AAAAATTTAA GAGATAATGG TTTAAATAAC TTAAATTTTG ATAATCAACA TTTATCTATT AAAAATGATT TGCAGTTAGT TGTTGAAAAT GAAAATGATT CTGTAAAGCA GGCATTATAT TTACTTTCTA AATTAGAAAA TTGTCTAACA TTTTCAATGA GTGGATCAGG ACCTACATGC TTTGCACTCT TTAAAGATAA AGAGACTGCT AAAAAAGAAT TAACTGCAAA TTCTAAATTA TTTAAAGATA AAGGCTATGA TTCATGGGTT TGCACTTTCC TTGAAAAGGG AATAACATTC ATATAA
|
Protein sequence | MQDFAKKKIN IKSPAKINLH LEVIGKREDG FHELAMIMQN IDLADYLEFE INNEGLIKLE SDCNDLSLSD DNLIVKSANL LRKKSNIDYG ANIFLRKNIP IGAGLAGGSS NAAATLIGLN NLWDLKLDQE TLCSLASTLG SDIPFFINGG IQLCFGRGEI LEKLDSTLEY GAILLKNPNV SVSTAETYKK YSNRFCDQYL TDREMIENIR KNLRDNGLNN LNFDNQHLSI KNDLQLVVEN ENDSVKQALY LLSKLENCLT FSMSGSGPTC FALFKDKETA KKELTANSKL FKDKGYDSWV CTFLEKGITF I
|
| |