Gene Information |
Locus tag | ECH_0757 |
Symbol | ispE |
ID | 3927481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 763133 |
End bp | 763981 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637901875 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_507555 |
Protein GI | 88657607 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
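The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a gene length that includes the stop codon; both assumptions fit the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(763133, 763981)
print(length)                  # 849
print(protein_length(length))  # 282
```
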
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0523399 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAAGT TTTTAGTTAA AGCACCAGCA AAAGTTAATT TATTTTTGCA TATTACTGGT AAACGGAGTG ATCAACATCA TTACTTAGAG TCATTGTTTG TCTTTGTTAA TGTTTACGAT ATCTTAGAGG TTGATGTTGG TGGTTCGAAA CGGGGAGTAT ATTTCTCAAA TCTGAGGATT AGCAAATATA ATAATACTGT ATATAAAGCA ATAGAGTTGT TATTGAAACA CAGTGCTGTA TGTCCTAATG TTTCTGTGAG TATTATAAAA AATATTTTAG TATCTGCAGG TTTAGCAGGT GGATCAGCTG ATGCTGCTGC TATTATGCGT TTACTAGGTA ATATGTGGAA CATTGATTAC ACATTGTTAC AGGATTTAGC CTTAAAAATT GGAAGTGATG TTCCAGCATG TTTGGAATCA AAGACCCTTT TTGCTAAAGG GGTAGGGGAA GATATATTGT TATTACCTGA TTTGTTATTG CCAAAATATA TTATACTTGT TGCTCCAAGG GGAAAGACGT TAAGTACAGC AAAGGTCTTT AATAATTACC AGAGTGCTAC TTACTCTCCT TCCATATGTG ATAAGCTTCC TGTAAAGCAA GATGATTGGA TGGAATTAAT TTGTAATGCT AAAAATGATT TATTAGAAGT AGCGTTGAAA TTTGTTCCTG AGATAGAAGA AATATTATTT GTATTAAAGC AATTAAAAAA TTCTGTAATT GCTCGTATGA CTGGTAGTGG AGCAACTTGT TTTGCTTTAT TTAATGAATT AAGTCATGCT GAAGATGCAG CTAGAAAATT GCAGATGACA CGTCCTGATT GGATAATTTT TAATGCTAAA ATACTTTGA
|
Protein sequence | MLKFLVKAPA KVNLFLHITG KRSDQHHYLE SLFVFVNVYD ILEVDVGGSK RGVYFSNLRI SKYNNTVYKA IELLLKHSAV CPNVSVSIIK NILVSAGLAG GSADAAAIMR LLGNMWNIDY TLLQDLALKI GSDVPACLES KTLFAKGVGE DILLLPDLLL PKYIILVAPR GKTLSTAKVF NNYQSATYSP SICDKLPVKQ DDWMELICNA KNDLLEVALK FVPEIEEILF VLKQLKNSVI ARMTGSGATC FALFNELSHA EDAARKLQMT RPDWIIFNAK IL
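The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record. It assumes the gene sequence is listed in coding orientation (it starts with ATG and ends with TGA, although the gene lies on the minus strand), and it relies on translation table 11 assigning amino acids to codons exactly as the standard code does. The function names are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order. Translation table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate coding-strand DNA, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final block of the gene sequence in this record.
print(translate("ATGTTAAAGT TTTTAGTTAA AGCACCAGCA"))  # MLKFLVKAPA
print(translate("ATACTTTGA"))                         # IL
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the Protein sequence and GC content fields.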