Gene Information |
Locus tag | ECH_0559 |
Symbol | ispG |
ID | 3927424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 562145 |
End bp | 563374 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637901681 |
Product | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase |
Protein accession | YP_507371 |
Protein GI | 88657947 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis |
TIGRFAM ID | [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase |
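The length fields above follow from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported 1230 bp. It also assumes the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 562145
END_BP = 563374


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1230 409
```

Both results agree with the Gene Length and Protein Length rows.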
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.891821 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGTTTAATT GTATTGCAAA TAAAAATAAA ATAACCTATG AAGTGAAAGT TGGTGATGTA GTAATTGGTG GCAACAATCC TGTAGTAGTG CAGTCAATGG CGTTAGGCGG ATCTGGAGAT GTGTATAAAG ATGCACATGA GGTTTTAGAG TTAGCGCAAG CTGGATCTGA ATTGGTAAGA GTTGCAGTTA ATTCAGAACA AGCTATGAAA AATGTTCCGT ATATAAGAGA TGTATTGGTA GATCATGGCT TTAGTGCTAA GATGATAATA GGATGTGGAC AATATGAAAT TGCTAGATTG GTAAATGAGT ATCCTGATTG TGCAGCTGCT TTAGGAAAAA TACGTATTAA TCCAGGAAAT GTTGGTTTTG GAAATAAACG GGATAAGAAT TTTGAAGATA TTGTTGAGTT TGCAATAAAA CATGATATCC CTATCAGAAT AGGTGTAAAT TGGGGGAGTT TAGATAAGTA TTTAGCTTCA AAATTAATGA ATGATAATGC ATTACTTATC AACCCTAAGC CAGATTATAT AGTTTTGCAG AAAGCATTGG TAATTTCTGC TATAACAAGT GCTAAACGTG CAGAAGAAAT TGGCTTATCT AAAAATAAGA TAGTTATATC TTGTAAAACA AGTAAAATAC AAGATTTAAT ACCTGTTTAT ACAGTATTGT CAAATGTATG TAATTATCCA TTACATTTAG GGTTGACAGA AGCAGGGTCT GGTACAAAAG GAATGGTTAG CAGTGCTGCA GGAATATCTT ACTTATTGTT AAATGGTATA GGAGATACTA TACGTGTTTC CTTAACTCAA CAACCTGGTG AAGCAAGAAG TATTGAAGTC AAGTTATGTC AAGAAATTTT GCAAAGTATA GGTTTAAGAA ATTTTTCTGC GCAGGTAACT TCATGTCCAG GTTGTAATAG AACTAATCCT AAGTATTTTC ACCAATTAGC TAAAGATATT AATGATTATA TAAAGCAACG TATGCCTGTG TGGAGAAATG ATAATCCCGG ATCTGAAAAT ATGACTGTAG CAGTAATGGG TTGTATAGTC AATGGTCCAG GTGAAAGTAA ACACGCAAAT TTAGGTATTA GTCTTCCTGG CTATGGTGAG AGGCCTGTAG CTGCAGTGTA TCAGAATGGA GAGAAGTTGT GTACTTTAGA AGGCGGTAAT ATCTTTGAAC AATTTGTATC AATTATCGAA AATTATGTTA ATGTTTATTA CAAACAATAG
|
Protein sequence | MFNCIANKNK ITYEVKVGDV VIGGNNPVVV QSMALGGSGD VYKDAHEVLE LAQAGSELVR VAVNSEQAMK NVPYIRDVLV DHGFSAKMII GCGQYEIARL VNEYPDCAAA LGKIRINPGN VGFGNKRDKN FEDIVEFAIK HDIPIRIGVN WGSLDKYLAS KLMNDNALLI NPKPDYIVLQ KALVISAITS AKRAEEIGLS KNKIVISCKT SKIQDLIPVY TVLSNVCNYP LHLGLTEAGS GTKGMVSSAA GISYLLLNGI GDTIRVSLTQ QPGEARSIEV KLCQEILQSI GLRNFSAQVT SCPGCNRTNP KYFHQLAKDI NDYIKQRMPV WRNDNPGSEN MTVAVMGCIV NGPGESKHAN LGISLPGYGE RPVAAVYQNG EKLCTLEGGN IFEQFVSIIE NYVNVYYKQ
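The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), TTG is one of the permitted initiation codons and is read as methionine at the first position. The sketch below is a minimal illustration, not the database's own pipeline: the helper names `clean`, `gc_content` and `translate` are hypothetical, the codon table is the standard one (table 11 uses the same amino acid assignments), and only the first 30 bases of the record are used to keep the example short.

```python
# Standard codon table built in TCAG order; table 11 shares these assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading an initiation codon at position 0 as M."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
prefix = "TTGTTTAATT GTATTGCAAA TAAAAATAAA"
print(translate(prefix))  # MFNCIANKNK
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_content` should give a value that can be compared with the GC content row.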