Gene ECH_0559 details


Gene Information

Locus tag: ECH_0559
Symbol: ispG
ID: 3927424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 562145
End bp: 563374
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 34%
IMG OID: 637901681
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_507371
Protein GI: 88657947
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
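
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is complete with a single terminal stop codon; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 562145
END_BP = 563374
GENE_LENGTH_BP = 1230
PROTEIN_LENGTH_AA = 409


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon (assumes a complete CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = expected_protein_length(GENE_LENGTH_BP)

# Both comparisons should hold if the record is internally consistent.
print(computed_length == GENE_LENGTH_BP)
print(computed_protein == PROTEIN_LENGTH_AA)
```

Under these assumptions, 563374 - 562145 + 1 gives 1230 bp, and 1230 / 3 - 1 gives 409 residues, matching the listed values.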


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.891821
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTTAATT GTATTGCAAA TAAAAATAAA ATAACCTATG AAGTGAAAGT TGGTGATGTA 
GTAATTGGTG GCAACAATCC TGTAGTAGTG CAGTCAATGG CGTTAGGCGG ATCTGGAGAT
GTGTATAAAG ATGCACATGA GGTTTTAGAG TTAGCGCAAG CTGGATCTGA ATTGGTAAGA
GTTGCAGTTA ATTCAGAACA AGCTATGAAA AATGTTCCGT ATATAAGAGA TGTATTGGTA
GATCATGGCT TTAGTGCTAA GATGATAATA GGATGTGGAC AATATGAAAT TGCTAGATTG
GTAAATGAGT ATCCTGATTG TGCAGCTGCT TTAGGAAAAA TACGTATTAA TCCAGGAAAT
GTTGGTTTTG GAAATAAACG GGATAAGAAT TTTGAAGATA TTGTTGAGTT TGCAATAAAA
CATGATATCC CTATCAGAAT AGGTGTAAAT TGGGGGAGTT TAGATAAGTA TTTAGCTTCA
AAATTAATGA ATGATAATGC ATTACTTATC AACCCTAAGC CAGATTATAT AGTTTTGCAG
AAAGCATTGG TAATTTCTGC TATAACAAGT GCTAAACGTG CAGAAGAAAT TGGCTTATCT
AAAAATAAGA TAGTTATATC TTGTAAAACA AGTAAAATAC AAGATTTAAT ACCTGTTTAT
ACAGTATTGT CAAATGTATG TAATTATCCA TTACATTTAG GGTTGACAGA AGCAGGGTCT
GGTACAAAAG GAATGGTTAG CAGTGCTGCA GGAATATCTT ACTTATTGTT AAATGGTATA
GGAGATACTA TACGTGTTTC CTTAACTCAA CAACCTGGTG AAGCAAGAAG TATTGAAGTC
AAGTTATGTC AAGAAATTTT GCAAAGTATA GGTTTAAGAA ATTTTTCTGC GCAGGTAACT
TCATGTCCAG GTTGTAATAG AACTAATCCT AAGTATTTTC ACCAATTAGC TAAAGATATT
AATGATTATA TAAAGCAACG TATGCCTGTG TGGAGAAATG ATAATCCCGG ATCTGAAAAT
ATGACTGTAG CAGTAATGGG TTGTATAGTC AATGGTCCAG GTGAAAGTAA ACACGCAAAT
TTAGGTATTA GTCTTCCTGG CTATGGTGAG AGGCCTGTAG CTGCAGTGTA TCAGAATGGA
GAGAAGTTGT GTACTTTAGA AGGCGGTAAT ATCTTTGAAC AATTTGTATC AATTATCGAA
AATTATGTTA ATGTTTATTA CAAACAATAG
 
Protein sequence
MFNCIANKNK ITYEVKVGDV VIGGNNPVVV QSMALGGSGD VYKDAHEVLE LAQAGSELVR 
VAVNSEQAMK NVPYIRDVLV DHGFSAKMII GCGQYEIARL VNEYPDCAAA LGKIRINPGN
VGFGNKRDKN FEDIVEFAIK HDIPIRIGVN WGSLDKYLAS KLMNDNALLI NPKPDYIVLQ
KALVISAITS AKRAEEIGLS KNKIVISCKT SKIQDLIPVY TVLSNVCNYP LHLGLTEAGS
GTKGMVSSAA GISYLLLNGI GDTIRVSLTQ QPGEARSIEV KLCQEILQSI GLRNFSAQVT
SCPGCNRTNP KYFHQLAKDI NDYIKQRMPV WRNDNPGSEN MTVAVMGCIV NGPGESKHAN
LGISLPGYGE RPVAAVYQNG EKLCTLEGGN IFEQFVSIIE NYVNVYYKQ
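
Both sequences are displayed in space-separated blocks of 10 residues, so they must be joined before any length or composition check. The gene sequence begins with TTG, which translation table 11 permits as a start codon, consistent with the protein starting with M. The following is a minimal sketch of the clean-up step and a GC calculation; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first displayed line of the gene sequence is used as input here.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as displayed above. Paste the full block
# here to compare against the reported gene length and GC content.
first_line = "TTGTTTAATT GTATTGCAAA TAAAAATAAA ATAACCTATG AAGTGAAAGT TGGTGATGTA"
dna = join_blocks(first_line)

print(len(dna))          # number of bases after joining the blocks
print(dna[:3])           # first codon of the CDS
print(gc_fraction(dna))  # GC fraction of this fragment only
```

The GC fraction printed here describes only the first 60 bases, so it is not expected to equal the 34% reported for the whole gene.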