Gene ECH_0502 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0502 
SymbolispH 
ID3927316 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp503528 
End bp504487 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content30% 
IMG OID637901625 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase 
Protein accessionYP_507317 
Protein GI88658611 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCATCAAA ATTTAAAAAA AAATGTTGAA GTGATACTTG CTAATCCAAG AGGATTTTGT 
GCGGGAGTTT CTAGAGCAAT AGAAATCGTA AAGCTAGCAG TAAAATACCA TAGTGACAAT
AGAAAAGTTT ACGTACTACA TGAAATTGTA CACAATAAAT ATATAATTAA TTCCTTAAAG
GAAATGGGTG TAATTTTTAT AGATACATTA GATCAAGCTG AAGATGGATC AATATTAATA
TATAGTGCAC ACGGTATTTC AAAAGAAATA GAACACCTAG GACAATCATG CAACTTAGAG
ATTATTGATG CAACATGTCC ATTAGTAAAT AAAGTACATA AGGAAGTGCA AGCTTATGAT
AAAAAAGGAT ATCAAATAAT TTTAATAGGC CATAAAGGGC ATCGTGAAGT CGAAGGTACT
ATGGGACAAA TAACCAACCC TGTACTATTA GTACAAAACC TATCTGACAT TGATAATATA
GAAGTAACAA ATTCAGATAA ACTTGCATAT GTTACACAAA CAACTTTAAG TGTAGATGAC
ACAAAAGAAA TAATCAACAA ACTAAAACAA AAATTCCCAA ATATTAAAGG GCCAGATTTA
AAGGATATCT GTTATGCTAC TCAAAATAGG CAAACTGCTG TAAAACAATT ATCAGAATTA
GTAGATATCA TATTCGTATT AGGAAGCAAG AATAGTTCAA ATTCAAATCG TTTAAAAGAA
CTAGCTGAAT TAAAAACTCC TGCTTTTTTA ATAGATTCTT ATCAGGAAAT TAACTTAGAT
ATTTTAAAAG ATGTAAACAA AATAGGAATA ACTGCAGGAG CATCAGCCCC AGAAATACTA
ATCACAGAAG TAATAGATTT ACTGAAACAG CACATGAATA TCAAGTTATC AGATTTAGAA
GTTATAAGAG AGAACGTTGC ATTCAATATA CCAAAACAAT TAAGAGAATA CAAACTATAA
 
Protein sequence
MHQNLKKNVE VILANPRGFC AGVSRAIEIV KLAVKYHSDN RKVYVLHEIV HNKYIINSLK 
EMGVIFIDTL DQAEDGSILI YSAHGISKEI EHLGQSCNLE IIDATCPLVN KVHKEVQAYD
KKGYQIILIG HKGHREVEGT MGQITNPVLL VQNLSDIDNI EVTNSDKLAY VTQTTLSVDD
TKEIINKLKQ KFPNIKGPDL KDICYATQNR QTAVKQLSEL VDIIFVLGSK NSSNSNRLKE
LAELKTPAFL IDSYQEINLD ILKDVNKIGI TAGASAPEIL ITEVIDLLKQ HMNIKLSDLE
VIRENVAFNI PKQLREYKL