Gene ECH74115_4200 details

Gene Information

Locus tag: ECH74115_4200
Symbol: ubiH
ID: 6968429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3893866
End bp: 3895044
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 60%
IMG OID: 643387944
Product: 2-octaprenyl-6-methoxyphenyl hydroxylase
Protein accession: YP_002272383
Protein GI: 209399230
COG category: [C] Energy production and conversion
              [H] Coenzyme transport and metabolism
COG ID: [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases
TIGRFAM ID: [TIGR01984] 2-polyprenyl-6-methoxyphenol 4-hydroxylase
            [TIGR01988] Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family
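
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions about the record's conventions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes an unspliced CDS whose length is a multiple of three and
    (by default) ends with one stop codon that is not translated.
    """
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if includes_stop else codons


length = gene_length_bp(3893866, 3895044)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```

Under these assumptions the computed values match the Gene Length (1179 bp) and Protein Length (392 aa) fields.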


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0148389
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTAA TCATCGTCGG TGGCGGCATG GCGGGCGCGA CGCTGGCGCT GGCTATTTCC 
CGGTTAAGTC ACGGGGCGCT GCCGGTACAT TTGATTGAAG CGACTGCGCC AGAGTCACAT
GCTCATCCGG GCTTTGATGG ACGAGCGATA GCGCTGGCGG CGGGTACCTG TCAGCAACTG
GCGCGCATCG GCGTTTGGCA ATCTCTGGCG CATTGCGCAA CTGCCATCAC CACCGTGCAT
GTCAGCGATC GAGGTCACGC TGGATTTGTC ACCCTCGCCG CAGAAGATTA CCAACTGGCG
GCGCTGGGAC AGGTTGTCGA ATTGCACAAT GTCGGGCAAC GGCTGTTTGC ATTGCTGCGT
AAAGCACCTG GCGTAACGCT GCATTGCCCT GATCGCGTGG CTAACGTTGC CCGTACTCAG
AGTCACGTTG AAGTGACGCT GGAGGGTGGC GAGACGCTGA CGGGCCGCGT GCTGGTAGCC
GCTGATGGCA CCCATTCAGC GTTAGCCACT GTGTGCGGCG TTGACTGGCA GCAGGAGCCT
TACGAACAAC TGGCTGTGAT TGCCAACGTT GCGACTTCCG TTGCGCATGA AGGGCGCGCT
TTTGAACGCT TTACGCAACA TGGCCCGCTG GCGATGTTGC CGATGTCTGA CGGACGCTGT
TCGCTGGTCT GGTGTCATCC ACTGGAACGG CGCGAAGAGG TGCTGTCGTG GAGCGACGAG
AAGTTTTGCC GTGAACTCCA GTCGGCCTTT GGCTGGCGAC TTGGGAAAAT TACCCACGCT
GGTAAACGCA GTGCTTATCC GCTGGCGTTA ACCCACGCCG CCAGATCTAT TACCCATCGT
ACCGTGCTGG TGGGCAATGC GGCGCAAACC CTGCACCCGA TTGCCGGGCA AGGGTTCAAC
CTCGGTATGC GAGATGTCAT GAGTCTTGCG GAAACCCTGA CTCAGGCGCA GGAGCGCGGA
GAAGACATGG GTGATTACGG CGTATTGTGT CGTTATCAGC AGCGTCGACA GAGCGATCGC
GAAGCAACCA TTGGCGTCAC GGACAGCCTT GTACATCTTT TTGCCAACCG TTGGACGCCA
CTGGTTGTCG GGCGCAACAT CGGGCTGATG ACGATGGAAT TATTCACCCC GGCACGCGAT
GTGCTGGCGC AGCGCACCCT CGGTTGGGTG GCGCGTTGA
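
GC content is the fraction of G and C bases in a nucleotide sequence. The following is a minimal sketch, assuming the sequence is supplied as text in space-separated 10-base blocks like the gene sequence above. The function name `gc_fraction` is hypothetical, and the example input is only the first block of the gene, so its result is not the whole-gene value reported in the record.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G or C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 10-base block of the gene sequence: 3 G + 1 C out of 10 bases.
print(gc_fraction("ATGAGCGTAA"))  # 0.4
```

Applying the same function to the full gene sequence would give the fraction to compare against the GC content field.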
 
Protein sequence
MSVIIVGGGM AGATLALAIS RLSHGALPVH LIEATAPESH AHPGFDGRAI ALAAGTCQQL 
ARIGVWQSLA HCATAITTVH VSDRGHAGFV TLAAEDYQLA ALGQVVELHN VGQRLFALLR
KAPGVTLHCP DRVANVARTQ SHVEVTLEGG ETLTGRVLVA ADGTHSALAT VCGVDWQQEP
YEQLAVIANV ATSVAHEGRA FERFTQHGPL AMLPMSDGRC SLVWCHPLER REEVLSWSDE
KFCRELQSAF GWRLGKITHA GKRSAYPLAL THAARSITHR TVLVGNAAQT LHPIAGQGFN
LGMRDVMSLA ETLTQAQERG EDMGDYGVLC RYQQRRQSDR EATIGVTDSL VHLFANRWTP
LVVGRNIGLM TMELFTPARD VLAQRTLGWV AR
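
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below builds the standard codon table and translates until the first stop codon. It is an illustration rather than the database's own procedure: translation table 11 differs from the standard table only in which codons may act as start codons, so the standard table is assumed to be sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so 10-base blocks can be pasted as they are.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGCGTAA TCATCGTCGG TGGCGGCATG"))  # MSVIIVGGGM
# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("GCGCGTTGA"))  # AR
```

Both outputs agree with the start (MSVIIVGGGM) and end (AR) of the protein sequence listed in the record.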