Gene EcSMS35_3041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3041 
SymbolubiH 
ID6147315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3130118 
End bp3131296 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content60% 
IMG OID641617910 
Product2-octaprenyl-6-methoxyphenyl hydroxylase 
Protein accessionYP_001745061 
Protein GI170680909 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR01984] 2-polyprenyl-6-methoxyphenol 4-hydroxylase
[TIGR01988] Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0632678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTAA TCATCGTCGG TGGCGGCATG GCGGGCGCGA CGCTGGCGCT GGCTATTTCC 
CGGTTAAGTC ACGGGGCGCT GCCGGTACAT TTGATTGAAG CGACTGCGCC AGAGTCACAT
GCTCATCCGG GCTTTGATGG ACGAGCTATT GCGCTGGCGG CGGGTACCTG TCAGCAACTG
GCGCGCATCG GCGTTTGGCA ATCTCTGGCG GATTGCGCAA CCGCCATCAC CAGCGTGCAT
GTCAGCGATC GTGGTCATGC CGGATTCGTC ACCCTCGCCG CAGAAGATTA CCAACTGGCG
GCGCTGGGGC AGGTTGTCGA ATTGCATAAT GTCGGGCAAC GGCTGTTTGC ATTGCTGCGC
AAAGCACCTG GCGTAACGCT GCATTGCCCT GATCGCGTGG CTAACGTTGC CCGTGCTCAG
AGTCACGTTG AAGTGACGCT GGAGAGTGGC GAGACGCTGA CGGGCTGCGT GCTGGTAGCC
GCTGATGGCA CCCATTCAGC GTTAGCCACT GTGTGCGGCG TTGACTGGCA GCAGGAGCCT
TACGAACAAC TGGCTGTGAT TGCTAACGTT GCGACTTCCG TTGCGCACGA AGGCCGCGCT
TTTGAACGCT TCACGAAACA TGGCCCGCTG GCGATGTTGC CGATGTCTGA CGGACGCTGT
TCGCTGGTCT GGTGTCATCC ACTGGAACGG CGCGAAGAGG TGCTGTCGTG GAGCGACGAG
AAGTTTTGCC GTGAACTCCA GTCTGCCTTT GGCTGGCGAC TGGGGCAAAT TACCCACGCC
GGTAAACGCA GTGCTTATCC GCTGGCATTA ACCCGCGCCG CCAGACCGAT TACCCATCGC
ACTGTGCTGG TGGGCAATGC GGCGCAAACT CTGCACCCCA TCGCCGGGCA AGGGTTCAAC
CTCGGTATGC GTGATGTGAT GAGCCTTGCA GAAACCCTGA CTCAGGCGCA GGAGCGCGGA
GAAGACATGG GTGATTACGG CGTATTGTGC CGTTATCAGC AGCGTCGACA GCGCGATCGC
GAAGCAACCA TCGGCGTCAC GGACAGCCTT GTACATCTTT TTGCCAACCG TTGGACCCCG
CTGGTTGTCG GGCGCAACAT CGGGCTGATG ACGATGGAAT TATTTACCCC GGCACGCGAT
GTGCTGGCGC AGCGCACCCT CGGTTGGGTG GCGCGTTGA
 
Protein sequence
MSVIIVGGGM AGATLALAIS RLSHGALPVH LIEATAPESH AHPGFDGRAI ALAAGTCQQL 
ARIGVWQSLA DCATAITSVH VSDRGHAGFV TLAAEDYQLA ALGQVVELHN VGQRLFALLR
KAPGVTLHCP DRVANVARAQ SHVEVTLESG ETLTGCVLVA ADGTHSALAT VCGVDWQQEP
YEQLAVIANV ATSVAHEGRA FERFTKHGPL AMLPMSDGRC SLVWCHPLER REEVLSWSDE
KFCRELQSAF GWRLGQITHA GKRSAYPLAL TRAARPITHR TVLVGNAAQT LHPIAGQGFN
LGMRDVMSLA ETLTQAQERG EDMGDYGVLC RYQQRRQRDR EATIGVTDSL VHLFANRWTP
LVVGRNIGLM TMELFTPARD VLAQRTLGWV AR