Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3041 |
Symbol | ubiH |
ID | 6147315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3130118 |
End bp | 3131296 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641617910 |
Product | 2-octaprenyl-6-methoxyphenyl hydroxylase |
Protein accession | YP_001745061 |
Protein GI | 170680909 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | [TIGR01984] 2-polyprenyl-6-methoxyphenol 4-hydroxylase [TIGR01988] Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0632678 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTAA TCATCGTCGG TGGCGGCATG GCGGGCGCGA CGCTGGCGCT GGCTATTTCC CGGTTAAGTC ACGGGGCGCT GCCGGTACAT TTGATTGAAG CGACTGCGCC AGAGTCACAT GCTCATCCGG GCTTTGATGG ACGAGCTATT GCGCTGGCGG CGGGTACCTG TCAGCAACTG GCGCGCATCG GCGTTTGGCA ATCTCTGGCG GATTGCGCAA CCGCCATCAC CAGCGTGCAT GTCAGCGATC GTGGTCATGC CGGATTCGTC ACCCTCGCCG CAGAAGATTA CCAACTGGCG GCGCTGGGGC AGGTTGTCGA ATTGCATAAT GTCGGGCAAC GGCTGTTTGC ATTGCTGCGC AAAGCACCTG GCGTAACGCT GCATTGCCCT GATCGCGTGG CTAACGTTGC CCGTGCTCAG AGTCACGTTG AAGTGACGCT GGAGAGTGGC GAGACGCTGA CGGGCTGCGT GCTGGTAGCC GCTGATGGCA CCCATTCAGC GTTAGCCACT GTGTGCGGCG TTGACTGGCA GCAGGAGCCT TACGAACAAC TGGCTGTGAT TGCTAACGTT GCGACTTCCG TTGCGCACGA AGGCCGCGCT TTTGAACGCT TCACGAAACA TGGCCCGCTG GCGATGTTGC CGATGTCTGA CGGACGCTGT TCGCTGGTCT GGTGTCATCC ACTGGAACGG CGCGAAGAGG TGCTGTCGTG GAGCGACGAG AAGTTTTGCC GTGAACTCCA GTCTGCCTTT GGCTGGCGAC TGGGGCAAAT TACCCACGCC GGTAAACGCA GTGCTTATCC GCTGGCATTA ACCCGCGCCG CCAGACCGAT TACCCATCGC ACTGTGCTGG TGGGCAATGC GGCGCAAACT CTGCACCCCA TCGCCGGGCA AGGGTTCAAC CTCGGTATGC GTGATGTGAT GAGCCTTGCA GAAACCCTGA CTCAGGCGCA GGAGCGCGGA GAAGACATGG GTGATTACGG CGTATTGTGC CGTTATCAGC AGCGTCGACA GCGCGATCGC GAAGCAACCA TCGGCGTCAC GGACAGCCTT GTACATCTTT TTGCCAACCG TTGGACCCCG CTGGTTGTCG GGCGCAACAT CGGGCTGATG ACGATGGAAT TATTTACCCC GGCACGCGAT GTGCTGGCGC AGCGCACCCT CGGTTGGGTG GCGCGTTGA
|
Protein sequence | MSVIIVGGGM AGATLALAIS RLSHGALPVH LIEATAPESH AHPGFDGRAI ALAAGTCQQL ARIGVWQSLA DCATAITSVH VSDRGHAGFV TLAAEDYQLA ALGQVVELHN VGQRLFALLR KAPGVTLHCP DRVANVARAQ SHVEVTLESG ETLTGCVLVA ADGTHSALAT VCGVDWQQEP YEQLAVIANV ATSVAHEGRA FERFTKHGPL AMLPMSDGRC SLVWCHPLER REEVLSWSDE KFCRELQSAF GWRLGQITHA GKRSAYPLAL TRAARPITHR TVLVGNAAQT LHPIAGQGFN LGMRDVMSLA ETLTQAQERG EDMGDYGVLC RYQQRRQRDR EATIGVTDSL VHLFANRWTP LVVGRNIGLM TMELFTPARD VLAQRTLGWV AR
|
| |