Gene EcSMS35_0685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0685 
SymbolubiF 
ID6144074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp694558 
End bp695733 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content57% 
IMG OID641615575 
Product2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol hydroxylase 
Protein accessionYP_001742781 
Protein GI170680248 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR01988] Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00002921 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATC AACCAACGGA AATTGCCATT GTCGGCGGAG GAATGGTCGG CGGCGCACTG 
GCGCTGGGAC TGGCTCAGCA CGGATTTTCG GTAATGGTTA TCGAACATGC CCAACCTGCA
CCGTTTGTCG CTGACAGCCA GCCGGACGTG CGGATCTCGG CGATTAGTGC GGCTTCGGTA
TCATTGCTTA AAGGGTTAGG GGTCTGGGAT GCAGTACAGG CTATGCGTTG CCATCCTTAC
CGCAGACTGG AAACGTGGGA GTGGGAAACG GCGCATGTGG TGTTTGACGC CGCTGAACTT
AAGCTACCGT TGCTTGGCTA CATGGTGGAA AACACTGTCC TGCAACAGGC GCTGTGGCAG
GCGCTGGAAG CGCATCCGAA AGTAACGTTA CGTGTGCCAG GCTCGCTGAT TGCGCTGCAT
CGCCATAATG ATCTTCAGGA GCTGGAACTG AAGGGCGGTG AAACGATCCG CGCGAAGCTG
GTGATTGGTG CCGACGGCGC AAATTCGCAG GTGCGGCAGA TGGCGGGAAT TGGCGTTCAT
GCCTGGCAGT ATGCACAGTC GTGTATGTTG ATTAGCGTAC AGTGCGAGAA CGATCCCGGC
GATAGCACCT GGCAGCAATT TACCCCGGAC GGACCGCGTG CGTTTCTGCC GTTGTTTGAT
AACTGGGCAT CGCTGGTGTG GTATGACTCT CCGGCGCGCA TTCGCCAGTT GCAGAATATG
AATATGGCGC AGCTACAGGC GGAAATCGCG AAGCATTTCC CGTCGCGTCT GGGTTACGTG
ACACCGCTTG CCGCGGGGGC GTTTCCGCTG ACACGTCGCC ATGCGTTGCA GTATGTTCAG
CCGGGATTGG CACTGGTGGG AGATGCCGCG CATACCATCC ATCCGCTGGC GGGGCAGGGG
GTGAATCTTG GTTATCGTGA TGTCGATGCC CTGATTGACG TTCTGGTGAA CGCCCGCAGC
TACGGCGAAG CGTGGGCCAG TTATCCTGTG CTCAAGCGTT ACCAGATGCG GCGCATGGCG
GATAACTTCA TTATGCAAAG CGGTATGGAT CTGTTTTATG CCGGATTCAG CAATAATCTG
CCACCACTGC GTTTTATGCG TAATCTCGGA TTAATGGCGG CGGAGCGTGC TGGCGTGTTG
AAACGTCAGG CGCTGAAATA TGCGTTAGGG TTGTAG
 
Protein sequence
MTNQPTEIAI VGGGMVGGAL ALGLAQHGFS VMVIEHAQPA PFVADSQPDV RISAISAASV 
SLLKGLGVWD AVQAMRCHPY RRLETWEWET AHVVFDAAEL KLPLLGYMVE NTVLQQALWQ
ALEAHPKVTL RVPGSLIALH RHNDLQELEL KGGETIRAKL VIGADGANSQ VRQMAGIGVH
AWQYAQSCML ISVQCENDPG DSTWQQFTPD GPRAFLPLFD NWASLVWYDS PARIRQLQNM
NMAQLQAEIA KHFPSRLGYV TPLAAGAFPL TRRHALQYVQ PGLALVGDAA HTIHPLAGQG
VNLGYRDVDA LIDVLVNARS YGEAWASYPV LKRYQMRRMA DNFIMQSGMD LFYAGFSNNL
PPLRFMRNLG LMAAERAGVL KRQALKYALG L