Gene ECH74115_4199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4199 
Symbol 
ID6967944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3892640 
End bp3893842 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content55% 
IMG OID643387943 
Producthypothetical protein 
Protein accessionYP_002272382 
Protein GI209397458 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR01988] Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.122163 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAGTG TTGATGTAGC CATTGTTGGC GGCGGCATGG TGGGGCTGGC GGTTGCCTGC 
GGCTTACAGG GTAGCGGCTT ACGCGTTGCC GTACTGGAGC AGCGCGTACC GGAACCTCTG
GCGGCGGATG CACCACCACA ACTGCGCGTT TCGGCTATCA ATGCCGCCAG CGAAAAATTA
CTCACCCGTC TTGGCGTCTG GCAGGACATT CTCTCTCGCC GAGCCAGCTG TTATCACGGT
ATGGAAGTGT GGGACAAAGA CAGTTTTGGT CACATCTCGT TTGACGATCA AAGCATGGGC
TATAGCCATC TTGGGCACAT CGTTGAAAAC TCAGTGATTC ACTACGCGCT GTGGAACAAA
GCGCAGCAGT CGTCAGATAT CACTCTTTTG GCCCCCGCAG AATTACAGCA GGTCGCCTGG
GGAGAAAATG AAACCTTCCT GACGCTGAAA GATGGCAGTA TGTTAACGGC GCGTCTGGTG
ATTGGCGCGG ACGGCGCTAA TTCCTGGTTG CGCAACAAAG CCGATATTCC GCTGACTTTC
TGGGATTATC AGCATCACGC GCTGGTAGCG ACCATACGCA CGGAAGAACC GCATGATGCG
GTGGCGCGGC AGGTTTTCCA TGGCGAAGGC ATTCTGGCCT TTTTACCGCT TAGCGATCCG
CATCTTTGCT CGATTGTCTG GTCACTGTCG CCAGAGGAAG CGCAGCGGAT GCAGCAGGCA
AGTGAAGACG AATTTAATCG CGCGTTAAAT ATCGCTTTTG ATAATCGCCT GGGCTTATGC
AAGGTTGAGA GCGCGCGTCA GGTGTTCCCA CTGACGGGGC GTTATGCGCG CCAGTTTGCC
GCGCACCGTC TGGCGCTGGT GGGCGACGCC GCGCATACCA TTCACCCGCT GGCGGGGCAG
GGGGTAAATC TTGGCTTTAT GGATGCTGCA GAGCTGATTG CTGAACTGAA ACGGTTGCAT
CGTCAGGGTA AAGACATCGG GCAGTACATT TATCTGCGTC GCTATGAGCG TAGCCGCAAG
CACAGTGCGG CGCTGATGCT GGCTGGTATG CAGGGATTCC GCGATCTGTT TTCTGGTACC
AATCCGGTGA AAAAACTGCT GCGTGATATT GGTCTGAAAC TGGCCGACAC GCTTCCTGGC
GTTAAACCAC AACTTATCCG CCAGGCAATG GGATTAAACG ATTTGCCTGA ATGGCTGCGT
TAA
 
Protein sequence
MQSVDVAIVG GGMVGLAVAC GLQGSGLRVA VLEQRVPEPL AADAPPQLRV SAINAASEKL 
LTRLGVWQDI LSRRASCYHG MEVWDKDSFG HISFDDQSMG YSHLGHIVEN SVIHYALWNK
AQQSSDITLL APAELQQVAW GENETFLTLK DGSMLTARLV IGADGANSWL RNKADIPLTF
WDYQHHALVA TIRTEEPHDA VARQVFHGEG ILAFLPLSDP HLCSIVWSLS PEEAQRMQQA
SEDEFNRALN IAFDNRLGLC KVESARQVFP LTGRYARQFA AHRLALVGDA AHTIHPLAGQ
GVNLGFMDAA ELIAELKRLH RQGKDIGQYI YLRRYERSRK HSAALMLAGM QGFRDLFSGT
NPVKKLLRDI GLKLADTLPG VKPQLIRQAM GLNDLPEWLR