Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4199 |
Symbol | |
ID | 6967944 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3892640 |
End bp | 3893842 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387943 |
Product | hypothetical protein |
Protein accession | YP_002272382 |
Protein GI | 209397458 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | [TIGR01988] Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.122163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 85 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAGTG TTGATGTAGC CATTGTTGGC GGCGGCATGG TGGGGCTGGC GGTTGCCTGC GGCTTACAGG GTAGCGGCTT ACGCGTTGCC GTACTGGAGC AGCGCGTACC GGAACCTCTG GCGGCGGATG CACCACCACA ACTGCGCGTT TCGGCTATCA ATGCCGCCAG CGAAAAATTA CTCACCCGTC TTGGCGTCTG GCAGGACATT CTCTCTCGCC GAGCCAGCTG TTATCACGGT ATGGAAGTGT GGGACAAAGA CAGTTTTGGT CACATCTCGT TTGACGATCA AAGCATGGGC TATAGCCATC TTGGGCACAT CGTTGAAAAC TCAGTGATTC ACTACGCGCT GTGGAACAAA GCGCAGCAGT CGTCAGATAT CACTCTTTTG GCCCCCGCAG AATTACAGCA GGTCGCCTGG GGAGAAAATG AAACCTTCCT GACGCTGAAA GATGGCAGTA TGTTAACGGC GCGTCTGGTG ATTGGCGCGG ACGGCGCTAA TTCCTGGTTG CGCAACAAAG CCGATATTCC GCTGACTTTC TGGGATTATC AGCATCACGC GCTGGTAGCG ACCATACGCA CGGAAGAACC GCATGATGCG GTGGCGCGGC AGGTTTTCCA TGGCGAAGGC ATTCTGGCCT TTTTACCGCT TAGCGATCCG CATCTTTGCT CGATTGTCTG GTCACTGTCG CCAGAGGAAG CGCAGCGGAT GCAGCAGGCA AGTGAAGACG AATTTAATCG CGCGTTAAAT ATCGCTTTTG ATAATCGCCT GGGCTTATGC AAGGTTGAGA GCGCGCGTCA GGTGTTCCCA CTGACGGGGC GTTATGCGCG CCAGTTTGCC GCGCACCGTC TGGCGCTGGT GGGCGACGCC GCGCATACCA TTCACCCGCT GGCGGGGCAG GGGGTAAATC TTGGCTTTAT GGATGCTGCA GAGCTGATTG CTGAACTGAA ACGGTTGCAT CGTCAGGGTA AAGACATCGG GCAGTACATT TATCTGCGTC GCTATGAGCG TAGCCGCAAG CACAGTGCGG CGCTGATGCT GGCTGGTATG CAGGGATTCC GCGATCTGTT TTCTGGTACC AATCCGGTGA AAAAACTGCT GCGTGATATT GGTCTGAAAC TGGCCGACAC GCTTCCTGGC GTTAAACCAC AACTTATCCG CCAGGCAATG GGATTAAACG ATTTGCCTGA ATGGCTGCGT TAA
|
Protein sequence | MQSVDVAIVG GGMVGLAVAC GLQGSGLRVA VLEQRVPEPL AADAPPQLRV SAINAASEKL LTRLGVWQDI LSRRASCYHG MEVWDKDSFG HISFDDQSMG YSHLGHIVEN SVIHYALWNK AQQSSDITLL APAELQQVAW GENETFLTLK DGSMLTARLV IGADGANSWL RNKADIPLTF WDYQHHALVA TIRTEEPHDA VARQVFHGEG ILAFLPLSDP HLCSIVWSLS PEEAQRMQQA SEDEFNRALN IAFDNRLGLC KVESARQVFP LTGRYARQFA AHRLALVGDA AHTIHPLAGQ GVNLGFMDAA ELIAELKRLH RQGKDIGQYI YLRRYERSRK HSAALMLAGM QGFRDLFSGT NPVKKLLRDI GLKLADTLPG VKPQLIRQAM GLNDLPEWLR
|
| |