Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3673 |
Symbol | eutH |
ID | 6971201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3387311 |
End bp | 3388537 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387467 |
Product | ethanolamine utilization protein EutH |
Protein accession | YP_002271920 |
Protein GI | 209400184 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3192] Ethanolamine utilization protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 83 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATTA ACGAAATCAT CATGTACATC ATGATGTTCT TTATGCTGAT AGCTGCCGTT GACAGGATCC TGTCACAGTT CGGCGGTTCT GCTCGTTTCC TCGGTAAGTT CGGTAAAAGT ATCGAAGGAT CCGGCGGTCA GTTCGAAGAA GGCTTTATGG CAATGGGCGC ACTGGGCCTG GCGATGGTCG GTATGACCGC GCTGGCACCG GTACTGGCTC ACGTACTCGG GCCGGTAATT ATTCCGGTTT ACGAAATGCT CGGCGCTAAC CCATCGATGT TCGCCGGAAC ACTGCTGGCG TGCGATATGG GCGGCTTCTT CCTCGCCAAA GAGCTGGCGG GCGGCGACGT AGCCGCGTGG CTATACTCTG GGTTAATTCT CGGGTCGATG ATGGGGCCAA CGATTGTGTT TTCCATTCCG GTGGCGCTCG GCATTATCGA ACCTTCTGAC CGTCGTTATC TGGCGCTCGG CGTGCTGGCG GGCATTGTGA CCATTCCGAT TGGTTGTATC GCTGGTGGTC TGGTTGCTAT GTACTCCGGT GTTCAGATCA ACGGCCAGCC GGTGGAATTC ACTTTCGCCC TGATCCTGAT GAACATGATC CCGGTGATCA TTGTTGCGAT TCTGGTGGCG CTGGGGCTGA AATTCATCCC AGAAAAAATG ATCAACGGCT TCCAGATCTT CGCCAAATTC CTCGTTGCAT TGATCACCCT CGGTCTTGCC GCTGCGGTAG TGAAATTCCT GCTTGGCTGG GAACTGATCC CCGGTCTGGA TCCTATCTTT ATGGCCCCTG GCGATAAACC CGGTGAGGTG ATGCGCGCCA TTGAAGTTAT CGGTTCTATC TCCTGCGTTC TGTTAGGGGC GTATCCGATG GTGCTGCTGC TGACCCGCTG GTTTGAAAAA CCGCTGATGA GCGTCGGTAA AGTGCTGAAT ATGAACAACA TCGCGGCAGC CGGCATGGTG GCAACGCTTG CCAACAACAT CCCGATGTTC GGCATGATGA AGCAGATGGA TACCCGCGGC AAAGTCATCA ACTGCGCCTT CGCCGTTTCC GCTGCTTTCG CCCTGGGCGA CCACTTAGGC TTCGCCGCTG CCAACATGAA TGCCATGATC TTCCCGATGA TTGTCGGCAA GTTGATCGGC GGCGTAACGG CGATTGGCGT GGCGATGATG CTGGTGCCAA AAGACGACGC GACCGCGGCT AAAACCGAAG CGGAGGCACA ATCGTGA
|
Protein sequence | MGINEIIMYI MMFFMLIAAV DRILSQFGGS ARFLGKFGKS IEGSGGQFEE GFMAMGALGL AMVGMTALAP VLAHVLGPVI IPVYEMLGAN PSMFAGTLLA CDMGGFFLAK ELAGGDVAAW LYSGLILGSM MGPTIVFSIP VALGIIEPSD RRYLALGVLA GIVTIPIGCI AGGLVAMYSG VQINGQPVEF TFALILMNMI PVIIVAILVA LGLKFIPEKM INGFQIFAKF LVALITLGLA AAVVKFLLGW ELIPGLDPIF MAPGDKPGEV MRAIEVIGSI SCVLLGAYPM VLLLTRWFEK PLMSVGKVLN MNNIAAAGMV ATLANNIPMF GMMKQMDTRG KVINCAFAVS AAFALGDHLG FAAANMNAMI FPMIVGKLIG GVTAIGVAMM LVPKDDATAA KTEAEAQS
|
| |