Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2598 |
Symbol | eutH |
ID | 6144711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2651229 |
End bp | 2652455 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617469 |
Product | ethanolamine utilization protein EutH |
Protein accession | YP_001744634 |
Protein GI | 170683597 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3192] Ethanolamine utilization protein |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATTA ACGAAATCAT CATGTACATC ATGATGTTCT TTATGCTGAT AGCTGCCGTA GACAGGATCC TGTCGCAGTT CGGCGGTTCT GCTCGTTTCC TCGGTAAGTT CGGTAAAAGT ATCGAAGGAT CCGGCAGTCA GTTCGAAGAA GGCTTTATGG CAATGGGCGC TTTGGGCCTG GCGATGGTTG GTATGACCGC GCTGGCACCT GTTCTGGCCC ACGTACTCGG ACCGGTGATT ATCCCGGTTT ACGAAATGCT CGGCGCAAAC CCGTCAATGT TCGCCGGAAC ACTGCTGGCG TGCGATATGG GCGGCTTCTT CCTCGCCAAA GAGTTGGCGG GCGGCGACGT AGCAGCGTGG CTATACTCTG GATTAATTCT CGGGTCGATG ATGGGGCCAA CGATTGTGTT TTCCATTCCG GTGGCGCTCG GCATTATCGA ACCTTCTGAC CGTCGTTATC TGGCGCTCGG CGTGCTGGCG GGCATTGTGA CCATTCCGAT TGGCTGTATT GCCGGTGGTC TGGTGGCTAT GTACTCCGGT GTGCAGATCA ACGGTCAGCC AGTGGAATTC ACCTTTGCGC TGATCCTGAT GAACATGATC CCGGTACTTA TCGTTGCGGT GCTGGTGGCG CTGGGGCTGA AATTCATCCC GGAAAAAATG ATCAACGGCT TCCAGATCTT CGCCAAATTC CTCGTTGCAT TGATCACCCT CGGTCTTGCC GCTGCGGTAG TGAAATTCCT CCTTGGCTGG GAACTGATCC CGGGCCTTGA TCCTATCTTT ATGGCCCCTG GCGATAAACC CGGTGAAGTG ATGCGCGCCA TTGAAGTTAT CGGCTCGATC TCCTGCGTTC TGTTAGGGGC GTATCCGATG GTGCTGCTGC TGACTCGCTG GTTTGAAAAA CCGCTGATGA GTGTCGGTAA GGTGCTGAAT ATGAACAATA TAGCGGCAGC CGGCATGGTG GCAACGCTTG CCAACAACAT CCCGATGTTT GGCATGATGA AGCAGATGGA TACCCGCGGC AAAGTCATCA ACTGTGCCTT CGCCGTTTCC GCTGCTTTCG CCCTGGGCGA CCATTTAGGC TTCGCGGCTG CCAACATGAA CGCCATGATC TTCCCGATGA TTGTCGGCAA GCTGATCGGC GGCGTCACGG CGATTGGCGT GGCGATGATG CTGGTACCTA AAGAAGACGC GAGCGCGGCT AAAACCGAAG CGGAGGCGCA ATCGTGA
|
Protein sequence | MGINEIIMYI MMFFMLIAAV DRILSQFGGS ARFLGKFGKS IEGSGSQFEE GFMAMGALGL AMVGMTALAP VLAHVLGPVI IPVYEMLGAN PSMFAGTLLA CDMGGFFLAK ELAGGDVAAW LYSGLILGSM MGPTIVFSIP VALGIIEPSD RRYLALGVLA GIVTIPIGCI AGGLVAMYSG VQINGQPVEF TFALILMNMI PVLIVAVLVA LGLKFIPEKM INGFQIFAKF LVALITLGLA AAVVKFLLGW ELIPGLDPIF MAPGDKPGEV MRAIEVIGSI SCVLLGAYPM VLLLTRWFEK PLMSVGKVLN MNNIAAAGMV ATLANNIPMF GMMKQMDTRG KVINCAFAVS AAFALGDHLG FAAANMNAMI FPMIVGKLIG GVTAIGVAMM LVPKEDASAA KTEAEAQS
|
| |