Gene Information |
Locus tag | EcolC_1225 |
Symbol | |
ID | 6067264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1344632 |
End bp | 1345858 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641600640 |
Product | ethanolamine utilisation protein EutH |
Protein accession | YP_001724218 |
Protein GI | 170019264 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3192] Ethanolamine utilization protein |
TIGRFAM ID | |
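The length fields above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database. Under those assumptions the record's values give 1227 bp and 408 aa, matching the table.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
START_BP = 1344632
END_BP = 1345858


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```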
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATTA ACGAAATCAT CATGTACATC ATGATGTTCT TTATGCTGAT AGCTGCCGTA GACAGGATCC TGTCGCAGTT CGGCGGTTCT GCTCGTTTCC TCGGTAAGTT CGGTAAAAGT ATCGAAGGAT CAGGCGGTCA GTTCGAAGAA GGCTTTATGG CAATGGGCGC ACTGGGCCTG GCGATGGTCG GTATGACCGC GCTGGCACCG GTACTGGCTC ACGTTCTCGG GCCGGTAATT ATTCCGGTTT ACGAAATGCT CGGCGCTAAC CCATCGATGT TCGCCGGAAC ACTGCTGGCG TGCGATATGG GCGGCTTCTT CCTCGCCAAA GAGCTGGCGG GCGGCGACGT AGCCGCGTGG CTATACTCTG GGTTAATTCT CGGGTCGATG ATGGGGCCAA CGATTGTGTT TTCCATTCCG GTGGCGCTCG GCATTATCGA ACCTTCTGAC CGTCGTTATC TGGCGCTCGG CGTGCTGGCG GGCATTGTGA CCATTCCGAT TGGTTGTATC GCTGGTGGTC TGGTTGCTAT GTACTCCGGT GTGCAGATCA ACGGCCAGCC GGTGGAATTC ACTTTCGCCC TGATCCTGAT GAACATGATC CCGGTGATCA TTGTTGCGAT TCTGGTGGCG CTGGGGCTGA AATTCATCCC GGAAAAAATG ATCAACGGCT TCCAGATCTT CGCCAAATTC CTCGTTGCAT TGATCACCCT CGGTCTTGCC GCTGCGGTAG TGAAATTCCT GCTTGGCTGG GAACTGATCC CCGGTCTGGA TCCTATCTTT ATGGCCCCTG GCGATAAACC CGGTGAGGTG ATGCGCGCCA TTGAAGTTAT CGGTTCTATC TCCTGCGTTC TGTTAGGGGC GTATCCGATG GTGCTGCTGC TGACTCGCTG GTTTGAAAAA CCGCTGATGA GCGTCGGTAA AGTACTGAAT ATGAACAACA TCGCGGCAGC CGGCATGGTG GCAACGCTTG CCAACAACAT CCCGATGTTC GGCATGATGA AGCAGATGGA TACCCGCGGC AAAGTCATCA ACTGCGCCTT CGCCGTTTCC GCTGCTTTCG CCCTGGGCGA CCACTTAGGC TTCGCCGCTG CCAACATGAA CGCCATGATC TTCCCGATGA TTGTCGGCAA GTTGATCGGC GGCGTAACGG CGATTGGCGT GGCGATGATG CTGGTGCCAA AAGAAGACGC GACCGCGACT AAAACCGAAG CGGAGGCACA ATCGTGA
|
Protein sequence | MGINEIIMYI MMFFMLIAAV DRILSQFGGS ARFLGKFGKS IEGSGGQFEE GFMAMGALGL AMVGMTALAP VLAHVLGPVI IPVYEMLGAN PSMFAGTLLA CDMGGFFLAK ELAGGDVAAW LYSGLILGSM MGPTIVFSIP VALGIIEPSD RRYLALGVLA GIVTIPIGCI AGGLVAMYSG VQINGQPVEF TFALILMNMI PVIIVAILVA LGLKFIPEKM INGFQIFAKF LVALITLGLA AAVVKFLLGW ELIPGLDPIF MAPGDKPGEV MRAIEVIGSI SCVLLGAYPM VLLLTRWFEK PLMSVGKVLN MNNIAAAGMV ATLANNIPMF GMMKQMDTRG KVINCAFAVS AAFALGDHLG FAAANMNAMI FPMIVGKLIG GVTAIGVAMM LVPKEDATAT KTEAEAQS
|
| |
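The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and sanity-checked; the helper names are hypothetical, and the stop-codon set assumes translation table 11 (TAA, TAG, TGA) as stated in the record.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def strip_blocks(seq: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence are pasted here for brevity;
    # substitute the full "Gene sequence" field to check the whole record.
    fragment = strip_blocks("ATGGGAATTA ACGAAATCAT")
    print(fragment, round(gc_percent(fragment), 1))
```

Running `gc_percent` on the full cleaned gene sequence would give a value to compare with the GC content field, and `looks_like_complete_cds` would confirm that the sequence ends on a stop codon.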