Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5786 |
Symbol | |
ID | 6971161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 5420931 |
End bp | 5421950 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643389416 |
Product | oxidoreductase, zinc-binding dehydrogenase family |
Protein accession | YP_002273809 |
Protein GI | 209400376 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00778522 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGA TAAAAAGCTA TGCCGCAAAA GAAGCGGGCG GCGAACTGGA AGTTTATGAG TACGATCCCG GTGAGCTGAG GCCACAAGAT GTTGAAGTGC AGGTGGATTA TTGCGGGATC TGCCATTCCG ATCTATCGAT GATCGACAAT GAATGGGGAT TTTCACAATA TCCGCTGGTT GCCGGGCATG AGGTGATTGG GCGCGTGGTG GCGCTCGGGA GCGCCGCGCA GGATAAAGGT TTGCAGGTCG GCCAGCGTGT CGGGATTGGC TGGACGGCGC GTAGCTGTGG TCACTGCGAC GCCTGTATTA GCGGTAATCA GATCAACTGC GAGCAAGGTG CTGTGCCGAC GATTATGAAT CGCGGTGGCT TTGCCGAGAA GTTGCGTGCG GACTGGCAAT GGGTGATTCC ACTGCCAGAA AATATTGATA TCGAGTCCGC CGGGCCGCTG TTGTGCGGCG GTATCACGGT CTTTAAACCA CTGTTGATGC ACCATATCAC TGCTACCAGC CGCGTTGGGG TAATTGGTAT TGGCGGGCTG GGGCATATCG CTATAAAACT TCTGCACGCA ATGGGATGCG AGGTGACGGC CTTTAGTTCT AATCCGGCGA AAGAGCAGGA AGTACTGGCG ATGGGTGCCG ATAAAGTGGT GAATAGCCGC GATCCGCAGG CACTGAAAGC CCTGTCGGGG CAGTTTGATC TCATTATCAA TACTGTGAAC GTCAGCCTCG ACTGGCAGCC TTATTTTGAG GCGCTGACCT ATGGCGGTAA TTTCCATACG GTCGGTGCGG TTCTCACGCC GTTGCCGGTT CCGGCCTTTA CGCTGATTGC GGGCGATCGC AGTGTCTCTG GCTCTGCTAC CGGTACACCT TATGAGTTGC GTAAGCTGAT GCGCTTTGCC GCCCGCAGCA AGGTTGCGCC GACTACCGAA TTGTTCCCGA TGTCGAAAAT TAACGATGCC ATCAAGCATG TGCGCGACGG TAAGGCGCGT TACCGCGTGG TTCTGAAAGC TGACTTCTGA
|
Protein sequence | MSMIKSYAAK EAGGELEVYE YDPGELRPQD VEVQVDYCGI CHSDLSMIDN EWGFSQYPLV AGHEVIGRVV ALGSAAQDKG LQVGQRVGIG WTARSCGHCD ACISGNQINC EQGAVPTIMN RGGFAEKLRA DWQWVIPLPE NIDIESAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL GHIAIKLLHA MGCEVTAFSS NPAKEQEVLA MGADKVVNSR DPQALKALSG QFDLIINTVN VSLDWQPYFE ALTYGGNFHT VGAVLTPLPV PAFTLIAGDR SVSGSATGTP YELRKLMRFA ARSKVAPTTE LFPMSKINDA IKHVRDGKAR YRVVLKADF
|
| |