Gene Information |
Locus tag | EcE24377A_4837 |
Symbol | |
ID | 5586102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4830639 |
End bp | 4831658 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640928445 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001465773 |
Protein GI | 157158628 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
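
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(4830639, 4831658)
print(length)                  # 1020
print(protein_length(length))  # 339
```

Under these assumptions the computed values agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields.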
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.000190992 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATGA TAAAAAGCTA TGCCGCAAAA GAAGCGGGCG GCGAACTGGA AGTTTATGAG TACGATCCCG GTGAGCTGAA GCCACAAGAT GTTGAAGTGC AGGTGGATTA CTGCGGGATT TGTCATTCCG ATCTGTCGAT GATCGATAAC GAATGGGGAT TTTCACAATA TCCGCTGGTT GCCGGGCATG AGGTGATTGG GCGCGTGGTG GCACTCGGGA GCGCCGCGCA GGATAAAGGT TTGCAGGTCG GTCAGCGTGT CGGGATTGGC TGGACGGCGC GTAGCTGTGG TCACTGCGAC GCCTGTATTA GCGGTAATCA GATCAACTGC GAGCAAGGTG CGGTGCCGAC GATTATGAAT CGCGGTGGCT TTGCCGAGAA GTTGCGTGCG GGCTGGCAAT GGGTGATTCC ACTGCCAGAA AATATTGATA TCGAGTCCGC CGGGCCGCTG TTGTGCGGCG GTATCACGGT CTTTAAACCA CTGTTGATGC ACCATATCAC TGCTACCAGC CGCGTTGGGG TAATTGGTAT TGGCGGGCTG GGGCATATCG CTATAAAACT TCTGCACGCA ATGGGATGCG AGGTGACGGC CTTTAGTTCT AATCCGGCGA AAGAGCAGGA AGTACTGGCG ATGGGTGCCG ATAAAGTGGT GAATAGCCGC GATCCGCAGG CACTGAAAGC CCTGTCGGGG CAGTTTGATC TCATTATCAA TACTGTGAAC GTCAGCCTCG ACTGGCAGCC TTATTTTGAG GCGCTGACCT ATGGCGGTAA TTTCCATACG GTCGGTGCGG TTCTCACGCC GCTGTCTGTT CCGGCCTTTA CGTTAATTGC GGGCGACCGC AGCATCTCTG GTTCTGCTAC CGGCACGCCT TATGAGCTGC GAAAGCTGAT GCGCTTTGCC GCCCGCAGCA AGGTTGCGCC GACAACCGAA CTGTTCCCGA TGTCGAAAAT TAACGACGCC ATTCAGCATG TACGCGATGG TAAAGCTCGC TACCGAGTAG TCCTGAAAGC CGACTTCTGA
|
Protein sequence | MSMIKSYAAK EAGGELEVYE YDPGELKPQD VEVQVDYCGI CHSDLSMIDN EWGFSQYPLV AGHEVIGRVV ALGSAAQDKG LQVGQRVGIG WTARSCGHCD ACISGNQINC EQGAVPTIMN RGGFAEKLRA GWQWVIPLPE NIDIESAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL GHIAIKLLHA MGCEVTAFSS NPAKEQEVLA MGADKVVNSR DPQALKALSG QFDLIINTVN VSLDWQPYFE ALTYGGNFHT VGAVLTPLSV PAFTLIAGDR SISGSATGTP YELRKLMRFA ARSKVAPTTE LFPMSKINDA IQHVRDGKAR YRVVLKADF
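
The gene and protein sequences can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch using only the Python standard library. It assumes the sequences are pasted with the 10-character spacing shown in this record, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI base order T, C, A, G; amino-acid assignments are
# shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
first_blocks = "ATGTCGATGA TAAAAAGCTA TGCCGCAAAA"
print(translate(first_blocks))  # MSMIKSYAAK
```

Passing the full Gene sequence field to `translate` and `gc_percent` allows the result to be compared with the Protein sequence and GC content fields. This sketch does not handle alternative start codons, which is not an issue here because the gene begins with ATG.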