Gene Information |
Locus tag | EcHS_A4525 |
Symbol | |
ID | 5591569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4531319 |
End bp | 4532338 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640923621 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001461062 |
Protein GI | 157163744 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
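The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4531319
end_bp = 4532338

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1020
print(protein_length)  # 339
```

Both results match the Gene Length (1020 bp) and Protein Length (339 aa) rows.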
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 0.0564298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATGA TAAAAAGCTA TGCCGCAAAA GAAGCGGGCG GCGAACTGGA AGTTTATGAG TACGATCCCG GTGAGCTGAA GCCACAAGAT GTTGAAGTGC AGGTGGATTA CTGCGGGATC TGCCATTCCG ATCTGTCGAT GATCGATAAC GAATGGGGAT TTTCACAATA TCCGCTGGTT GCCGGGCATG AGGTGATTGG TCGCGTGGTG GCGCTCGGGA GTGCCGCGCA GGATAAAGGT TTGCAGGTCG GTCAGCGTGT CGGGATTGGC TGGACAGCGC GTAGCTGTGG TCACTGCGAC GCCTGTATTA GCGGAAATCA GATCAACTGT GAGCAAGGTG CGGTGCCAAC AATTATGAAT CGCGGAGGTT TTGCCGAGAA GTTGCGTGTA GACTGGCAAT GGGTTATTCC ACTGCCGGAA AATATCGACA TTGAATCTGC CGGGCCGCTG TTGTGCGGCG GTATCACGGT CTTTAAACCA CTGTTGATGC ACCATATCAC TGCTACCAGC CGCGTTGGGG TAATTGGTAT TGGCGGGCTG GGGCATATCG CTATAAAACT TCTGCACGCA ATGGGATGTG AGGTGACGGC CTTTAGTTCT AATCCGGCGA AAGAGCAGGA ATTGCTGGCG ATGGGTGCCG ATAAAGTGGT GAATAGCCGC GATCCGCAGG CACTGAAAGC ACTGGCGGGG CAGTTTGATC TCATTATCAA TACCGTGAAC GTCAGCCTCG ACTGGCAGCC TTATTTTGAG GCGCTGACGT ACGGCGGTAA TTTCCACACT GTCGGTGCGG TTCTCACGCC GCTGTCTGTT CCGGCCTTTA CGTTAATTGC GGGCGATCGC AGCGTCTCTG GCTCTGCTAC CGGCACGCCT TATGAACTGC GTAAGCTGAT GCGCTTTGCC GCCCGCAGCA AGGTTGCGCC GACAACCGAA CTGTTCCCGA TGTCGAAAAT TAACGACGCC ATCCAGCATG TGCGCGACGG TAAGGCGCGT TACCGCGTGG TGTTGAAAGC CGATTTTTGA
|
Protein sequence | MSMIKSYAAK EAGGELEVYE YDPGELKPQD VEVQVDYCGI CHSDLSMIDN EWGFSQYPLV AGHEVIGRVV ALGSAAQDKG LQVGQRVGIG WTARSCGHCD ACISGNQINC EQGAVPTIMN RGGFAEKLRV DWQWVIPLPE NIDIESAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL GHIAIKLLHA MGCEVTAFSS NPAKEQELLA MGADKVVNSR DPQALKALAG QFDLIINTVN VSLDWQPYFE ALTYGGNFHT VGAVLTPLSV PAFTLIAGDR SVSGSATGTP YELRKLMRFA ARSKVAPTTE LFPMSKINDA IQHVRDGKAR YRVVLKADF
|
| |
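The gene sequence above starts with ATG and ends with TGA, so it appears to be given in coding orientation even though the gene lies on the minus strand. Under that assumption, the sketch below shows one way to compute GC content and translate the sequence without external libraries. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, which this sketch does not model.

```python
from itertools import product

# Standard codon assignments (shared by translation tables 1 and 11),
# enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)

def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence listed above.
prefix = "ATGTCAATGA TAAAAAGCTA TGCCGCAAAA"
print(translate(prefix))  # MSMIKSYAAK
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence row, and `gc_fraction` on the same input can be compared with the GC content row.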