Gene Information |
Locus tag | EcolC_3748 |
Symbol | |
ID | 6068040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4099512 |
End bp | 4100531 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603163 |
Product | alcohol dehydrogenase |
Protein accession | YP_001726682 |
Protein GI | 170021728 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
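
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based coordinates and that the CDS ends with a stop codon that is not counted in the protein length; both assumptions are consistent with the reported values but are not stated in the record. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for EcolC_3748.
print(gene_length(4099512, 4100531))  # 1020
print(protein_length(1020))           # 339
```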
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0358178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGA TAAAAAGCTA TGCCGCAAAA GAAGCGGGCG GCGAACTGGA AGTTTATGAG TACGATCCCG GTGAGCTGAA GCCACAAGAT GTTGAAGTAC AGGTGGATTA CTGCGGGATT TGTCATTCCG ATCTGTCGAT GATCGACAAT GAATGGGGAT TTTCACAATA TCCGCTGGTT GCCGGGCATG AGGTGATTGG GCGCGTGGTG GCACTCGGGA GCGCCGCGCA GGATAAAGGT TTGCAGGTCG GTCAGCGTGT CGGGATTGGC TGGACGGCGC GTAGCTGTGG TCACTGCGAC GCCTGTATTA GCGGTAATCA GATCAACTGC GAGCAAGGTG CGGTGCCGAC GATTATGAAT CGCGGTGGCT TTGCCGAGAA GTTGCGTGCG GACTGGCAAT GGGTGATTCC ACTGCCAGAA AATATTGATA TCGAGTCCGC CGGGCCGCTG TTGTGCGGCG GTATCACGGT CTTTAAACCA CTGTTGATGC ACCATATCAC TGCTACCAGC CGCGTTGGGG TAATTGGTAT TGGCGGGCTG GGGCATATCG CTATAAAACT TCTGCACGCA ATGGGATGCG AGGTGACGGC CTTTAGTTCT AATCCGGCGA AAGAGCAGGA AGTACTGGCG ATGGGTGCCG ATAAAGTGGT GAATAGCCGC GATCCGCAGG CACTGAAAGC ACTGGCGGGG CAGTTTGATC TCATTATCAA CACCGTCAAC GTCAGCCTCG ACTGGCAGCC TTATTTTGAG GCGCTGACCT ATGGCGGTAA TTTCCATACG GTCGGTGCGG TTCTCACGCC GCTGTCTGTT CCGGCCTTTA CGTTAATTGC GGGCGATCGC AGCGTCTCTG GTTCTGCTAC CGGCACGCCT TATGAGCTGC GTAAGCTGAT GCGTTTTGCC GCCCGCAGCA AGGTTGCGCC GACCACCGAA CTGTTCCCGA TGTCGAAAAT TAACGACGCC ATCCAGCATG TACGCGATGG TAAAGCGCGC TACCGCGTGG TCCTGAAAGC CGACTTCTGA
|
Protein sequence | MSMIKSYAAK EAGGELEVYE YDPGELKPQD VEVQVDYCGI CHSDLSMIDN EWGFSQYPLV AGHEVIGRVV ALGSAAQDKG LQVGQRVGIG WTARSCGHCD ACISGNQINC EQGAVPTIMN RGGFAEKLRA DWQWVIPLPE NIDIESAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL GHIAIKLLHA MGCEVTAFSS NPAKEQEVLA MGADKVVNSR DPQALKALAG QFDLIINTVN VSLDWQPYFE ALTYGGNFHT VGAVLTPLSV PAFTLIAGDR SVSGSATGTP YELRKLMRFA ARSKVAPTTE LFPMSKINDA IQHVRDGKAR YRVVLKADF
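
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a minimal sketch. It assumes the sequence is pasted with the 10-character block spacing used above, and it uses the standard codon-to-amino-acid mapping, which translation table 11 shares with the standard code for internal codons (table 11 differs in its permitted start codons, which this sketch does not handle). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        b1, b2, b3 = (BASES.index(b) for b in s[i:i + 3])
        aa = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGTCGATGA TAAAAAGCTA TGCCGCAAAA"
print(translate(first_blocks))  # MSMIKSYAAK
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields.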