Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4750 |
Symbol | |
ID | 6143165 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4850100 |
End bp | 4851119 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619565 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001746673 |
Protein GI | 170683328 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATGA TAAAAAGCTA TGCCGCAAAA GAAGCGGGCG GCGAACTGGA ACTTTATGAG TACGATCCCG GTGAATTGAA GCCACAAGAT GTTGAAGTGC AGGTGGATTA CTGCGGGATC TGCCATTCCG ATCTGTCGAT GATCGACAAC GAATGGGGAT TTTCACAGTA TCCGCTGGTT GCCGGGCATG AGGTGATTGG TCGCGTGGTG GCGCTCGGGA GTGCCGCACA GGATAAAGGT TTGCAGGTCG GTCAGCGTGT CGGGATTGGC TGGACGGCAC GCAGCTGCGG GCATTGCGAT GCCTGTATTA GCGGTAATCA GATCAACTGC GAGCAAGGTG CGGTGCCAAC AATTATGAAT CGCGGCGGTT TTGCCGAGAA GTTGCGTGCG GACTGGCAAT GGGTGATTCC GCTGCCAGAA AATATCGATA TCGAGTCCGC CGGGCCGCTG CTGTGTGGTG GTATCACGGT CTTTAAACCA CTGTTGATGC ACCATATCAC TGCTACCAGC CGCGTTGGGG TAATTGGTAT TGGCGGGCTG GGGCATATCG CTATAAAACT TCTGCACGCA ATGGGATGCG AGGTGACAGC CTTTAGTTCT AATCCGGCGA AAGAGCAGGA AGTACTGGCG ATGGGTGCCG ATAAAGTGGT GAATAGCCGC GATCCGCAGG CACTGAAAAC TCTGGCGGGG CAGTTTGATC TCATTATCAA TACTGTGAAC GTCAGCCTCG ACTGGCAGCC TTATTTTGAG GCGCTGACGT ATGGCGGTAA TTTCCATACG GTCGGTGCGG TTCTCACGCC GCTGTCTGTT CCGGCCTTTA CGTTAATTGC GGGCGACCGC AGCGTCTCTG GCTCTGCTAC CGGCACGCCT TATGAGCTGC GTAAGCTGAT GCGCTTTGCC GCCCGCAGCA AGGTTGCGCC AACTACCGAA CTGTTCCCGA TGTCGAAAAT TAACGACGCC ATCCAGCATG TGCGCGACGG TAAGGCGCGT TACCGCGTGG TTCTGAAAGC CGATTTTTGA
|
Protein sequence | MSMIKSYAAK EAGGELELYE YDPGELKPQD VEVQVDYCGI CHSDLSMIDN EWGFSQYPLV AGHEVIGRVV ALGSAAQDKG LQVGQRVGIG WTARSCGHCD ACISGNQINC EQGAVPTIMN RGGFAEKLRA DWQWVIPLPE NIDIESAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL GHIAIKLLHA MGCEVTAFSS NPAKEQEVLA MGADKVVNSR DPQALKTLAG QFDLIINTVN VSLDWQPYFE ALTYGGNFHT VGAVLTPLSV PAFTLIAGDR SVSGSATGTP YELRKLMRFA ARSKVAPTTE LFPMSKINDA IQHVRDGKAR YRVVLKADF
|
| |