Gene Information |
Locus tag | EcSMS35_0356 |
Symbol | |
ID | 6144238 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 367804 |
End bp | 368853 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615252 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001742460 |
Protein GI | 170680738 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
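The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (which matches the stated gene length) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(367804, 368853)
print(length, protein_length_aa(length))  # prints: 1050 349
```

Both values agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields of this record.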
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.799512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCA AAGCTGTTGG TGCATATTCC GCTAAACAAC CGCTGGAACC GATGGATATC ACCCGGCGTG AACCGGGACC GCATGATGTC AAAATCGAAA TCGCTTACTG TGGCGTCTGC CATTCCGATA TCCACCAGGT CCGTTCCGAG TGGGCGGGGA CGGTTTACCC CTGTGTGCCG GGTCATGAAA TTGTGGGGCG TGTGGTAGCC GTTGGTGATC AGGTAGAAAA ACATGCGCCG GGCGATCTGG TCGGTGTCGG CTGCATTGTC GACAGTTGTA AACATTGCGA AGAGTGTGAA GACGGGCTGG AAAACTACTG TGATCACATG ACCGGCACCT ATAACTCGCC GACGCCGGAC GAACCGGGCC ATACTCTGGG CGGCTACTCA CAACAGATCG TCGTTCATGA GCGATATGTT CTGCGTATTC GTCACCCGCA AGAGCAGCTG GCGGCGGTGG CACCTTTGTT GTGTGCAGGG ATCACCACGT ATTCGCCGCT ACGTCACTGG CAGGCCGGGC CGGGAAAAAA AGTGGGCGTG GTCGGCATCG GCGGTCTGGG ACATATGGGG ATTAAGCTGG CCCACGCGAT GGGGGCGCAT GTGGTGGCAT TTACCACTTC TGAGGCAAAA CGCGAAGCGG CAAAAGCCCT GGGGGCCGAT GAAGTTGTTA ACTCACGCAA TGCCGATGAG ATGGCGGCTC ATCTCAAGAG TTTCGATTTC ATTTTGAATA CAGTAGCTGC GCCACATAAT CTCGACGATT TTACCACCTT GCTGAAGCGT GATGGCACCA TGACGCTGGT TGGTGCGCCT GCGACACCGC ATAAATCACC GGAAGTTTTC AACCTGATCA TGAAACGCCG TGCGATAGCC GGCTCTATGA TTGGCGGCAT TCCAGAAACA CAGGAGATGC TCGATTTTTG CGCCGAACAT GACATCGTGG CTGATATAGA GATGATTCGG GCCGATCAAA TTAATGAAGC CTATGAGCGA ATGCTGCGAG GTGATGTGAA ATATCGTTTT GTTATCGATA ATCGCACACT AACAGACTGA
|
Protein sequence | MKIKAVGAYS AKQPLEPMDI TRREPGPHDV KIEIAYCGVC HSDIHQVRSE WAGTVYPCVP GHEIVGRVVA VGDQVEKHAP GDLVGVGCIV DSCKHCEECE DGLENYCDHM TGTYNSPTPD EPGHTLGGYS QQIVVHERYV LRIRHPQEQL AAVAPLLCAG ITTYSPLRHW QAGPGKKVGV VGIGGLGHMG IKLAHAMGAH VVAFTTSEAK REAAKALGAD EVVNSRNADE MAAHLKSFDF ILNTVAAPHN LDDFTTLLKR DGTMTLVGAP ATPHKSPEVF NLIMKRRAIA GSMIGGIPET QEMLDFCAEH DIVADIEMIR ADQINEAYER MLRGDVKYRF VIDNRTLTD
|
| |
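The gene sequence above is printed in space-separated blocks of ten bases, so it needs light cleaning before any computation. The following is a minimal sketch, not the database's own method, for stripping the block spacing, computing the GC fraction, and running a basic check that a sequence looks like a complete CDS. The helper names are hypothetical. The start-codon check only accepts ATG, as found in this record; translation table 11 also permits alternative start codons, which this simple check ignores.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Length is a multiple of 3, starts with ATG, ends with a stop codon.

    Simplification: alternative start codons (e.g. GTG, TTG) are not accepted.
    """
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Demonstration on the first two blocks of the gene sequence only.
first_blocks = clean_sequence("ATGAAGATCA AAGCTGTTGG")
print(len(first_blocks), gc_fraction(first_blocks))  # prints: 20 0.4
```

Applied to the full gene sequence, `gc_fraction` can be compared with the GC content field, and `looks_like_complete_cds` should hold because the sequence begins with ATG and ends with TGA.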