Gene EcSMS35_0356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0356 
Symbol 
ID6144238 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp367804 
End bp368853 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content54% 
IMG OID641615252 
Productzinc-binding dehydrogenase family oxidoreductase 
Protein accessionYP_001742460 
Protein GI170680738 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.799512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCA AAGCTGTTGG TGCATATTCC GCTAAACAAC CGCTGGAACC GATGGATATC 
ACCCGGCGTG AACCGGGACC GCATGATGTC AAAATCGAAA TCGCTTACTG TGGCGTCTGC
CATTCCGATA TCCACCAGGT CCGTTCCGAG TGGGCGGGGA CGGTTTACCC CTGTGTGCCG
GGTCATGAAA TTGTGGGGCG TGTGGTAGCC GTTGGTGATC AGGTAGAAAA ACATGCGCCG
GGCGATCTGG TCGGTGTCGG CTGCATTGTC GACAGTTGTA AACATTGCGA AGAGTGTGAA
GACGGGCTGG AAAACTACTG TGATCACATG ACCGGCACCT ATAACTCGCC GACGCCGGAC
GAACCGGGCC ATACTCTGGG CGGCTACTCA CAACAGATCG TCGTTCATGA GCGATATGTT
CTGCGTATTC GTCACCCGCA AGAGCAGCTG GCGGCGGTGG CACCTTTGTT GTGTGCAGGG
ATCACCACGT ATTCGCCGCT ACGTCACTGG CAGGCCGGGC CGGGAAAAAA AGTGGGCGTG
GTCGGCATCG GCGGTCTGGG ACATATGGGG ATTAAGCTGG CCCACGCGAT GGGGGCGCAT
GTGGTGGCAT TTACCACTTC TGAGGCAAAA CGCGAAGCGG CAAAAGCCCT GGGGGCCGAT
GAAGTTGTTA ACTCACGCAA TGCCGATGAG ATGGCGGCTC ATCTCAAGAG TTTCGATTTC
ATTTTGAATA CAGTAGCTGC GCCACATAAT CTCGACGATT TTACCACCTT GCTGAAGCGT
GATGGCACCA TGACGCTGGT TGGTGCGCCT GCGACACCGC ATAAATCACC GGAAGTTTTC
AACCTGATCA TGAAACGCCG TGCGATAGCC GGCTCTATGA TTGGCGGCAT TCCAGAAACA
CAGGAGATGC TCGATTTTTG CGCCGAACAT GACATCGTGG CTGATATAGA GATGATTCGG
GCCGATCAAA TTAATGAAGC CTATGAGCGA ATGCTGCGAG GTGATGTGAA ATATCGTTTT
GTTATCGATA ATCGCACACT AACAGACTGA
 
Protein sequence
MKIKAVGAYS AKQPLEPMDI TRREPGPHDV KIEIAYCGVC HSDIHQVRSE WAGTVYPCVP 
GHEIVGRVVA VGDQVEKHAP GDLVGVGCIV DSCKHCEECE DGLENYCDHM TGTYNSPTPD
EPGHTLGGYS QQIVVHERYV LRIRHPQEQL AAVAPLLCAG ITTYSPLRHW QAGPGKKVGV
VGIGGLGHMG IKLAHAMGAH VVAFTTSEAK REAAKALGAD EVVNSRNADE MAAHLKSFDF
ILNTVAAPHN LDDFTTLLKR DGTMTLVGAP ATPHKSPEVF NLIMKRRAIA GSMIGGIPET
QEMLDFCAEH DIVADIEMIR ADQINEAYER MLRGDVKYRF VIDNRTLTD