Gene EcSMS35_2939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2939 
SymbolfucO 
ID6145785 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3015164 
End bp3016315 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content54% 
IMG OID641617808 
ProductL-1,2-propanediol oxidoreductase 
Protein accessionYP_001744963 
Protein GI170682829 
COG category[C] Energy production and conversion 
COG ID[COG1454] Alcohol dehydrogenase, class IV 
TIGRFAM ID[TIGR02638] lactaldehyde reductase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.284503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCTA ACAGAATGAT TCTGAACGAA ACGGCATGGT TTGGTCGGGG TGCTGTTGGG 
GCTTTAACCG ATGAGGTGAA ACGCCGTGGT TATCAGAAGG CGCTGATCGT CACCGATAAA
ACGCTGGTGC AATGCGGCGT GGTGGCGAAA GTGACCGATA AGATGGATGC TGCAGGGCTG
GCATGGGCGA TTTACGACGG CGTAGTGCCC AACCCAACAA TTACTGTCGT CAAAGAAGGG
CTCGGTGTAT TCCAGAATAG CGGCGCGGAT TACCTGATCG CTATTGGTGG CGGTTCTCCA
CAGGATACGT GTAAAGCGAT TGGCATTATC AGCAACAATC CGGAGTTTGC CGATGTGCGT
AGCCTGGAAG GGCTTTCCCC GACCAATAAA CCCAGTGTAC CGATTCTGGC AATCCCCACC
ACAGCAGGTA CTGCGGCAGA AGTGACCATT AACTACGTGA TCACTGACGA AGAAAAACGG
CGCAAGTTTG TTTGCGTTGA TCCGCATGAT ATCCCGCAGG TGGCGTTTAT TGACGCTGAC
ATGATGGATG GTATGCCTCC AGCGCTGAAA GCCGCGACGG GTGTCGATGC GCTCACTCAT
GCTATTGAGG GGTATATTAC CCGTGGCGCG TGGGCGCTAA CCGATGCACT GCACATTAAA
GCGATTGAAA TCATTGCTGG GGCGCTGCGA GGATCGGTTG CTGGTGATAA GGATGCCGGA
GAAGAAATGG CGCTCGGGCA GTATGTTGCG GGTATGGGCT TCTCGAATGT TGGGTTAGGG
TTGGTGCATG GTATGGCGCA TCCACTGGGC GCGTTTTACA ACACGCCACA CGGTGTTGCG
AACGCCATCC TGTTACCGCA TGTCATGCGC TATAACGCTG ACTTTACCGG TGAGAAGTAC
CGCGATATCG CGCGCGTTAT GGGCCTGAAA GTGGAAGGTA TGAGCCTGGA AGAGGCGCGT
AATGCCGCTG TTGAAGCGGT GTTTGCTCTC AACCGTGATG TCGGTATTCC GCCACATTTA
CGTGATGTTG GTGTACGCAA GGAAGACATT CCGGCACTGG CGCAGGCGGC ACTGGATGAT
GTTTGTACCG GTGGCAACCC GCGTGAAGCA ACGCTTGAGG ATATTGTAGA GCTTTACCAT
ACCGCCTGGT AA
 
Protein sequence
MMANRMILNE TAWFGRGAVG ALTDEVKRRG YQKALIVTDK TLVQCGVVAK VTDKMDAAGL 
AWAIYDGVVP NPTITVVKEG LGVFQNSGAD YLIAIGGGSP QDTCKAIGII SNNPEFADVR
SLEGLSPTNK PSVPILAIPT TAGTAAEVTI NYVITDEEKR RKFVCVDPHD IPQVAFIDAD
MMDGMPPALK AATGVDALTH AIEGYITRGA WALTDALHIK AIEIIAGALR GSVAGDKDAG
EEMALGQYVA GMGFSNVGLG LVHGMAHPLG AFYNTPHGVA NAILLPHVMR YNADFTGEKY
RDIARVMGLK VEGMSLEEAR NAAVEAVFAL NRDVGIPPHL RDVGVRKEDI PALAQAALDD
VCTGGNPREA TLEDIVELYH TAW