Gene EcSMS35_3296 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3296 
SymbolyqhD 
ID6143233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3370449 
End bp3371612 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content54% 
IMG OID641618126 
Productalcohol dehydrogenase yqhD 
Protein accessionYP_001745276 
Protein GI170682079 
COG category[C] Energy production and conversion 
COG ID[COG1979] Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.495379 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAACT TTAATCTGCA CACCCCAACC CGCATTCTGT TTGGTAAAGG CGCAATCGCT 
GGTTTACGCG AGCAAATTCC TCACGATGCA CGCGTATTGA TTACCTACGG CGGCGGCAGC
GTGAAAAAAA CCGGCGTTCT CGATCAAGTT CTGGATGCCC TGAAAGGCAT GGACGTGCTG
GAATTTGGCG GTATTGAGCC AAACCCGGCT TATGAAACGC TGATGAACGC CGTGAAACTG
GTTCGCGAAC AAAAAGTGAC TTTCCTGCTG GCGGTTGGCG GCGGTTCTGT ACTGGACGGC
ACCAAATTTA TCGCCGCAGC GGCTAATTAT CCGGAAAATA TCGATCCGTG GCACATTCTG
CAAACGGGCG GCAAAGAGAT TAAAAGCGCC ATCCCGATGG GCTGTGTGCT GACGCTGCCA
GCAACCGGTT CAGAATCCAA CGCAGGCGCG GTGATCTCCC GTAAAACCAC AGGCGACAAG
CAGGCATTCC ATTCCGCCCA TGTTCAGCCG GTATTTGCCG TGCTCGATCC GGTTTATACC
TACACCCTGC CGCCGCGTCA GGTGGCTAAC GGTGTAGTGG ATGCCTTTGT GCACACCGTG
GAACAGTATG TTACCAAACC GGTTGATGCC AAAATTCAGG ACCGTTTCGC AGAAGGCATT
TTGTTGACGC TGATCGAAGA TGGTCCGAAA GCCCTGAAAG AGCCAGAAAA CTACGATGTG
CGCGCCAACG TTATGTGGGC GGCGACTCAG GCGCTGAACG GTTTGATCGG CGCTGGCGTA
CCGCAGGACT GGGCAACGCA TATGCTAGGC CACGAACTGA CTGCGATGCA CGGTCTGGAT
CACGCACAAA CACTGGCTAT CGTCCTGCCT GCACTGTGGA ATGAAAAACG CGATACCAAG
CGCGCTAAGC TGCTGCAATA TGCTGAACGC GTCTGGAACA TCACTGAAGG TTCCGACGAT
GAGCGTATTG ACGCCGCGAT TGCCGCAACC CGCAATTTCT TTGAGCAATT AGGCGTGCCG
ACCCACCTCT CCGACTACGG TCTGGACGGC AGCTCCATCC CGGCTTTGTT GAAAAAACTG
GAAGAGCACG GTATGACCCA ACTGGGCGAA AATCATGACA TTACGCTGGA TGTCAGCCGC
CGCATATACG AAGCCGCCCG CTAA
 
Protein sequence
MNNFNLHTPT RILFGKGAIA GLREQIPHDA RVLITYGGGS VKKTGVLDQV LDALKGMDVL 
EFGGIEPNPA YETLMNAVKL VREQKVTFLL AVGGGSVLDG TKFIAAAANY PENIDPWHIL
QTGGKEIKSA IPMGCVLTLP ATGSESNAGA VISRKTTGDK QAFHSAHVQP VFAVLDPVYT
YTLPPRQVAN GVVDAFVHTV EQYVTKPVDA KIQDRFAEGI LLTLIEDGPK ALKEPENYDV
RANVMWAATQ ALNGLIGAGV PQDWATHMLG HELTAMHGLD HAQTLAIVLP ALWNEKRDTK
RAKLLQYAER VWNITEGSDD ERIDAAIAAT RNFFEQLGVP THLSDYGLDG SSIPALLKKL
EEHGMTQLGE NHDITLDVSR RIYEAAR