Gene ECH74115_4322 details

Gene Information

Locus tag: ECH74115_4322
Symbol: yqhD
ID: 6971424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3998794
End bp: 3999957
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 55%
IMG OID: 643388051
Product: alcohol dehydrogenase yqhD
Protein accession: YP_002272489
Protein GI: 209399675
COG category: [C] Energy production and conversion
COG ID: [COG1979] Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family
TIGRFAM ID:
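
The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon (neither is stated in the record), and the function names are hypothetical.

```python
# Values copied from the Gene Information record above.
START_BP = 3998794
END_BP = 3999957
REPORTED_GENE_LENGTH = 1164    # bp
REPORTED_PROTEIN_LENGTH = 387  # aa


def gene_length(start_bp, end_bp):
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length == REPORTED_GENE_LENGTH)                     # True
print(protein_length(length) == REPORTED_PROTEIN_LENGTH)  # True
```

Under these assumptions, 1164 bp corresponds to 388 codons, of which 387 encode residues and one is the stop codon.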


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAACAACT TTAATCTGCA CACCCCAACC CGCATTCTGT TTGGTAAAGG CGCAATCGCT 
GGTTTACGCG AACAAATTCC TCACGATGCT CGCGTATTGA TTACCTACGG TGGCGGCAGC
GTGAAAAAAA CCGGCGTTCT CGATCAAGTT CTGGATGCCC TGAAAGGCAT GGACGTGCTG
GAATTTGGCG GTATTGAGCC AAACCCGGCT TATGAAACGC TGATGAACGC CGTGAAACTG
GTTCGCGAAC AGAAAGTGAC TTTCCTGCTG GGGGTTGGCG GCGGTTCTGT ACTGGACGGC
ACCAAATTTA TCGCCGCAGC GGCTAACTAT CCGGAAAATA TCGATCCGTG GCACATTCTG
CAAACGGGCG GTAAAGAGAT TAAAAGCGCC ATCCCGATGG GCTGTGTGCT GACGCTGCCA
GCAACCGGTT CAGAATCCAA CGCCGGTGCG GTGATCTCCC GTAAAACCAC AGGCGATAAG
CAGGCGTTCC ATTCAGCCCA TGTTCAGCCG GTATTTGCCG TGCTCGATCC GGTTTATACC
TACACCCTGC CGCCGCGTCA GGTGGCTAAC GGCGTAGTGG ACGCCTTTGT ACACACCGTG
GAACAGTATG TTACCAAACC GGTTGATGCC AAAATTCAGG ACCGTTTCGC AGAAGGCATT
TTGCTGACGC TGATCGAAGA TGGTCCGAAA GCCCTGAAAG AGCCAGAAAA CTACGATGTG
CGCGCCAACG TCATGTGGGC GGCGACGCAG GCGCTGAACG GTTTGATTGG CGCTGGCGTA
CCGCAGGACT GGGCAACGCA TATGCTGGGC CACGAACTGA CTGCGATGCA CGGTCTGGAT
CACGCGCAAA CACTGGCTAT CGTCCTGCCT GCACTGTGGA ATGAAAAACG CGATACCAAG
CGCGCTAAGC TGCTGCAATA TGCTGAACGC GTCTGGAACA TCACTGAAGG TTCCGACGAT
GAGCGTATTG ACGCCGCGAT TGCCGCAACC CGCAATTTCT TTGAGCAATT AGGCGTGCCT
ACCCACCTCT CCGACTACGG TCTGGACGGC AGCTCCATCC CGGCTTTGCT GAAAAAACTG
GAAGAGCACG GCATGACCCA ACTGGGCGAA AATCATGACA TTACGCTGGA TGTCAGCCGC
CGTATATACG AAGCCGCCCG CTAA
 
Protein sequence
MNNFNLHTPT RILFGKGAIA GLREQIPHDA RVLITYGGGS VKKTGVLDQV LDALKGMDVL 
EFGGIEPNPA YETLMNAVKL VREQKVTFLL GVGGGSVLDG TKFIAAAANY PENIDPWHIL
QTGGKEIKSA IPMGCVLTLP ATGSESNAGA VISRKTTGDK QAFHSAHVQP VFAVLDPVYT
YTLPPRQVAN GVVDAFVHTV EQYVTKPVDA KIQDRFAEGI LLTLIEDGPK ALKEPENYDV
RANVMWAATQ ALNGLIGAGV PQDWATHMLG HELTAMHGLD HAQTLAIVLP ALWNEKRDTK
RAKLLQYAER VWNITEGSDD ERIDAAIAAT RNFFEQLGVP THLSDYGLDG SSIPALLKKL
EEHGMTQLGE NHDITLDVSR RIYEAAR
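
The gene and protein sequences above are related by translation table 11. As a minimal sketch (not the database's own method), the code below translates the first and last lines of the gene sequence and reproduces the matching ends of the protein sequence. The helper names are hypothetical. The codon table uses the standard amino acid assignments, which table 11 shares with the standard code; alternative start codons are not handled, which is harmless here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(sequence):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(sequence.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence above (the last line is in frame,
# because the 19 preceding lines hold 60 bases each).
FIRST_LINE = "ATGAACAACT TTAATCTGCA CACCCCAACC CGCATTCTGT TTGGTAAAGG CGCAATCGCT"
LAST_LINE = "CGTATATACG AAGCCGCCCG CTAA"

print(translate(FIRST_LINE))  # MNNFNLHTPTRILFGKGAIA
print(translate(LAST_LINE))   # RIYEAAR
```

Passing the full gene sequence to `translate` in the same way gives a string that can be compared against the protein sequence after removing its spacing with `clean`.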