Gene EcHS_A3188 details


Gene Information

Locus tag: EcHS_A3188
Symbol: yqhD
ID: 5593101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3200186
End bp: 3201349
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 55%
IMG OID: 640922308
Product: alcohol dehydrogenase yqhD
Protein accession: YP_001459806
Protein GI: 157162488
COG category: [C] Energy production and conversion
COG ID: [COG1979] Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family
TIGRFAM ID:
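
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(3200186, 3201349)
print(length)                  # 1164
print(protein_length(length))  # 387
```

Both results match the Gene Length and Protein Length fields listed for EcHS_A3188.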


Plasmid Coverage information

Num covering plasmid clones: 75
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAACT TTAATCTGCA CACCCCAACC CGCATTCTGT TTGGTAAAGG CGCAATCGCT 
GGTTTACGCG AACAAATTCC TCACGATGCT CGCGTATTGA TTACCTACGG CGGCGGCAGC
GTGAAAAAAA CCGGCGTTCT CGATCAAGTT CTGGATGCCC TGAAAGGCAT GGACGTGCTG
GAATTTGGCG GTATTGAGCC AAACCCGGCT TATGAAACGC TGATGAACGC CGTGAAACTG
GTTCGCGAAC AGAAAGTGAC TTTCCTGCTG GCGGTTGGCG GCGGTTCTGT ACTGGACGGC
ACCAAATTTA TCGCCGCAGC GGCTAACTAT CCGGAAAATA TCGATCCGTG GCACATTCTG
CAAACGGGCG GTAAAGAGAT TAAAAGCGCC ATCCCGATGG GCTGTGTGCT GACGCTGCCA
GCAACCGGTT CAGAATCCAA CGCAGGCGCG GTGATCTCCC GTAAAACCAC AGGCGACAAG
CAGGCGTTCC ATTCTGCCCA TGTTCAGCCG GTATTTGCCG TGCTCGATCC GGTTTATACC
TACACCCTGC CGCCGCGTCA GGTGGCTAAC GGCGTAGTGG ACGCCTTTGT ACACACCGTG
GAACAGTATG TTACCAAACC GGTTGATGCC AAAATTCAGG ACCGTTTCGC AGAAGGCATT
TTGCTGACGC TAATCGAAGA TGGTCCGAAA GCCCTGAAAG AGCCAGAAAA CTACGATGTG
CGCGCCAACG TCATGTGGGC GGCGACTCAG GCGCTGAACG GTTTGATTGG CGCTGGCGTA
CCGCAGGACT GGGCAACGCA TATGCTGGGC CACGAACTGA CTGCGATGCA CGGTCTGGAT
CACGCGCAAA CACTGGCTAT CGTCCTGCCT GCACTGTGGA ATGAAAAACG CGATACCAAG
CGCGCTAAGC TGCTGCAATA TGCTGAACGC GTCTGGAACA TCACTGAAGG TTCCGATGAT
GAGCGTATTG ACGCCGCGAT TGCCGCAACC CGCAATTTCT TTGAGCAATT AGGCGTGCCT
ACCCACCTCT CCGACTACGG TCTGGACGGC AGCTCCATCC CGGCTTTGCT GAAAAAACTG
GAAGAGCACG GCATGACCCA ACTGGGCGAA AATCATGACA TTACGTTGGA TGTCAGCCGC
CGTATATACG AAGCCGCCCG CTAA
 
Protein sequence
MNNFNLHTPT RILFGKGAIA GLREQIPHDA RVLITYGGGS VKKTGVLDQV LDALKGMDVL 
EFGGIEPNPA YETLMNAVKL VREQKVTFLL AVGGGSVLDG TKFIAAAANY PENIDPWHIL
QTGGKEIKSA IPMGCVLTLP ATGSESNAGA VISRKTTGDK QAFHSAHVQP VFAVLDPVYT
YTLPPRQVAN GVVDAFVHTV EQYVTKPVDA KIQDRFAEGI LLTLIEDGPK ALKEPENYDV
RANVMWAATQ ALNGLIGAGV PQDWATHMLG HELTAMHGLD HAQTLAIVLP ALWNEKRDTK
RAKLLQYAER VWNITEGSDD ERIDAAIAAT RNFFEQLGVP THLSDYGLDG SSIPALLKKL
EEHGMTQLGE NHDITLDVSR RIYEAAR
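
The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch in plain Python rather than an established bioinformatics library. It assumes the space-separated 10-base blocks shown above and uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code for elongation. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
import itertools

# Codon table in the conventional TCAG ordering. Translation table 11 shares
# these assignments with the standard code; it differs only in which codons
# may act as alternative starts, which does not matter here (the CDS starts with ATG).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last three blocks of the gene sequence above.
print(translate("ATGAACAACT TTAATCTGCA CACCCCAACC"))  # MNNFNLHTPT
print(translate("CGTATATACG AAGCCGCCCG CTAA"))        # RIYEAAR
```

The two outputs correspond to the first ten and last seven residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein Length and GC content fields to be checked in the same way.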