Gene ECH74115_4989 details

Gene Information

Locus tag: ECH74115_4989
Symbol: tdh
ID: 6967787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4640257
End bp: 4641282
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 52%
IMG OID: 643388670
Product: L-threonine 3-dehydrogenase
Protein accession: YP_002273097
Protein GI: 209400006
COG category: [E] Amino acid transport and metabolism
    [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR00692] L-threonine 3-dehydrogenase
    [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
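
The coordinate and length fields in this record can be checked against each other. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The variable names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4640257
end_bp = 4641282
gene_length_bp = 1026
protein_length_aa = 341

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, codons, remainder, residues)
```

Under these assumptions the span equals the listed gene length, the length is a whole number of codons, and the residue count equals the listed protein length.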


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA 
CCGGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT
GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC
GTAGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA AGGCTTCAAA
ATCGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT
GGCGGTCGTA CCCATCTGTG CCGCAACACC ATTGGTGTCG GTGTTAACCG TCCGGGCTGC
TTCGCCGAAT ATCTGGTGAT TCCGGCGTTC AACGCTTTCA AAATCCCCGA CAATATCTCC
GACGACCTGG CTTCCATTTT TGATCCCTTC GGTAACGCCG TGCATACCGC GCTGTCGTTC
GATCTGGTGG GCGAAGATGT GCTGGTTTCT GGTGCAGGTC CGATAGGTAT TATGGCTGCG
GCGGTGGCGA AACACGTTGG TGCACGCAAT GTGGTGATCA CCGATGTTAA CGAATACCGC
CTTGAGCTGG CGCGCAAAAT GGGTATCACC CGTGCGGTTA ACGTCGCGAA AGAAAACCTT
AATGATGTGA TGGCTGAACT GGGCATGACC GAAGGCTTTG ATGTCGGTCT GGAAATGTCC
GGCGCGCCGC CAGCGTTTCG TACCATGCTC GATACCATGA ACCACGGCGG TCGTATTGCG
ATGCTGGGTA TTCCACCGTC TGACATGTCT ATCGACTGGA CCAAAGTGAT CTTTAAAGGC
TTGTTCATTA AAGGCATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG
CTGATTCAGT CTGGCCTCGA TCTCTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT
TTCCAGAAGG GCTTTGACGC TATGTGTTCG GGCCAGTCCG GGAAAGTTAT TCTGAGCTGG
GATTAA
 
Protein sequence
MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV 
VGHEYVGEVV GIGQEVKGFK IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC
FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA
AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMAELGMT EGFDVGLEMS
GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA
LIQSGLDLSP IITHRFSIDD FQKGFDAMCS GQSGKVILSW D
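
The relationship between the gene sequence and the protein sequence, and the meaning of the GC content field, can be illustrated with a short sketch. This is a minimal example, not the method the source database uses: it assumes the codon assignments of the standard genetic code (translation table 11 uses the same assignments for elongation), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first 60 bases of the gene sequence are used to keep the example short.

```python
# Codon table built in NCBI order (bases T, C, A, G).
# Assumption: standard codon assignments, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the record's layout."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 60 bases of the gene sequence listed above.
fragment = "ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA"
print(translate(fragment))  # MKALSKLKAEEGIWMTDVPV
print(round(gc_percent(fragment), 1))
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives the value to compare with the GC content field.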