Gene EcSMS35_3953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3953 
Symboltdh 
ID6143740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4031537 
End bp4032562 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content52% 
IMG OID641618779 
ProductL-threonine 3-dehydrogenase 
Protein accessionYP_001745918 
Protein GI170683839 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR00692] L-threonine 3-dehydrogenase
[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0503034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.824331 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA 
CCAGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT
GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC
GTGGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA AGGCTTCAAG
ATTGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT
GGCGGTCGTA CCCATCTGTG CCGCAACACG ATCGGCGTCG GCGTTAACCG TCCGGGCTGC
TTTGCCGAGT ATCTGGTGAT TCCGGCGTTC AACGCTTTCA AAATCCCCGA CAATATCTCC
GATGATCTGG CTTCCATTTT TGATCCCTTC GGTAACGCCG TGCATACCGC GCTGTCGTTT
GATCTGGTGG GCGAGGATGT GCTGGTCTCT GGTGCAGGCC CGATTGGTAT TATGGCGGCG
GCGGTGGCGA AACACGTTGG TGCACGTAAC GTGGTGATTA CTGATGTTAA CGAATACCGC
CTTGAGCTGG CGCGTAAAAT GGGTATCACC CGTGCGGTTA ACGTCGCGAA AGAAAACCTT
AATGATGTGA TGACAGAACT GGGCATGACC GAAGGCTTTG ATGTCGGTCT GGAAATGTCC
GGTGCGCCGC CAGCGTTTCG TACCATGCTC GACACCATGA ACCACGGCGG TCGTATTGCG
ATGCTGGGTA TTCCACCGTC CGATATGTCT ATCGACTGGA CCAAAGTAAT CTTTAAAGGC
TTGTTCATTA AAGGTATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG
CTGATTCAGT CTGGCCTCGA TCTCTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT
TTCCAGAAGG GCTTTGACGC TATGCGTTCG GGCCAGTCCG GGAAAGTTAT TCTGAGCTGG
GATTAA
 
Protein sequence
MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV 
VGHEYVGEVV GIGQEVKGFK IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC
FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA
AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMTELGMT EGFDVGLEMS
GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA
LIQSGLDLSP IITHRFSIDD FQKGFDAMRS GQSGKVILSW D