Gene EcHS_A3828 details


Gene Information

Locus tag: EcHS_A3828
Symbol: tdh
ID: 5593375
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3823097
End bp: 3824122
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 53%
IMG OID: 640922940
Product: L-threonine 3-dehydrogenase
Protein accession: YP_001460418
Protein GI: 157163100
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR00692] L-threonine 3-dehydrogenase; [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
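
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3823097
end_bp = 3824122
gene_length_bp = 1026
protein_length_aa = 341


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_protein_length(gene_length):
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


print(span_length(start_bp, end_bp))          # 1026
print(coding_protein_length(gene_length_bp))  # 341
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.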


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.0267024
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA 
CCGGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT
GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC
GTGGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA GGGCTTCAAG
ATTGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT
GGCGGTCGTA CCCATCTGTG CCGCAACACG ATCGGCGTCG GCGTTAACCG CCCGGGCTGC
TTCGCCGAAT ATCTGGTGAT TCCGGCGTTC AACGCCTTCA AAATTCCCGA CAATATCTCC
GACGACCTGG CTTCCATTTT TGATCCCTTC GGTAATGCCG TGCATACCGC GCTGTCGTTC
GATCTGGTTG GCGAGGATGT GCTGGTTTCT GGTGCAGGCC CGATTGGTAT TATGGCAGCG
GCGGTGGCGA AACACGTTGG TGCACGCAAT GTGGTGATCA CTGATGTTAA CGAATACCGC
CTTGAGCTGG CGCGTAAAAT GGGTATCACC CGTGCGGTTA ACGTCGCCAA AGAAAATCTC
AATGACGTGA TGGCGGAGTT AGGCATGACC GAAGGTTTTG ATGTCGGTCT GGAAATGTCC
GGTGCGCCGC CAGCGTTTCG TACCATGCTT GACACCATGA ACCACGGCGG CCGTATTGCG
ATGCTGGGTA TTCCGCCGTC TGATATGTCT ATCGACTGGA CCAAAGTGAT CTTTAAAGGC
TTGTTCATTA AAGGTATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG
CTGATTCAGT CTGGCCTCGA TCTTTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT
TTCCAGAAGG GCTTTGACGC TATGCGTTCG GGCCAGTCCG GGAAAGTTAT TCTGAGCTGG
GATTAA
 
Protein sequence
MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV 
VGHEYVGEVV GIGQEVKGFK IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC
FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA
AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMAELGMT EGFDVGLEMS
GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA
LIQSGLDLSP IITHRFSIDD FQKGFDAMRS GQSGKVILSW D
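
As a rough illustration of how the gene sequence relates to the protein sequence, the following sketch strips the 10-character grouping and translates codons one at a time. It is a minimal example, not the database's own pipeline. The helper names (`clean`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in codon meanings).

```python
# Codon table in TCAG order; amino acid assignments of the standard code,
# which are the same as those used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence as printed in this record.
first_line = "ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA"
print(translate(first_line))  # MKALSKLKAEEGIWMTDVPV

# Last six bases of the gene sequence: the final residue and the stop codon.
print(translate("GATTAA"))    # D*
```

The first 20 residues produced this way match the start of the protein sequence above, and the final codon pair gives the terminal D followed by a stop.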