Gene EcolC_0092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0092 
Symboltdh 
ID6068362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp97234 
End bp98259 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content53% 
IMG OID641599496 
ProductL-threonine 3-dehydrogenase 
Protein accessionYP_001723105 
Protein GI170018151 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR00692] L-threonine 3-dehydrogenase
[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.212636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.425053 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTTCCTGTA 
CCGGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT
GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC
GTGGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA GGGCTTCAAG
ATTGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT
GGCGGTCGTA CCCATCTGTG CCGCAACACG ATCGGCGTCG GCGTTAACCG CCCGGGCTGC
TTCGCCGAAT ATCTGGTGAT TCCGGCGTTC AACGCCTTCA AAATTCCCGA CAATATCTCC
GACGACCTGG CTTCCATTTT TGATCCCTTC GGTAATGCCG TGCATACCGC GCTGTCGTTC
GATCTGGTTG GCGAGGATGT GCTGGTTTCT GGTGCAGGCC CGATTGGTAT TATGGCAGCG
GCGGTGGCGA AACACGTTGG TGCACGCAAT GTGGTGATCA CTGATGTTAA CGAATACCGC
CTTGAGCTGG CGCGTAAAAT GGGTATCACC CGTGCGGTTA ACGTCGCCAA AGAAAATCTC
AATGACGTGA TGGCGGAGTT AGGCATGACC GAAGGTTTTG ATGTCGGTCT GGAAATGTCC
GGTGCGCCGC CAGCGTTTCG TACCATGCTT GACACCATGA ACCACGGCGG CCGTATTGCG
ATGCTGGGTA TTCCGCCGTC TGATATGTCT ATCGACTGGA CCAAAGTGAT CTTTAAAGGC
TTGTTCATTA AAGGTATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG
CTGATTCAGT CTGGCCTCGA TCTTTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT
TTCCAGAAGG GCTTTGACGC TATGCGTTCG GGCCAGTCCG GGAAAGTTAT TCTGAGCTGG
GATTAA
 
Protein sequence
MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV 
VGHEYVGEVV GIGQEVKGFK IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC
FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA
AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMAELGMT EGFDVGLEMS
GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA
LIQSGLDLSP IITHRFSIDD FQKGFDAMRS GQSGKVILSW D