Gene SbBS512_E4044 details


Gene Information

Locus tag: SbBS512_E4044
Symbol: tdh
ID: 6270320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3776329
End bp: 3777354
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 53%
IMG OID: 641727884
Product: L-threonine 3-dehydrogenase
Protein accession: YP_001882316
Protein GI: 187731381
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR00692] L-threonine 3-dehydrogenase
            [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
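The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3776329, 3777354)
print(length)                  # 1026
print(protein_length(length))  # 341
```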


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.100127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCGT TATCCAAACT GAAAGCGGAA GAGGGCATCT GGATGACCGA CGTCCCTGTA 
CCGGAACTCG GGCATAACGA TCTGCTGATT AAAATCCGTA AAACAGCCAT CTGCGGGACT
GACGTTCACA TCTATAACTG GGATGAGTGG TCGCAAAAAA CCATCCCGGT GCCGATGGTC
GTAGGCCATG AATATGTCGG TGAAGTGGTA GGTATTGGTC AGGAAGTGAA AGGCTTCAAA
ATCGGCGATC GCGTTTCTGG CGAAGGCCAT ATCACCTGCG GTCACTGTCG CAACTGTCGT
GGCGGTCGTA CCCATCTGTG CCGCAACACG ATCGGCGTCG GCGTTAACCG TCCGGGCTGC
TTCGCCGAAT ATCTGGTGAT CCCGGCGTTC AACGCCTTCA AAATCCCAGA CAATATCTCC
GACGACCTGG CTTCCATTTT TGATCCCTTC GGTAACGCCG TGCATACCGC GCTGTCGTTC
GATCTGGTTG GCGAGGATGT GCTGGTCTCT GGTGCAGGTC CGATAGGTAT TATGGCTGCG
GCGGTGGCGA AACACGTTGG TGCACGCAAT GTGGTGATCA CTGATGTTAA CGAATACCGC
CTTGAGCTGG CGCGCAAAAT GGGTATCACC CGTGCGGTTA ACGTCGCGAA AGAAAACCTT
AATGATGTGA TGGCTGAACT GGGCATGACC GAAGGCTTTG ATGTCGGTCT GGAAATGTCC
GGTGCGCCGC CAGCGTTTCG TACCATGCTT GACACCATGA ACCACGGCGG CCGTATTGCG
ATGCTGGGTA TTCCGCCGTC TGATATGTCT ATCGACTGGA CCAAAGTGAT CTTTAAAGGC
TTGTTCATTA AAGGTATTTA CGGTCGTGAG ATGTTTGAAA CCTGGTACAA GATGGCGGCG
CTGATTCAGT CTGGCCTCGA TCTCTCGCCG ATCATTACCC ATCGTTTCTC TATCGATGAT
TTCCAGAAGG GCTTTGACGC TATGCGTTCG GGCCAGTCCG GGAAAGTAAT TCTGAGCTGG
GATTAA
 
Protein sequence
MKALSKLKAE EGIWMTDVPV PELGHNDLLI KIRKTAICGT DVHIYNWDEW SQKTIPVPMV 
VGHEYVGEVV GIGQEVKGFK IGDRVSGEGH ITCGHCRNCR GGRTHLCRNT IGVGVNRPGC
FAEYLVIPAF NAFKIPDNIS DDLASIFDPF GNAVHTALSF DLVGEDVLVS GAGPIGIMAA
AVAKHVGARN VVITDVNEYR LELARKMGIT RAVNVAKENL NDVMAELGMT EGFDVGLEMS
GAPPAFRTML DTMNHGGRIA MLGIPPSDMS IDWTKVIFKG LFIKGIYGRE MFETWYKMAA
LIQSGLDLSP IITHRFSIDD FQKGFDAMRS GQSGKVILSW D
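The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by computing its GC fraction. The sketch below is one way to do that in plain Python, under stated assumptions: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in the start codons it permits). Only the first line of the gene sequence is used here for illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nt of the gene sequence as listed in the record.
first_30 = "ATGAAAGCGT TATCCAAACT GAAAGCGGAA"
print(translate(first_30))  # MKALSKLKAE
print(round(gc_fraction(first_30), 2))
```

Passing the full gene sequence to `translate` and comparing the result against the protein sequence (with spaces removed) would extend the same check to the whole record.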