Gene Rleg_2958 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2958 
Symboltdh 
ID8015744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2947482 
End bp2948519 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content62% 
IMG OID644825528 
ProductL-threonine 3-dehydrogenase 
Protein accessionYP_002976756 
Protein GI241205660 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR00692] L-threonine 3-dehydrogenase
[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.108215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.634974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACA TGATGAAGGC GCTGGTCAAA GCAAAACCCG AGGTCGGGCT TTGGATGGAG 
AATGTGCCGA TGCCCGAGGT CGGGCCGAAC GACGTGCTTA TCCGGGTGAA GAAATCGGCG
ATCTGCGGCA CTGACGTCCA TATCTGGAAC TGGGACCAGT GGGCGCAGAA GACCATTCCG
GTGCCGATGG TGGTTGGCCA TGAATTCTCC GGCGAGATCG CCGAGATCGG TTCGGCGGTC
ACCCGCTATC ATATCGGCGA GCGGGTCTCC GGCGAGGGGC ATATCGTCTG CGGCAAGTGC
CGCAACTGCC GGGCGGGCAG GGGGCATCTC TGCCGCAACA CGCTCGGTGT CGGCGTCAAC
CGCCCGGGTT CGTTCGGTGA GTTCGTCTGC ATTCCGGAAA GCAATGTCGT GCCGATCCCG
GATGATATTT CCGACGAGAT CGCCGCGATC TTCGATCCGT TCGGCAATGC CGTGCACACC
GCGCTTTCCT TCGATCTCGT CGGTGAGGAC GTGCTCGTCA CCGGCGCCGG GCCGATCGGC
ATCATGGGCG CGCTCGTCGC CAAACGATCC GGCGCCCGCA AGGTCGTCAT CACCGATATC
AATCCGCACC GGCTGGAGCT GGCGCGCAAG CTCGGCATCG ACCACGTCGT CGACGCATCG
AAGGAAAACC TCGCCGACGT GATGAAGGCG ATCGGCATGA CGGAGGGTTT CGACGTCGGG
CTCGAAATGT CGGGGGCCGC ACCTGCCTTC CGCGACATGA TCGACAAGAT GAACAATGGC
GGCAAGATCG CCATCCTCGG CATCGCGCCG GCGGGCTTCG AAATCGACTG GAACAAGGTG
ATCTTCAAGA TGCTCAATCT CAAGGGCATC TACGGCCGCG AGATGTTCGA GACCTGGTAC
AAGATGATCG CCTTCGTCCA AGGCGGCCTC GATCTCGCGC CCATCATCAC CCACCGGATC
GGCATCGACG ATTTCCGCGA CGGCTTCGAG GCGATGCGGT CGGGCAATTC CGGCAAGGTT
GTGATGGACT GGATGTGA
 
Protein sequence
MSNMMKALVK AKPEVGLWME NVPMPEVGPN DVLIRVKKSA ICGTDVHIWN WDQWAQKTIP 
VPMVVGHEFS GEIAEIGSAV TRYHIGERVS GEGHIVCGKC RNCRAGRGHL CRNTLGVGVN
RPGSFGEFVC IPESNVVPIP DDISDEIAAI FDPFGNAVHT ALSFDLVGED VLVTGAGPIG
IMGALVAKRS GARKVVITDI NPHRLELARK LGIDHVVDAS KENLADVMKA IGMTEGFDVG
LEMSGAAPAF RDMIDKMNNG GKIAILGIAP AGFEIDWNKV IFKMLNLKGI YGREMFETWY
KMIAFVQGGL DLAPIITHRI GIDDFRDGFE AMRSGNSGKV VMDWM