Gene Rsph17029_1037 details


Gene Information

Locus tag: Rsph17029_1037
Symbol: tdh
ID: 4896461
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1071301
End bp: 1072326
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 68%
IMG OID: 640111624
Product: L-threonine 3-dehydrogenase
Protein accession: YP_001042920
Protein GI: 126461806
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR00692] L-threonine 3-dehydrogenase; [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
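
The coordinate and length fields above are related by simple arithmetic. The following minimal sketch checks that relationship. It assumes (this is not stated in the record) that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1071301
end_bp = 1072326

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon, which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length)     # 1026
print(remainder)       # 0
print(protein_length)  # 341
```

Under these assumptions the computed values agree with the recorded gene length (1026 bp) and protein length (341 aa).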


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.210217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGCAC TGGTGAAGGC CAAGGCGGAG CCGGGCCTCT GGATGGAAGA GCGGCCCGTG 
CCCGAGATCG GCCCCGACGA GGTGCTCATC CGGGTCCGCA AGACGGGGAT CTGCGGCACC
GATGTCCATA TCTGGAACTG GGACGACTGG GCGGCGAAGA CGGTGCCGGT GCCGCTCGTC
ACCGGGCACG AGTTTGCGGG CGAGATCGTC GAGGTGGGCC GCGACGTGCG CGACCTCAGC
CCGGGCCAGC GCTGCTCGGG CGAGGGCCAT CTGATCGGCC ACCATTCGCG GCAGGTGCGG
GCCGGGCGCT TCCATCTCGA TCCCGAGACG CGCGGCATCG GCGTCAATGT GCCGGGCGCC
TTCGCCGACT ATCTGCGGCT TCCCGCCTTC AACGTGGTGC CGCTGCCCGA TGCCATCGAC
GACGAGGTGG GGGCGATCCT CGATCCCCTC GGCAATGCCG TTCACACGGC GCTCAGCTTC
GATCTGGTGG GAGAGGATGT GCTCGTGACC GGCGCAGGCC CCATCGGGAT CATGGCCGCG
GCTGTGGCGC GGCATGTCGG CGCGCGCCAT GTCGTCATCA CCGACGTCAA TGCCGACCGG
TTGCGGCTGT CAACCGAGGT GGCCGATGTG GTGCCGGTCA ATGTGGCGAC CGAGGATCTG
CGTTCGGTGA TGGGCCGGCT GAAGATCGTG CAGGGCTTCG ACGTGGGGAT GGAAATGTCG
GGCGCGCCCG CGGGCTTCGA CCAGATGGTC GAAGCGATGG TGATGGGCGG TCGCATCGCG
ATGCTGGGGA TCCCGCCCGG CCGCAGCCCC GTGGACTGGA GCAGGATCGT CTTCAAGGCG
CTGACCATCA AGGGCGTCTA CGGCCGCGAG ATCTTCGAGA CCTGGTACAA GATGATCGCG
ATGCTGGAGA ACGGGCTCGA TATCCGGCGC GTCATCACCC ACCGCTTTCC TGTGGCGGAT
TTCGCCGAGG GTTTTGCCGC CATGCGCAGC GGCGCGTCGG GCAAGGTGGT GCTGGACTGG
GGCTGA
 
Protein sequence
MRALVKAKAE PGLWMEERPV PEIGPDEVLI RVRKTGICGT DVHIWNWDDW AAKTVPVPLV 
TGHEFAGEIV EVGRDVRDLS PGQRCSGEGH LIGHHSRQVR AGRFHLDPET RGIGVNVPGA
FADYLRLPAF NVVPLPDAID DEVGAILDPL GNAVHTALSF DLVGEDVLVT GAGPIGIMAA
AVARHVGARH VVITDVNADR LRLSTEVADV VPVNVATEDL RSVMGRLKIV QGFDVGMEMS
GAPAGFDQMV EAMVMGGRIA MLGIPPGRSP VDWSRIVFKA LTIKGVYGRE IFETWYKMIA
MLENGLDIRR VITHRFPVAD FAEGFAAMRS GASGKVVLDW G
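
The gene and protein sequences above are printed in blocks of 10 characters, 60 per line. As a rough consistency check, the sketch below strips the spacing and translates the first line of the gene sequence with a hand-built codon table. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical, not part of any tool referenced by this record.

```python
# Compact codon table: bases in TCAG order, amino acids in the matching order.
# Amino acid assignments are those shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon.

    Note: this does not special-case alternative start codons (e.g. GTG),
    which table 11 reads as Met at the first position. The gene above
    starts with ATG, so that case does not arise here.
    """
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence, copied from the record above.
first_line = "ATGCGGGCAC TGGTGAAGGC CAAGGCGGAG CCGGGCCTCT GGATGGAAGA GCGGCCCGTG"

print(len(clean(first_line)))  # 60
print(translate(first_line))   # MRALVKAKAEPGLWMEERPV
print(translate("GGCTGA"))     # G*
```

The translated first line matches the first 20 residues of the protein sequence, and the final six bases of the gene (GGCTGA) translate to the last residue (G) followed by a stop codon.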