Gene RSP_4033 details


Gene Information

Locus tag: RSP_4033
Symbol: tdh
ID: 3711803
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007488
Strand:
Start bp: 7843
End bp: 8856
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 70%
IMG OID: 640069306
Product: Zinc-containing alcohol dehydrogenase
Protein accession: YP_345173
Protein GI: 77404599
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
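
The start, end, gene length and protein length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the function name expected_lengths is hypothetical.

```python
from typing import Tuple


def expected_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumption: coordinates are inclusive, and the CDS ends with one
    stop codon that is not counted in the protein length.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = expected_lengths(7843, 8856)
print(gene_len, protein_len)
```

Under these assumptions the result agrees with the Gene Length (1014 bp) and Protein Length (337 aa) fields.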


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGCAG CGATTTTCGA CGGCAGCCCG ACGCTCCGGC TGACCGACCT TCCGATTCAG 
GAACCCGGCC CCGACGAGGT GCTGATCCGC ATCGATTCCG CCACGATCTG CGGCACCGAT
CAGCATATCC TCGAGGGCAA GTTCTGGGCG AAGCCCCCGG TGGTGCTGGG CCATGAATTC
GCGGGCTACG TCGAGCGCGT GGGCGAGCGG GTGCAGAACT GCAGGCCGGG CGATCTCGTC
TCGGTCGAGC CGCATGTCTA TTGCGGCTGC TGCAAGCCCT GCCGGCTCGG CAAGCCGCAT
CTCTGCCTCG ACCGTCTGGC TTGGGGGATC AACCTCAACG GCGGGTTCGA GCAATATGCC
ACCGTGCGGA TGGACACCGT CTATCAGGTG CCGGAAGGCA TCGGCCCCGA AGAGGCGGCT
CTGGGCGAGA TCACCGGCTG CTGCATGCAC GGGATCGACC GCGTGGGGGT CGAGCTCGGC
GATCTCGTCG TGATCCTCGG CGGCGGCGCG GCGGGCCTGA TCCTCGCCCG GCTGGCCGAG
CTGCGCGGGG CCGCGCGCAT CGTCATCTCC GAGCCGAACG CCGCCCGGCG CGAGCAGATC
CGCGCCTTCG GCTACCCGGA CGTGGTCGAC CCGCTGAACG AGGATCTGGC CGCCCGCATC
GGCGCCCTGA CCGACGGGCT CGGCGCCGAC GTGGTGATCG AGGCCGCGGG CCGCGCCGAG
ACGGCCGCGC AGGCGGTGGA GCTCGTCTGC CACGGCGGGC GCGTCCTCTT CTTCGGCGTG
GCCGCCCCCG GCACCATGGC CGCCATCGAG CCGAACCGGA TCTTCGCGCG CGAGATCACG
GTCGTGGGCT CGATCCGCAA CCCCTATACC CACCACCGCG TGATGGAGAT CCTGCCCCGG
CTCCGGCTGA AGGACATCGT CACCCACCGC TTCCCGCTGG AGAATATCGC CGAGGCCTTC
GACGCCGCCC ACCGCGGCGA GGGCCTCAAG ATCTGCATCA AGCCGAACGG CTGA
 
Protein sequence
MKAAIFDGSP TLRLTDLPIQ EPGPDEVLIR IDSATICGTD QHILEGKFWA KPPVVLGHEF 
AGYVERVGER VQNCRPGDLV SVEPHVYCGC CKPCRLGKPH LCLDRLAWGI NLNGGFEQYA
TVRMDTVYQV PEGIGPEEAA LGEITGCCMH GIDRVGVELG DLVVILGGGA AGLILARLAE
LRGAARIVIS EPNAARREQI RAFGYPDVVD PLNEDLAARI GALTDGLGAD VVIEAAGRAE
TAAQAVELVC HGGRVLFFGV AAPGTMAAIE PNRIFAREIT VVGSIRNPYT HHRVMEILPR
LRLKDIVTHR FPLENIAEAF DAAHRGEGLK ICIKPNG
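
Both sequences are displayed in space-separated blocks of ten residues, so the whitespace has to be removed before they can be used in other tools. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; the helper names clean_sequence and gc_fraction are hypothetical and not part of any database API, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First row of the gene sequence as displayed in this record.
first_row = "ATGAAGGCAG CGATTTTCGA CGGCAGCCCG ACGCTCCGGC TGACCGACCT TCCGATTCAG"
dna = clean_sequence(first_row)

print(len(dna))                   # number of bases in the row
print(dna.startswith("ATG"))      # does the row begin with the ATG codon?
print(round(gc_fraction(dna), 2))
```

Applying gc_fraction to the full cleaned gene sequence gives a value that can be compared with the GC content field of the record.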