Gene Rsph17029_3833 details


Gene Information

Locus tag: Rsph17029_3833
Symbol:
ID: 4898267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 962411
End bp: 963595
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 67%
IMG OID: 640114437
Product: alcohol dehydrogenase
Protein accession: YP_001045685
Protein GI: 126464572
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
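
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per amino acid, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(962411, 963595)
print(length)                     # 1185
print(protein_length_aa(length))  # 394
```

Under these assumptions the computed values agree with the Gene Length (1185 bp) and Protein Length (394 aa) fields listed above.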


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.24597
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.375314
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCAC TCACCTGGCA CGGAAAGCAC GATGTCCGCG TCGAGACCCA TCCCGATCCC 
GAGATCGTCA ATCCGCGCGA CGCGATCATC GAGGTGACGG CCACGGCGAT CTGCGGCTCG
GATCTGCATC TCTACGACGG GGTGATCCCG GGGCTGATGT CGGGCGACAT CCTCGGCCAC
GAGTTCATGG GCCGCGTCGT CGAGACCGGG CCGAAAAGCA CGCTGAAGAA GGGCCAGCGG
GTCGTGGTGC CCTTCACGAT CTCCTGCGGG AAGTGCTTCT TCTGCGAGCA TCAGCTCTTT
TCCGCCTGCG ACAATTCCAA CCCCGCGGAG AAGCAGGACC TGTCCGAGCC TCTCTATGGC
CATGCGGTGT CGGGGCTCTT CGGCTATTCC CATCTGACGG GCGGCTATCC GGGCGGGCAG
GCCGAATATG TGCGCGTGCC CTATTCGGAC GTGGGACCGA TCGTGATCCC GGACGGGCTC
GAGGACGAGC AGGTGCTGTT CCTGTCGGAC ATCCTGCCGA CGGGCTGGAT GGCCGCCGAG
AATGCCGGGA TCGAGGAGGG CGACACGGTC GCCGTCTGGG GCTGCGGCCC GGTCGGCCTC
TTCGCGATCC AGTCGGCCCT GCTGATGGGG GCGGGCAAGG TCATCGCCAT CGACGAATAT
CCCAAGCGGC TGGCGCTGGC GCGCAGGCTC GGGGCGGAAG TGATCGACTT CCGGCGCACG
AAGGTGCTCG AGGCGCTGAT GGAGATGTCG GGGGGCCTCG GCCCCGATGC GGTGATCGAT
GCCGTGGGGA TGGAGGCGCA TGGCTTCATG CCCGACACGC TGATGGACAA CATGAAGCAG
CGCGTGGGGA TCGGCGCGGA CAGCGGGCAC GCGCTGCGCG AGGCGATCCT CGCGGTGCGC
AAGGGCGGCC GCGTCTCGGT GCCCGGCGTC TATGGCGGCT TCCTCGACAA GTTTCCGCTC
GGCGCGCTGA TGGAGAAGGG CCTGACCGTG AAGACCGGCC AGACCCATGT GCAGCGATAC
ACCGAGGAGC TTCTGCGCCG GATCGGCGAC GGCGAGATCG ACACGACCTT CCTGATCTCG
CACCGCCTGC CGCTCGAGGA GGCGGCGCGG GGCTACGAGA ACTTCCGCTT CAACCAGAAC
GAATGGACCA AGGTGGTGCT GAAGCCGGGC CTGACCGGCG CCTGA
 
Protein sequence
MRALTWHGKH DVRVETHPDP EIVNPRDAII EVTATAICGS DLHLYDGVIP GLMSGDILGH 
EFMGRVVETG PKSTLKKGQR VVVPFTISCG KCFFCEHQLF SACDNSNPAE KQDLSEPLYG
HAVSGLFGYS HLTGGYPGGQ AEYVRVPYSD VGPIVIPDGL EDEQVLFLSD ILPTGWMAAE
NAGIEEGDTV AVWGCGPVGL FAIQSALLMG AGKVIAIDEY PKRLALARRL GAEVIDFRRT
KVLEALMEMS GGLGPDAVID AVGMEAHGFM PDTLMDNMKQ RVGIGADSGH ALREAILAVR
KGGRVSVPGV YGGFLDKFPL GALMEKGLTV KTGQTHVQRY TEELLRRIGD GEIDTTFLIS
HRLPLEEAAR GYENFRFNQN EWTKVVLKPG LTGA
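
The gene and protein sequences above can be related, and the GC content recomputed, with a short script. The following is a minimal sketch and not the method used to produce this record. It assumes the sequence is pasted as text with the space-separated blocks of 10 shown above, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Amino-acid assignments for translation table 11, listed in the order
# produced by iterating T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCGCGCAC TCACCTGGCA CGGAAAGCAC"))  # MRALTWHGKH
```

Translating the first 30 bases reproduces the first 10 residues of the protein sequence. Note that this sketch does not special-case alternative start codons such as GTG or TTG, which table 11 reads as methionine at the initiation position. That is not needed here because the gene begins with ATG.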