Gene Rsph17029_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0472 
Symbol 
ID4895587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp492993 
End bp494117 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content73% 
IMG OID640111056 
Productalcohol dehydrogenase 
Protein accessionYP_001042360 
Protein GI126461246 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA CGGCGGCGGT GCTCGAACGC GACGGGATCG CGGGGAATTA TGCCGAGGAG 
CGGCCGCTCG CGATCCGCGA GCTCGATCTC GCGGAGCCGG GGCCGGGCGA GGTCCTGATC
CGGGTGGCCG CCGCGGGCAT CTGCCATTCC GACCTGTCGG TCATCAACGG CACGCGGCGG
CGGCCGCTGC CGATGGTGCT CGGCCACGAG GCCTCGGGCC ATGTCGAGGC TCTGGGCGAG
GGGGTCGAGG ATCTTGAGCC CGGCGACCAT GTCGTCTGCA TCTTCGCACC CGGCTGCGGC
CGCTGCACGC CCTGCGCCGA GGGGCGGCCT GCGCTCTGCG AGAAGGCGGC GCGCCATCAT
GCGGTGGGCG AACTGATGAC CGGGCACCGG CGGCTGTCGC TCGGCGGGCG GTCCGTGCAC
CATCACCTCG GCATCTCGGG CTTTGCCACC CATGCCGTGG TGGCGCGGCC GTCGCTGGTC
CGCGTCCCGC GCGAGGTCCC GCCCCATGTC TCGGCGCTTT TCTCCTGCGC CATGCTGACG
GGGGCGGGGG CCGTCTTCAA CACGGCGCAG ATCCGGCCCG GCTCGAAGGT CGCCGTGGTG
GGGCTGGGCG GCGTCGGCCT GTCCGCCATC CTCGGCGCGG CGGCGGCCGG AGCGGCCGAG
ATCGTGGCGA TCGACCCGTT TCCCGCCAAG ATGGAGGCCG CGCGCGCCAT GGGCGCGACG
CTCTCGGTGC CCGCGGACGG AGATACGGTG GCGGCGGTGC GCGACCTGAC GGCGGGCGGC
GTCGATTACG CCTTCGAGCT GGCGGGCTCG GTCCGGGCGC TCGAAACCGC CTTCGCCGTC
ACCCGCCGCG GCGGCATGAC GGTGACCGCG GGCCTGCCCC ACCCGGACGA CCGGATGTCG
CTCGAGGCAC TGAAGCTCGT GGCCGAGGAG CGCACCCTGA AAGGCAGCTA CATCGGATCC
TGCGTGCCCC AGCGCGACCT GCCGCGGATG CTGGCGCTGC ACCGCCGCGG CCTGCTGCCG
GTCGAGAAGA TGCTGACCCA CCGGCTGAAG CTCGACGAGA TCAACCTCGC GATGGACCGG
CTGGCCGAGG GCAGCGCGAT CCGGCAGGTG GTGGATCTCG GCTGA
 
Protein sequence
MKITAAVLER DGIAGNYAEE RPLAIRELDL AEPGPGEVLI RVAAAGICHS DLSVINGTRR 
RPLPMVLGHE ASGHVEALGE GVEDLEPGDH VVCIFAPGCG RCTPCAEGRP ALCEKAARHH
AVGELMTGHR RLSLGGRSVH HHLGISGFAT HAVVARPSLV RVPREVPPHV SALFSCAMLT
GAGAVFNTAQ IRPGSKVAVV GLGGVGLSAI LGAAAAGAAE IVAIDPFPAK MEAARAMGAT
LSVPADGDTV AAVRDLTAGG VDYAFELAGS VRALETAFAV TRRGGMTVTA GLPHPDDRMS
LEALKLVAEE RTLKGSYIGS CVPQRDLPRM LALHRRGLLP VEKMLTHRLK LDEINLAMDR
LAEGSAIRQV VDLG