Gene Rsph17029_3799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3799 
Symbol 
ID4898264 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp927145 
End bp928434 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content71% 
IMG OID640114403 
Productputative L-sorbosone dehydrogenase 
Protein accessionYP_001045651 
Protein GI126464538 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2133] Glucose/sorbosone dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.410038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.17604 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTCC TGGCGCGTGC CACCGCCATC GTCGGCAATA CGATGGTTCT GATGCGGCGG 
TTCGGATCGC CGGGCACCCA GGCGATCGGG CAGAGCCCGG CCATCCCCGA GGCCCAGAAG
CAGGGCATCA TGACCCTCAA GATGCCCGTG GCCAAGGGCT GGGCGCCGGG CCATCTGCCC
ACGCCCGCAC CGGGCCTCCA GGTCAATGCC TTCGCCCGGG ATCTGGAGCA TCCGCGCTGG
ATCGAGGTGC TGCCCTCGGG CGACGTGCTG GTTGCCGAGG CACGCCAGCT TCCCACCCCG
CCGAAGACCC TCCTCGACCG CGCGGCGCAG GCCACCATGC GCCGCATCCG CGCGCTCGGC
GACAGCCCGA ACCGCATCAC CCTCCTGCGC GATCCGGAAG GCCGGGGCGA GGCGCAAGAG
CGCGGGACTT TCCTCGAGAA CCAGAGCCAG CCCTTCGGCA TGGCGCTGGT GGGCGATACC
TTCTACGTCG GCAACACCGA CGGCATTATG GCCTTCCCCT ACCGCCCGGG CGCCACGCGG
CTCGAGGGGC CGGGGCGGCG CCTGACCACC TTCAAGCCCG GCGGGCACTG GACGCGCAGC
CTGATCGTCT CGCCCGACGG GCGCCGGATC TATGCCGGCG TGGGCTCGCT CAGCAACATC
GGCGACGACG GGATGGAGGC CGAGGAGGGC CGCGCCGCGA TCTGGGAGCT GGACCTCGCC
AGCGGTCAGG CCCGGATCTA TGCCTCGGGC CTGCGCAACC CGGTGGGCCT CGCGTGGGAG
CCCACGACGC GCGTGCTCTG GACCGTGGTC AACGAGCGCG ACGGGCTCGG CGACGAGACC
CCGCCCGACT ATCTGACCTC CGTCGAGGAG GACGGCTTCT ACGGCTGGCC CTACTGCTAC
TGGAACCGGA TCGTCGACGA TCGCGTGCCG CAGGATCCGG CGATGGTCGC CCGCGCGATC
ACGCCCGACT ATGCGCTCGG CGGACACACG GCCTCGCTCG GCCTCTGCTG GGTGCCCGCG
GGCACGCTGC CGGGCTTCGG CGACGGAATG GCCATCGGCC AGCACGGCTC GTGGAACCGT
TCGAAACTCA GCGGTTACCG GCTGATCTTC GTGCCCTTCG CGAACGGCCG GCCCTCCGGC
CCGCCGCGGG ACATCCTGAC CGGCTTCCTC TCCGACGACG AGAAGCTGGC CTACGGTCGC
CCGGTCGGCG TGGCGGTGGG CCCCGACCGC CGCTCGCTTC TGCTCGCCGA CGATGTGGGC
GACGTGATCT GGCGCGTGAC CGGCGCCTGA
 
Protein sequence
MDFLARATAI VGNTMVLMRR FGSPGTQAIG QSPAIPEAQK QGIMTLKMPV AKGWAPGHLP 
TPAPGLQVNA FARDLEHPRW IEVLPSGDVL VAEARQLPTP PKTLLDRAAQ ATMRRIRALG
DSPNRITLLR DPEGRGEAQE RGTFLENQSQ PFGMALVGDT FYVGNTDGIM AFPYRPGATR
LEGPGRRLTT FKPGGHWTRS LIVSPDGRRI YAGVGSLSNI GDDGMEAEEG RAAIWELDLA
SGQARIYASG LRNPVGLAWE PTTRVLWTVV NERDGLGDET PPDYLTSVEE DGFYGWPYCY
WNRIVDDRVP QDPAMVARAI TPDYALGGHT ASLGLCWVPA GTLPGFGDGM AIGQHGSWNR
SKLSGYRLIF VPFANGRPSG PPRDILTGFL SDDEKLAYGR PVGVAVGPDR RSLLLADDVG
DVIWRVTGA