Gene Rsph17025_0234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_0234 
Symbol 
ID5083782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp227543 
End bp228580 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content69% 
IMG OID640481789 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_001166449 
Protein GI146276290 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.594018 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAGGAG ACATCGTGAC ACGACTCAAG ATCGCCCTGA TGGGGTGTGC GGGGGCGCTT 
GCCCTGTCGG CCGCTCCGGC CCTGGCCCAG GAGGTCGGCG CCTGCCTGAT CACCAAGACC
GACACAAACC CGTTCTTCGT GAAGATGAAG GAAGGGGCCA CGGCGAAGGC ACAGGAACTC
GGCATCAACC TCAAGTCCTA CGCGGGCCGG ATCGACGGCG ACAGCGAAAG CCAGGTGGCC
GCCATCGAGA CCTGCATCGC CGACGGGGCG AAGGGCATCC TGCTGACGCC GTCCGACACC
AAGGGGATCG TGCCGTCGGT GCAGAAGGCG CGCGACGCGG GCATCCTCGT GATCGCGCTC
GACACGCCGC TGGATCCGAT CGACGCGGCC GACGGCACCT TCGCCACCGA CAACTTCCTC
GCGGGCGAGC TGATCGGCCA GTGGGCGCAG GCCAAGATGG GCGATGCGGC AGCGGATGCG
CGGATCGCGA TGCTGAACCT CGGCGTGTCG CAGCCCTCGG TGGATGTGCT GCGCGCGCAG
GGCTTCCTGC AGGGCTTCGG CGTCGATCTC GGCGACCCGA ACCGCTGGGG CGACGAGACC
GACCCGCGCA TCGTCGGCCA TGACGTGACC GACGGCAACG AGGAGGGCGG GCGCCGCGCG
ATGGAGAGCC TTCTGGCGCA GGATCCGATG ATCAACCTCG TCTATACGAT CAATGAGCCG
GCGGCGGCCG GCGCCTACGA GGCGCTGCGC TCCATCGGAC GCGAGAGCGA CGTGCTGATC
GTCTCGATCG ACGGCGGCTG CCCGGGCGTC CAGAACGTGA AGGAGGGGGT GATCGGCGCC
ACCTCGCAGC AATATCCGCT GCAGATGGCG GCGCTGGGGG TCGAGGCCAT TGCGGCCTTT
GCCAAGGACG GCACCCGGCC CGCGACGACC GAGGGCAAGG ACTTCACCGA CACGGGCGTG
GCGCTTGTGA CCGACCAGCC GGTCGAGGGG GTGGAGTCGA TCGACAGCGC CCGCGGCGCG
GAACTCTGCT GGGGCTGA
 
Protein sequence
MGGDIVTRLK IALMGCAGAL ALSAAPALAQ EVGACLITKT DTNPFFVKMK EGATAKAQEL 
GINLKSYAGR IDGDSESQVA AIETCIADGA KGILLTPSDT KGIVPSVQKA RDAGILVIAL
DTPLDPIDAA DGTFATDNFL AGELIGQWAQ AKMGDAAADA RIAMLNLGVS QPSVDVLRAQ
GFLQGFGVDL GDPNRWGDET DPRIVGHDVT DGNEEGGRRA MESLLAQDPM INLVYTINEP
AAAGAYEALR SIGRESDVLI VSIDGGCPGV QNVKEGVIGA TSQQYPLQMA ALGVEAIAAF
AKDGTRPATT EGKDFTDTGV ALVTDQPVEG VESIDSARGA ELCWG