Gene Rsph17029_3146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3146 
Symbol 
ID4898609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp168895 
End bp169911 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content67% 
IMG OID640113748 
ProductABC sugar transporter, periplasmic binding protein 
Protein accessionYP_001045018 
Protein GI126463905 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.451363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.179714 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAGA CGGTGAGGCT GCTCGGCACG GTTGCCGCAG GACTGATGGC GGCCAATGTG 
GCCGCTGCTC AGGAGATCGC CTTCATCCCG AAGCTGGTCG GGGTGGGCTT CTTCACCTCG
GGCGGCAACG GCGCGATGAA GATGGGCGAG GAGCTGGGCG TCAAGGTGAC CTACGACGGC
CCGACCGAGC CCAGCGTCTC GGGGCAGGTC CAGTTCGTGA ACAACTTCGT CAACCAGGGC
TACGGGGCCA TCGTGCTCTC GTCGGTCTCG CCGGACGGGC TCTGCCCCGC GCTGAAGCAG
GCGATGGCCC GCGACGTGCT GGTGATGACC TGGGACAGCG ACGTGAACCC CGACTGCCGC
TCCTACTACA TCAACCAGGG CACGCCCGAG CAGCTCGGCG GCCTTTTGGT CGACATGGCC
AATGACGGGC TCGAGGGCAA GGAAAAGGCC AAGGTGGCCT TCTTCTATTC CTCGCCGACC
GTCACCGACC AGAACGCCTG GGCCGAGGCC GCCAAGGCCA GGATCGCGGC CGACCATCCG
GGCTGGGAGA TCGTGACCAC CCAGTATGGC TACAACGACG CGCAGAAGTC GCTGCAGACG
GCCGAGAGCA TCCTGAGCGC CTATCCCGAT CTCGATGCGA TCATCGCGCC CGACGCGAAC
GCGCTGCCGG CCTCGGCGCA GGCGGCCGAG AACCTCGGCC GGGCGGGCGA GGTGACGATC
GTGGGCTTCT CGACGCCCAA CGTGATGCGC CCCTATGTGA AGCGCGGCAC CGTCGAGCGG
TTCGGCCTCT GGGACGTGAC GCAGCAGGGG GCCATCTCGG TCGCCGTGGC GGCCCATGTG
CTGAAGGACG GCCCGCTCAA TGTGGGCGAC AGTCTCGAGG TGCCGGGCAT CGGCTCGGTC
GAGGTCTCGC CCAACTCGGT GCAGGGCTAC GACTACGAGG CCGAGGGCAA CGGCATCATC
CTGCTGCCCG AGCGGACGGT CTTCACCGCC GAGAACATCG ACAACTTCGA CTTCTGA
 
Protein sequence
MRKTVRLLGT VAAGLMAANV AAAQEIAFIP KLVGVGFFTS GGNGAMKMGE ELGVKVTYDG 
PTEPSVSGQV QFVNNFVNQG YGAIVLSSVS PDGLCPALKQ AMARDVLVMT WDSDVNPDCR
SYYINQGTPE QLGGLLVDMA NDGLEGKEKA KVAFFYSSPT VTDQNAWAEA AKARIAADHP
GWEIVTTQYG YNDAQKSLQT AESILSAYPD LDAIIAPDAN ALPASAQAAE NLGRAGEVTI
VGFSTPNVMR PYVKRGTVER FGLWDVTQQG AISVAVAAHV LKDGPLNVGD SLEVPGIGSV
EVSPNSVQGY DYEAEGNGII LLPERTVFTA ENIDNFDF