Gene Rsph17029_0853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0853 
Symbol 
ID4897933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp870816 
End bp871742 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content65% 
IMG OID640111438 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001042736 
Protein GI126461622 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID[TIGR03414] choline ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.564926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCA TCCGACAGGC CCTGCTGGCC GCCGCCTCGA CCCTGGCCAT GGCCGGGACT 
GCCCTCGCCC AGGACCAGTG CCGCGAGGTG ACCTTCTCCG ACGTGGGCTG GACCGACATC
ACCGTCACCA CCTCCGCCAC CCGTCAGGTG CTCGAGGCGC TCGGCTACGA GGTCGAGGTC
GACATCCTCG GCGTGCCCGT GACCTACGCC TCGATGGACA AGGGCGACGT GGACGTGTTC
CTCGGCACCT GGCTTCCCGC TCAGGAAAGC GCCATCGGCC CCTATCTCGA GAAGGGCAGC
ATCGAGGAGA TCACGACGAA CCTCGAAGGC ACGAAATACA CGCTCGCGGT GCCGACCTAT
CTCTACGACA AGGGCCTGAA GAGCTACGGC GACATCGCCA AGTTCAAGGA CGAGCTGGAA
GGCAAGGTCT ACGGCATCGA GCCCGGCAAC GAAGGCAACG AATATCTCAT CAGCCTGACC
GAGGCCGGAA AGCCGCTCGA AGGGTTCGAG GTCGTCCAGA GCTCCGAGCA GGGGATGCTG
GCGCAGGTCG CCCGCTTCTA CCCGCAGGAG AAGGGCGTGG TCTTCCTCGG CTGGGAGCCC
CACCCGATGA ACGCCACCTT CTCGCTGAAA TACCTGCCGG GCGGCGAGGA TTTCTTCGGC
GAGGACGGCG TGGTGAAGAC CGTGGCGCGC AAGGGCTTCA AGGAAGACTG TCCGAACGTG
ACCAAGATGC TGTCGCAGCA GAAATTCACC CTGCCCATGG AGAACGAGAT CATGGGCAAG
ATCCTCGACG ACGGGATGGA GCCCGATGCG GCGGTGATGG AATGGCTGAA GGCCAACCCG
GAGACCGTCG ATCCCTGGCT CGCCGGCGTG ACGACGGTGG ACGGCAAGGA TGCGCTGCCG
GTGGTCAAGG AAGCCCTCGG CCTCTGA
 
Protein sequence
MTPIRQALLA AASTLAMAGT ALAQDQCREV TFSDVGWTDI TVTTSATRQV LEALGYEVEV 
DILGVPVTYA SMDKGDVDVF LGTWLPAQES AIGPYLEKGS IEEITTNLEG TKYTLAVPTY
LYDKGLKSYG DIAKFKDELE GKVYGIEPGN EGNEYLISLT EAGKPLEGFE VVQSSEQGML
AQVARFYPQE KGVVFLGWEP HPMNATFSLK YLPGGEDFFG EDGVVKTVAR KGFKEDCPNV
TKMLSQQKFT LPMENEIMGK ILDDGMEPDA AVMEWLKANP ETVDPWLAGV TTVDGKDALP
VVKEALGL