Gene RPD_2104 details

Gene Information

Locus tag: RPD_2104
Symbol:
ID: 4022586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2354768
End bp: 2356327
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 68%
IMG OID: 637962297
Product: substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_569240
Protein GI: 91976581
COG category: [E] Amino acid transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1174] ABC-type proline/glycine betaine transport systems, permease component
        [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein)
TIGRFAM ID:
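
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python. It assumes the start and end positions are inclusive, 1-based coordinates and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 2354768
end_bp = 2356327
gene_length_bp = 1560
protein_length_aa = 519

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS of N bases holds N // 3 codons; the last one is assumed to be the
# stop codon, which does not contribute an amino acid.
codons, remainder = divmod(span, 3)
expected_protein = codons - 1

print(span, remainder, expected_protein)  # prints: 1560 0 519
```

Under these assumptions the span matches the listed gene length, divides evenly into codons, and yields the listed protein length.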


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.882961
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.496916
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTCT TCAGCGACCC ACGCTGGACC GAGGCGCTGG CCAATCTGCC GGACTATCTC 
GGCAGCCATG TCCGCGTCAG CGTCGCGGCG CTGGGGCTCG GCCTCGCCGT CAGCCTGCCG
CTCGCGATCC TCGCGCGTCA CCGGCCGGTG CTGCGCGGCG CCCTGCTCGG CTTCGCCGGC
ATCGTGCAGA CCATTCCGGG CCTCGCATTG CTGGCGTTGT TCTATCCTCT GCTGCTGGCA
CTGGCGGCAC TGAGCGCGAG CACGCTCGGC GTCAGCTTCT CCGCGTTCGG CTTCCTGCCC
GCGGTGCTGG CGCTGGCGCT GTATTCGATG CTGCCGGTGC TGCGCAACAC CATCACCGGG
CTCGCCGGCG TCGACCCGGC GCTGATCGAG GCCGCACAGG GCGTCGGTAT GACTCCACGG
CAGACGCTGA CGATGGTCGA GCTGCCACTG GCGCTGCCGG TGATCATGGC CGGCATCCGC
ACCAGTGCGG TGTGGGTGAT CGGCACCGCG ACGCTGTCGA CGCCGATCGG TCAGACCAGC
CTCGGCAACT ACATCTTTGC GGGACTGCAA ACCCAGAACT GGGTGCTGGT GCTGTTCGGC
TGTGCCGCAG CCGCTGTGCT GGCGCTGGCG GTCGATCAAT TGTTGGCGCT GATCGAGACT
GGCCTGCGGC TCCGCAGCCG GCTCCGCGTC GCGCTCGGCG GCCTCGGGCT CGCCGCGCTG
GTGATCGCGA CGCTGGCGCC GTCGCTCGCG TCGTCGAAAT CGACCTATGT GGTCGGCGCC
AAGACCTTCA CCGAGCAATA CGTCTTGGCG GCGCTGATTG CGCAGCAACT CGACGACCGA
GGTCTTCCCG CCACGATCCG CGCCGGCCTC GGCTCCAGCG TAATCTACGA CGCGCTCAAA
AGCGGCGATA TCGATGCTTA CGTCGATTAT TCCGGCACGC TGTGGGCCAA CCAGTTTCAC
CACCCCGGCA TCGCCGCGCG GGACCAAGTC TTGAGCGATC TGAAAGGCGA CCTCGGCAGA
GACGGCGTCA CGCTGCTCGG CGAACTCGGC TTCGCGAACG CCTATGCGCT GGTGATGCCG
CGCGCCAGAG CGGAGGCGCT CGGCGTCCGG TCGATCGCTG ATCTCGCTGC GCGCGCGCCG
CAGATGACGA TCGCTGGCGA CTATGAATTC TTCTCGCGCC CGGAATGGGC CGAGCTGCGC
AGGACTTACG GGCTTTCGTT CAAGACCGAG CGGCAGATGC AGCCGGACTT CATGTATGCC
GCCGTCGCCT CCGGCGAGGT CGATCTGATC GCGGGCTACA CCTCGGACGG GCTGATCGCG
AAATACGATC TGGTCGTCCT CGACGATCCG CGACAGGCGA TTCCGCCTTA CGACGCCGTG
CTGCTGGTGT CGCCGAAGCG CGCCGGTGAC ACGAAGCTGC GCGACGCGCT CAGCCCCCTC
GTCGGCCGTA TCGACATCGC GGCGATGCGA GCGGCCAATC TCCGGGCCGC TTCCGGCGGC
GGCACAGGCA CGCCGGATGT GGTGGCGCGT GCGCTGTGGG ACAGCGTGAA GCCGCGGTGA
 
Protein sequence
MSVFSDPRWT EALANLPDYL GSHVRVSVAA LGLGLAVSLP LAILARHRPV LRGALLGFAG 
IVQTIPGLAL LALFYPLLLA LAALSASTLG VSFSAFGFLP AVLALALYSM LPVLRNTITG
LAGVDPALIE AAQGVGMTPR QTLTMVELPL ALPVIMAGIR TSAVWVIGTA TLSTPIGQTS
LGNYIFAGLQ TQNWVLVLFG CAAAAVLALA VDQLLALIET GLRLRSRLRV ALGGLGLAAL
VIATLAPSLA SSKSTYVVGA KTFTEQYVLA ALIAQQLDDR GLPATIRAGL GSSVIYDALK
SGDIDAYVDY SGTLWANQFH HPGIAARDQV LSDLKGDLGR DGVTLLGELG FANAYALVMP
RARAEALGVR SIADLAARAP QMTIAGDYEF FSRPEWAELR RTYGLSFKTE RQMQPDFMYA
AVASGEVDLI AGYTSDGLIA KYDLVVLDDP RQAIPPYDAV LLVSPKRAGD TKLRDALSPL
VGRIDIAAMR AANLRAASGG GTGTPDVVAR ALWDSVKPR
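
Both sequences are printed in blocks of 10 characters, 60 per line, so the spacing has to be stripped before any downstream use. The sketch below shows one way to do that in Python and to compute a GC fraction from the cleaned string. The helper names `clean_sequence` and `gc_fraction` are hypothetical, not part of any library, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, exactly as displayed in the record.
first_line = "ATGAGTGTCT TCAGCGACCC ACGCTGGACC GAGGCGCTGG CCAATCTGCC GGACTATCTC"

seq = clean_sequence(first_line)
print(len(seq))          # prints: 60
print(seq.startswith("ATG"))  # prints: True
print(round(gc_fraction(seq), 3))
```

Pasting the full gene sequence into `clean_sequence` should give a string whose length can be compared against the listed gene length, and `gc_fraction` can then be compared against the listed GC content.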