Gene RPC_3244 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3244 
Symbol 
ID3971910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3591553 
End bp3593109 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content69% 
IMG OID637926355 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_533105 
Protein GI90424735 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component
[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.111227 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.744983 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCGT TCGCCGACCC GCGCTGGAGC GAGGCGCTGT CGCATCTGCC GGACTATCTC 
GGCAACCACG TCCGGGTCAG CCTGGCCGCG CTTGCGCTGG GTCTCGCGGT CAGCCTGCCC
TTGGCGATCC TGGCGAGGCG GCGGCCGCTG CTGCGCTCCA CCCTGCTCGG CTTCGCCAGC
ATCGTGCAGA CGGTGCCCGG GCTGGCGCTG CTGGCGCTGT TCTATCCGCT GCTGCTGGCG
CTGGCGGCGC TATCGCTCAG CTGGTTCGGC GTCGGCTTCT CGGCGTTCGG ATTTCTGCCG
GCGGTGCTGG CGCTGGCGCT GTATTCGATG CTGCCGGTGC TGCGCAACAC CATCACCGGG
CTCGACGGCA TCGATCCGGT GCTGCTGGAG GCCGCGCAGG GCGTCGGCAT GACGCAGCGG
CAATCGCTGG TCATGGTCGA ACTGCCGCTG GCGTTGCCGG TGATGATGGC CGGCATCCGC
ACCGCCGCGG TGTGGGTGAT CGGCACCGCG ACGCTCTCCA CCCCGATCGG CCAGACCAGC
CTCGGCAACT ACATCTTCGC CGGATTGCAA ACCCAGAACT GGGTGTTGGT GCTGTTCGGC
TGCACCGCGG CGGCCGTGCT GGCGCTCGCG GTCGATCAAT TGCTGGCACT GGTCGAGCGC
GGGCTGACGC TGCGCAGCCG GCTGCGCGTC GCTGTGGGCG GGGTCGGCCT GCTCGCCCTG
CTGACCGCGG CGCTGGCACC GTCGCTGACG CAGCCTTCCT CGCGCTACGT CGTCGGCGCC
AAGACCTTCA CCGAGCAATA TGTGCTGTCG GCCTTGATCG CGCAGCGGCT GCGCGAAGCC
GGGTTGGCCG CGACCAGCCG CGAGGGGCTG GGCTCCAGCG TGATCTATGA CGCGCTCGCC
ACCAGCGACA TCGACGTCTA TGTGGATTAT TCCGGCACGC TGTGGGCCAA TCAGTTCCAC
CACCCCGAGA TCAAGCCGCG CGCGGAATTG CTCGCCGAGC TGACGAACGA ACTCGCCCGG
TCCAAGGTGA CGCTGCTCGG CGAACTCGGC TTTGCCAACG CCTATGCGCT GGTGATGCCG
CGCGCCCGCG CCGACGCGCT CGGCATTCGT TCGATCGCCG ATCTCGCGCG GCAGGCTTCG
ACGCTGTCGA TCGCCGGCGA TTACGAATTC TTCTCGCGAC CGGAATGGGC CGGGCTGCGC
CAAGCCTATG GCCTGGCGTT TCGCAGCCAG CGCACCCTGC AGCCGGACTT CATGTATGCG
GCGGTGGCCT CGGGTGAAGT CGACGTCATC GCCGGCTACA CCTCGGACGG GCTGATCGCC
AAATACGATC TCGTGGTGCT CGATGACCCA AAGGCCGCGA TCCCGCCCTA CGACGCCATC
GTGCTGTTGT CGCCGCGGCG CGCCGACGAT GCCGCGTTGC GCGCGGCGCT CGCGCCGCTC
ATCGGCAAGA TCGACATCGC CGCGATGCGC GCCGCCAATC TGCGCGCTGC GGGCGGCGAC
GGCGACAGCT CGCCCGACGC GGTGGCGCGG TGGCTGTGGC GGCGGATCGC GCCGTAG
 
Protein sequence
MTPFADPRWS EALSHLPDYL GNHVRVSLAA LALGLAVSLP LAILARRRPL LRSTLLGFAS 
IVQTVPGLAL LALFYPLLLA LAALSLSWFG VGFSAFGFLP AVLALALYSM LPVLRNTITG
LDGIDPVLLE AAQGVGMTQR QSLVMVELPL ALPVMMAGIR TAAVWVIGTA TLSTPIGQTS
LGNYIFAGLQ TQNWVLVLFG CTAAAVLALA VDQLLALVER GLTLRSRLRV AVGGVGLLAL
LTAALAPSLT QPSSRYVVGA KTFTEQYVLS ALIAQRLREA GLAATSREGL GSSVIYDALA
TSDIDVYVDY SGTLWANQFH HPEIKPRAEL LAELTNELAR SKVTLLGELG FANAYALVMP
RARADALGIR SIADLARQAS TLSIAGDYEF FSRPEWAGLR QAYGLAFRSQ RTLQPDFMYA
AVASGEVDVI AGYTSDGLIA KYDLVVLDDP KAAIPPYDAI VLLSPRRADD AALRAALAPL
IGKIDIAAMR AANLRAAGGD GDSSPDAVAR WLWRRIAP