Gene RPB_3337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3337 
Symbol 
ID3911139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3817274 
End bp3818833 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content70% 
IMG OID637885240 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_486944 
Protein GI86750448 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component
[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.917916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.224892 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGT TCGGCGATCC GCGCTGGAAC GAGGCGCTGG CCAATCTGCC GGACACGCTC 
GGCGCGCATG TCCGCGTCAG CCTCGCCGCG CTGGCGCTCG GCCTCGTCGT CAGCCTGCCG
CTGGCGATTG TTGCGCGACG CCGCCCGATC CTGCGCGGCG CCCTGCTCGG CTTCGCCGGC
ATCGTCCAGA CCATCCCCGG CCTCGCTCTA CTGGCGCTGT TCTATCCGCT GCTGCTGGCG
CTGGCGGCGT TGAGTCTCCG CGCCTTCGGC GTCGGCTTCT CGGCGTTCGG CTTCCTGCCC
GCGGTGCTGG CGCTGGCGCT GTATGCGATG CTGCCGGTGC TGCGCAACAC CATCACCGGG
CTCGACGGCG TCGATCCGGC GCTCGTCGAG GCGGCGCAGG GCGTCGGCAT GACGCCGCGG
CAGGTGTTGA CGATGGTCGA GGTACCGCTG GCGCTGCCGG TGATCATGGC CGGCATTCGC
ACAGCCGCCG TATGGGTGAT CGGCACCGCG ACGCTGTCCA CGCCCATTGG CCAGACCAGC
CTCGGCAACT ACATCTTCGC CGGGCTGCAG ACGCAGAACT GGGTGCTGGT GCTGTTCGGC
TGCGCCGCCG CCGCAGTGCT GGCGCTCGCG GTCGATCAAC TGCTGGCGCT GATCGAGAGC
GGCCTGCGGC TGCGCAGCCG CGTCCGCACC GCGCTCGGCG GCCTTGGCCT CGCCGCGCTG
CTGATCGCGA CACTGGCGCC GTCGCTGGCC GGCACGCGAT CGACCTACTT CGTCGGCGCC
AAGACCTTCA CCGAGCAATA TGTGCTCGCG GCCCTTCTCC AGCAGCGGCT CGACGCCGCC
GGGCTGTCGG CCACGACGCG CGCCGGCCTC GGCTCCAGCG TGATCTACGA CGCGCTGAGG
AGCGGCGACA TCGACGCCTA TGTCGACTAC TCCGGCACGC TGTGGGCCAA TCAGTTCAAG
CACCCCGGCA TCGCCGCGCG CGACCAGGTG CTGGCCGATC TGAAGCAGGA TCTCGCCAAG
GACGGCGTCA CCCTGCTCGG CGACCTCGGC TTCGCCAACG CCTATGCGCT GGCGATGCCG
CGCGCCAGGG CCGAGGCGCT CGGTATCCGC TCGATCGCCG ATCTCGCCGC GCGCGCGCCG
GCGATGACGA TCGCCGGCGA CTACGAGTTC TTCTCCCGCC CGGAATGGGC CAATCTGCGC
CGCACCTATG GGCTCACTTT CAAGGCCGAA CGGCAGATGC AGCCCGACTT CATGTACGCG
GCCGTCGCCA CCGGCGAGGT CGATCTGATC GCCGGCTATA CCTCGGACGG GCTGATCGCC
AAATACGATC TCGTCGTGCT GGACGATCCC AAGCAGGCGA TCCCGCCTTA CGACGCGGTG
CTGCTGATCT CACCAAAGCG CGCCGGCGAC GCGAAGCTAC GCGATGCGCT GACGCCGCTG
GTCGGCCGCA TCGATATCGG CGCGATGCGC GCCGCCAATC TGCGCGCCGC CTCGGGCGGC
GGCGACGGCA CGCCCGACGC GGTGGCGCGG GCGCTGTGGG ACGGCATCAA GGCGCGCTGA
 
Protein sequence
MSLFGDPRWN EALANLPDTL GAHVRVSLAA LALGLVVSLP LAIVARRRPI LRGALLGFAG 
IVQTIPGLAL LALFYPLLLA LAALSLRAFG VGFSAFGFLP AVLALALYAM LPVLRNTITG
LDGVDPALVE AAQGVGMTPR QVLTMVEVPL ALPVIMAGIR TAAVWVIGTA TLSTPIGQTS
LGNYIFAGLQ TQNWVLVLFG CAAAAVLALA VDQLLALIES GLRLRSRVRT ALGGLGLAAL
LIATLAPSLA GTRSTYFVGA KTFTEQYVLA ALLQQRLDAA GLSATTRAGL GSSVIYDALR
SGDIDAYVDY SGTLWANQFK HPGIAARDQV LADLKQDLAK DGVTLLGDLG FANAYALAMP
RARAEALGIR SIADLAARAP AMTIAGDYEF FSRPEWANLR RTYGLTFKAE RQMQPDFMYA
AVATGEVDLI AGYTSDGLIA KYDLVVLDDP KQAIPPYDAV LLISPKRAGD AKLRDALTPL
VGRIDIGAMR AANLRAASGG GDGTPDAVAR ALWDGIKAR