Gene RPB_0232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0232 
Symbol 
ID3907859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp264849 
End bp266402 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content64% 
IMG OID637882114 
Productextracellular solute-binding protein 
Protein accessionYP_483854 
Protein GI86747358 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATAC TCGATCTGGG ATCACGAACC CTGCGACGCC GCGATATTCT GGCGCTGATC 
GGCGGCGGCG CCGCAGCCGC TGCGGTCGGC CTGCCGGCGC TGGCGCAGGA GCCCAAGAAG
GGCGGCGTGC TGAAGGTCGC GGCCCCAGCG AATCCGTCGT CGCTCGATCC GGCCACCGGC
GGCGCCGGTT CCGACCACAG CATTCTCTGG ACGATCTACG ATACGCTGGT CGAGTGGGAC
TACGACACGC TGAAGCCGAG GCCCGGCATG GCGAAATGGT CGTATCCGAA TCCGACCACG
ATGGTGATCG ACATCACCCC CGGCATCCAG TTCCACGACG GCACCGCGAT GGACGCCGAG
GCGGTGAAGT TCAACCTCGA TCGCAACCGC TCCGATCAGC GGTCCAATAT CAAGTCGGAT
CTCGCCAGCA TCGAGTCGAT CGAGGTGACC AGCCCGCTGC AGGTGACGCT GAAGCTGAAG
AGCCCGGATA CATCCCTGCC GGCGATCCTG TCCGACCGCG CCGGCATGAT GGTGTCGCCG
ACCAACATCA AGGCGCTTGG CAACGAGACA GACCGCAAGC CGGTCGGCGC CGGGCCGTGG
AAGTTCGTGC GCTGGAACGA CAACGAAATC ATCGTCGTGG CTCGCCACGA GAACTACTGG
CGCAAGGGCC GGCCGTATCT CGACGGCATC GAGTTCAACA TCATCACCGA AAACGCCACG
GCGCTGCGGT CGGTGGTCGC CGGCCAGAAC GACATGGCAT TTCAGTTGCC GGCACGGCTG
AAGCCGGTGA TTGAGCGCGC CAAGGACCTG ACCATGGTCA GCTCGCCGAC GCTGTATTGC
ATTCAGGTGT ATTTCAACTA CGCCCGCGCG CCGCTCGACA ATCTCAAGGT TCGTCAGGCG
ATCAATTTCG CGTTCGACCG CGACACCTTC GTCAAGGCGG CGCTGAGCGG GCTCGGCGAA
TCGGCCCGGA TGACGCTGCC GAGCTCGCAC TGGGCGTTCA ACAAGGATGT GGCCGGCACC
TATCCGCACG ATCCGGAGAA GGCGAAGAAG TTGCTGGCAG AGGCCGGCTA CAAGGACGGC
CTCGAGCTGA CGATCGGCGG CTATACCGAT CAGGATTCGG TGCGCCGCGG CGAGGTGATC
CAGGATCAGC TCGGCAAGGT CGGCATCCGG CTCAAATTCA CCCGCGGCAC CATCGCGGAA
ATCAGCGCGC AGTTCTTCGC GCAGGAGAAG AAGTTCGACC TGTTGGTGTC GGCCTGGACC
GGGCGTCCCG ATCCGAGCAT GACCTATGGG CTCGGCTTCG ACAAAGGCGC GTACTACAAC
GCCGGCCGCA CCGCCGATCC TGAGCTGTCC AAGCTGATCC TCGAAAGCCG CGTCAGCGAG
GATTTGGCCA AGCGCGCCGA AGTGTTCGCC AGGATCCAGC GCATCACGGT CGAACAGGCA
CTGTCGGCGC CGCTGGCGTT CCAGTTCGAG CTCGACGCGC TGTCGTCCAA GGTGAAGGGC
TTCAAGCCCA ATCTGCTCGG CAAGCCGAAG TTCGAATACA TCTCCCTCGC GTGA
 
Protein sequence
MRILDLGSRT LRRRDILALI GGGAAAAAVG LPALAQEPKK GGVLKVAAPA NPSSLDPATG 
GAGSDHSILW TIYDTLVEWD YDTLKPRPGM AKWSYPNPTT MVIDITPGIQ FHDGTAMDAE
AVKFNLDRNR SDQRSNIKSD LASIESIEVT SPLQVTLKLK SPDTSLPAIL SDRAGMMVSP
TNIKALGNET DRKPVGAGPW KFVRWNDNEI IVVARHENYW RKGRPYLDGI EFNIITENAT
ALRSVVAGQN DMAFQLPARL KPVIERAKDL TMVSSPTLYC IQVYFNYARA PLDNLKVRQA
INFAFDRDTF VKAALSGLGE SARMTLPSSH WAFNKDVAGT YPHDPEKAKK LLAEAGYKDG
LELTIGGYTD QDSVRRGEVI QDQLGKVGIR LKFTRGTIAE ISAQFFAQEK KFDLLVSAWT
GRPDPSMTYG LGFDKGAYYN AGRTADPELS KLILESRVSE DLAKRAEVFA RIQRITVEQA
LSAPLAFQFE LDALSSKVKG FKPNLLGKPK FEYISLA