Gene RPD_4147 details


Gene Information

Locus tag: RPD_4147
Symbol: tolB
ID: 4024669
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4618605
End bp: 4619954
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 66%
IMG OID: 637964355
Product: translocation protein TolB
Protein accession: YP_571267
Protein GI: 91978608
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0823] Periplasmic component of the Tol biopolymer transport system
TIGRFAM ID: [TIGR02800] tol-pal system beta propeller repeat protein TolB
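
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of this database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4618605
END_BP = 4619954


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based coordinates inclusive at both ends."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded, assuming a complete CDS that ends in one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1350 449
```

Both results match the reported Gene Length (1350 bp) and Protein Length (449 aa).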


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTGCC CGAACATGCC GCTCAACTTG AATCGCAGAC AATTGATGCT CTCCGCAGCG 
AGTGCGGCCG GCGCGCTCGC GCTCGGCCCG GCGCGTGACG CTTTCGGCCA GGCCCGCGTG
CAGATCACCG AGGGCAACGT CGCGCCGCTG CCGATCGCGA TCCCGAACTT CGTCGCCGGC
ACGCCGTCGG ACAATGAAGT CGGCGGCGGC GTCAGCCAGG TGATCACCAA CAACCTCAAG
CGTAGCGGGT TGTTCGCGCC GATCGATCAG GCCGCTTATC TCGAGAAGAT CACCAACATC
GACGTGCCGC CGCAGTTCAA GAGCTGGACC AGCATCAACG CCCAGGCGCT GGTCACCGGC
CGGATGACGC GGCAGCCGGA CGGCCGCCTC AAGGCCGAGT TCCGGCTGTG GGATGTCGCC
ACCGGCCAGC AGCTCGCGGG CCAGCAGTAT TTCACCTCGC CGGAATACTG GCGGCGCATC
GCCCACATCA TCTCCGACCA GATCTACGAA CGGCTCACCG GCGAGAAGGG ATACTTCGAC
AGCCGCGTGG TGTTCATCGA TGAAAGCGGC CCGGCGGACC GCCGCGTCAA GCGGCTGGCG
CTGATGGACC AGGACGGCGC CAATGTTCGC TATCTGACCC GCGGCAGCGA TCTGGTGCTG
ACGCCCCGGT TCTCGCCGTC GACCCAGGAA ATCACCTATA TGGAGTTCGG CCAGGGCGAC
CCGAAGGTCT ATCTGTTCAA CATCGAGACC GGGCAGCGCG AGATCGTCGG CAACTTCCCC
GGAATGTCGT TCGCGCCGCG GTTTTCGCCG GACGGTCAGC GCATCATCAT GAGCCTGCAG
CAGGGCGGCA ATTCCAATCT GTTCGTCATG GACCTGCGCT CGAAGACGAC GACACGGCTG
ACCGACACGC CGGCGATCGA CACCTCGCCG TCCTATTCGC CCGACGCAGC CCGGATCTGC
TTCGAGTCCG ACCGCGGCGG CAAGCCGCAG ATCTACATCA TGCCGGCGGG CGGCGGGCAG
GCGCAGCGCA TTTCCTTCGG CGACGGGAGC TATTCGACCC CGGTATGGTC GCCGCGCGGC
GACTACATCG CCTTCACTAA GCAGGGCGGC GGTCAGTTCG CGATCGGCAT CATGAAGCCC
GACGGCTCTG GCGAGCGAAT TCTGACCTCG GGCTTCCACA ATGAGGGGCC GACCTTCGCG
CCGAACGGCC GCGTGCTGAT GTTTTTCCGC GATCCGGGCG GGAACGCGGG GCCGTCGCTC
TATACGGTCG ACGTGTCGGG CCGCAACGAA TTGCGGGTTC CGACCCCGGG CTACGCGTCC
GACCCGGCCT GGTCGCCGCT GCTGTCATAG
 
Protein sequence
MDCPNMPLNL NRRQLMLSAA SAAGALALGP ARDAFGQARV QITEGNVAPL PIAIPNFVAG 
TPSDNEVGGG VSQVITNNLK RSGLFAPIDQ AAYLEKITNI DVPPQFKSWT SINAQALVTG
RMTRQPDGRL KAEFRLWDVA TGQQLAGQQY FTSPEYWRRI AHIISDQIYE RLTGEKGYFD
SRVVFIDESG PADRRVKRLA LMDQDGANVR YLTRGSDLVL TPRFSPSTQE ITYMEFGQGD
PKVYLFNIET GQREIVGNFP GMSFAPRFSP DGQRIIMSLQ QGGNSNLFVM DLRSKTTTRL
TDTPAIDTSP SYSPDAARIC FESDRGGKPQ IYIMPAGGGQ AQRISFGDGS YSTPVWSPRG
DYIAFTKQGG GQFAIGIMKP DGSGERILTS GFHNEGPTFA PNGRVLMFFR DPGGNAGPSL
YTVDVSGRNE LRVPTPGYAS DPAWSPLLS
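
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal illustration, not the method used by this database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the input is assumed to contain only A, C, G and T plus the spacing used above.

```python
from itertools import product

# Codon assignments in NCBI's T, C, A, G ordering. Table 11 uses the
# same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # KeyError on non-ACGT symbols
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence listed above.
first_row = "ATGGATTGCC CGAACATGCC GCTCAACTTG AATCGCAGAC AATTGATGCT CTCCGCAGCG"
print(translate(first_row))  # MDCPNMPLNLNRRQLMLSAA
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same check against the complete protein sequence and the reported GC content.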