Gene RPB_0102 details


Gene Information

Locus tag: RPB_0102
Symbol:
ID: 3909688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 109910
End bp: 112177
Gene Length: 2268 bp
Protein Length: 755 aa
Translation table: 11
GC content: 65%
IMG OID: 637881983
Product: TonB-dependent siderophore receptor
Protein accession: YP_483725
Protein GI: 86747229
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
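
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon of the span is a stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive coordinate span
    codons, remainder = divmod(span, 3)   # a CDS should be whole codons
    if remainder != 0:
        raise ValueError("span is not a multiple of 3")
    return span, codons - 1               # drop the stop codon


# Values copied from the record: Start bp 109910, End bp 112177.
print(cds_lengths(109910, 112177))  # (2268, 755)
```

Under these assumptions the result agrees with the listed Gene Length (2268 bp) and Protein Length (755 aa).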


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.058066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.995779
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCCCT CTCCGACTGC TTCCGGGCGC AACCGTCGCC TGCTGTCCTC CGTCGCCCTG 
ACGGCTCTGC TGTGGCCCGC TGCCGGCCAC GCCCAATCCG CGCCGCGCGG CCTCGACCCG
ATCGTGGTCG AAGGCCAGAC CGCGCGGCCG TCCAAGCCGC GCGCGGCTTC GTCCGCGGCC
GGCTCGTCGC GGACGCGCCG GGCCTCGGCG CCCGCGTCAC GCCCGCCTGC CGCGCCTGCT
GCCGCGCCGT CAGCCGCCGT CGCAGCCGTG CCGACCTTCA ATCTCGGCAC GATGGCATCG
ACCGCCAGCC GCCTCGGATT GACGCCGCTG CAGACCCCCG CCTCCGTCGA CATCATCACC
GCACAGACCA TCGCCGAGCG CGGCCAGCGC GATGTGCTCG ACGCGGTCAC TCAGAACGCC
ACAGGCATCA CGGCAACACC GGAGCCGGGC AATGGCGGCG TGGTGTTCTC GACCCGCGGT
TTCAGCGGCA CCGGTTCGGT GATGACCCTG TATGACGGCA CCCGCCTGTA TGTCGGCGCC
GGCACCGTGA CCTTCCCGTT CGACACCTGG TCGGCCGAGC GCATCGAGGT GCTACGCGGC
CCCGCCTCGG TGATGTACGG CGAAGGCGCG ATCGGCGGCG CCATCAACGT GATCACCAAG
CAGCCGCTCG ACGTGCAGCG CAACCAGGCC GAAGTGTCGC TCGACACCAA TCTGACGCGA
CGGATCGCGG TCGACTCCGG TGGTCCGATC AACAAGGATG TCTCCTATCG CATCACCGCC
ACCGGCAACA TGTCCGACGG TTGGGTCGAT CGCGACAAGA CTTCGAACAT CGCCGTCTCG
GCCGCCGTCA AGGTGAAGCA GACCGACCAC CTGACCTGGA CCCTGTCCAC CGCCTATGGC
GACCGCCACC CGTCGCTGTA TTACGGCACG CCGCTGGTCA ATGGCCGGCT CGACGAATCG
CTGCGTTTCA AGAACTACAA CGTCGGCGAC AGCAGCATCC GCTATCAGGA CAGCTGGACG
CAGTTGAAGT CCGAGTGGCA GGTGACCGAC AGCATCACCG TCCGCAACGC GCTGTACTAT
CTGAACAGCC GCAGGCACTG GAAGAGCGCG GAGGAATACG CCTTCAACCC GACCACCCGC
CAGGTCGACC GCAGCACTTA TCTGGAGATC TTCCACGACC AGCAGCAGAT TGGCGACCGC
ATGGATGCGA CGGTGCGCGG CCATCTGTTC GGACTGGAAA ACACCTTTGT CGCCGGGTTC
GACGTCAACC GCATCAACTT CAAGCACACC AATAATTCGC CCTACGGCGG AACGTCGTCG
GTCGATCCGT ACAATTTCGA TCCGGGCTTG TTTGCGAGCC CGGATCTGAC ACGACCGGGC
TATTCCAGCG TCACCAATCA GTATGCGGTG TTCGCCGAGA ACCGTCTGCA ACTCACCGAG
CAATTGGCGC TGATCGGCGG CATCCGGCAG GACCAGCCGA CCGTGGAGCG GACCGATCTG
CGAAACCCGG CTAACAATTA CACCCGATCG TATTCTTCGA CGACGTGGCG GGCCGGGGCG
GTGTACACGC CGATCCAGGA CCTCGCCTTC TACGGCCAAT ATTCGACCGC GGTCGATCCG
GTCGGCGGTT TGGTGACGGC GAGCAACGCC AATGCGAAGT TCGAACTGGC GACCGGCAAG
CAGGTCGAGA TCGGCGTCAA GCAATCGTTC TGGGGCGGGC GTGGCGAGTG GACCCTGGCG
GGCTACCACA TCGTCAAGAA CAACCTGCTT GCGCGCTCTT CGGAAGATCC AGATCAGGTC
GTCCAAGTCG GGCAGCAATC GTCACGCGGT ATCGAAGCGT CGGTTGGCTT GGCGCTCGAC
CACGGTTGGC GGGTCGACGC CAACACCACC TTCCTGCAGG CGAAATACGA CGACTTCGTG
CAGTCAGTTA ATGACGTGGG CGTGAACTTC GCCGGCAACG TGCCGATCAA CGTGCCGCTC
AACGTCTCGA ACGCGTGGCT GACCTGGGCG TTTGCGGCCG GCTGGTCCGC CAATGCCGGC
GTGCAGGTGG TCGGCAAGCG CTTTGCCGAC GCCGCCAACA CGCTGGAGAT GCCGGGCTAC
ACGCTTGTCA ATGCCGGTCT GCAGTGGAAG CCGGACGCCG CCTCGACGCT ATCGCTGCGG
CTCTACAACA TCTTCGACAA GGTCTATGCG ACGTCGAGCT ATGTCGACAA CCAATGGCTG
CTGGGGCGGC CCCGCACCGC GGAGCTGTCT TACAACGTCA AGTTCTGA
 
Protein sequence
MSPSPTASGR NRRLLSSVAL TALLWPAAGH AQSAPRGLDP IVVEGQTARP SKPRAASSAA 
GSSRTRRASA PASRPPAAPA AAPSAAVAAV PTFNLGTMAS TASRLGLTPL QTPASVDIIT
AQTIAERGQR DVLDAVTQNA TGITATPEPG NGGVVFSTRG FSGTGSVMTL YDGTRLYVGA
GTVTFPFDTW SAERIEVLRG PASVMYGEGA IGGAINVITK QPLDVQRNQA EVSLDTNLTR
RIAVDSGGPI NKDVSYRITA TGNMSDGWVD RDKTSNIAVS AAVKVKQTDH LTWTLSTAYG
DRHPSLYYGT PLVNGRLDES LRFKNYNVGD SSIRYQDSWT QLKSEWQVTD SITVRNALYY
LNSRRHWKSA EEYAFNPTTR QVDRSTYLEI FHDQQQIGDR MDATVRGHLF GLENTFVAGF
DVNRINFKHT NNSPYGGTSS VDPYNFDPGL FASPDLTRPG YSSVTNQYAV FAENRLQLTE
QLALIGGIRQ DQPTVERTDL RNPANNYTRS YSSTTWRAGA VYTPIQDLAF YGQYSTAVDP
VGGLVTASNA NAKFELATGK QVEIGVKQSF WGGRGEWTLA GYHIVKNNLL ARSSEDPDQV
VQVGQQSSRG IEASVGLALD HGWRVDANTT FLQAKYDDFV QSVNDVGVNF AGNVPINVPL
NVSNAWLTWA FAAGWSANAG VQVVGKRFAD AANTLEMPGY TLVNAGLQWK PDAASTLSLR
LYNIFDKVYA TSSYVDNQWL LGRPRTAELS YNVKF
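
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a simplified illustration, not the tool that produced this record. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons; since this gene begins with ATG, the sketch does not handle alternative starts. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence from its first base up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":                            # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty string)."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


# First line of the gene sequence above.
first_line = "ATGTCTCCCT CTCCGACTGC TTCCGGGCGC AACCGTCGCC TGCTGTCCTC CGTCGCCCTG"
print(translate(first_line))  # MSPSPTASGRNRRLLSSVAL
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.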