Gene RPB_4239 details


Gene Information

Locus tag: RPB_4239
Symbol:
ID: 3912047
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4815208
End bp: 4817553
Gene Length: 2346 bp
Protein Length: 781 aa
Translation table: 11
GC content: 68%
IMG OID: 637886142
Product: TonB-dependent siderophore receptor
Protein accession: YP_487841
Protein GI: 86751345
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4773] Outer membrane receptor for ferric coprogen and ferric-rhodotorulic acid
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
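
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 4815208
end_bp = 4817553
gene_length_bp = 2346
protein_length_aa = 781

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons. Assumption: the last codon is a stop
# codon, which does not contribute an amino acid to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 2346 0 781
```

Under these assumptions the computed span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.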


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.475083
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACCG CCCGCCGCTG GCCGATCCTG TCATCGCCGC GCTTGCTTCG CAGCGCCACG 
GCCCGACTGC TGCTCGGCGC CGCGATCTGC CTCGTCAGTC TGCCGCCCTC GTCGCGACCG
GCCGAAGCCC AGGGCGTTGC GCCCGCGGCC GGCGCCGAGC TGCCGAGCGT CACCGTCGAT
GCGCCCGCCG CCCGGCCGGC GCGTCCGCGA TCGGTGGCGG CGCCGCAGCG AAATCCACAG
GCCGTGGCGC GCCGGCCGAA TCCTCAACGC AATCCCGCGC CGGCGCCGCC GACGCCGTCC
GAACGCGCCG CGGCCGCCGC AGCCGTGCTC AACGAGCAGA AACTCGGCTA CCGGGCGATG
CCGAGCCCGA CGACGCTGCG CACCGGCGCC TCGCCGCTCG AGAGCTCGCA ATCCGTCAAT
GTGGTGCCCG AGCAGGTGTT GAAGGATCAG TTGCCGCGCA ATCTCGACGA CGCGCTCGCC
AACGTCTCCG GCGTCACCCA GACCAACACG CTCGCCGGCT CGTTCGACGC GGTGATCCGG
CGCGGCTTCG GCGACAATCG CGACGGTTCG ATCATGCGCA ACGGCATGCC GCTGGTGCAG
GGCCGCGCCC TCGGCGCGAC CGTCGAGAGC GTCGAGGTGC TGAAGGGCCC GGCGTCGCTG
CTGTACGGCA TCCAGACCCC CGGCGGCATC GTCAACACCA TCAGCCGGCG GCCGGAGCTG
TATCAGCACG GCTCGCTGAC GTTGCTCGGC TCCAGCTTCG GCGGCGGCAA GAACGGCGCC
AATGCGATCT TCGACCTCAC CGGGCCGATC GGCGACACCG GCCTCGCCTA TCGGTTCATC
GGCTCTGGGC TCGACGAGGA TTACTGGCGC AATTTCGGCG CCAATCGCGA GATGCTGCTG
GCGCCGTCGC TGGCCTGGTA TGGCGAGCGC ACCACCGTGC AGTTCAGCTA CGAGCACCGC
GAGTTCAGCT ACCCGTTCGA TCGCGGCACG TCGTTCGTCA ACGGCGCGCC GCTGGCGATC
CCGGCGACGC GACGGCTCGA CGAGGCGTTC AACCGCACCT GGGGCAAGTC GGATCTGGTG
CAGGGCTCGG TCGAGCACCG CCTCGACGAC GTCTGGAAGC TCACCGCCGC CTACAGCTAC
AATTCCGAGA CCTACGACGC CAATCAGCTC CGCATCACCG GCGTCAATGC CGCAACCGGC
GTCGAGACCC GCAGCAATGA CGGCACCAAG GGCGCGCTGG CCTATTCGAG CTACGGCACG
TCATATCTGT CGGGCGAGCT CTGGCTCGGC GGGCTGCGCA ACGACGTGCT GATCGGCGGC
GATGCGCAGC GCCGCGTGAT CTATCGGCAG AACCTGATCC GCCAGTCGAC GCCGAGCTTC
AACTTCTACA ATCCGGTCTA TGGCCTGGTG CAGCCGGGCA CGACCGTCTC GGCGTCCGAC
AGCGACCAGA CCGACAAGCT CGAAACCCGC TCGCTGTTCG TGCAGGACAC GCTGCATCTC
ACCGACTGGT TCTCGCTGGT CGGCGGCCTG CGCTGGATGG AATACGATCA GCTCGCCGGT
CGCGGCCGGC CGTTCACCGC CAACACCGAC CTCGCGGGGT CGAAGGTGCT GCCGCTCGCC
GGCGCGATTT TCAAGCTCAA CCAGCAGGTC TCGCTCTACG CCAGCTACAC CCAGTCGCTG
CAGCCGAGTT CGACCATCGC GCCGCTGACC GGCGGCGTCG TCATCGGCTC CAACATCGCG
CCCGAAGAGG GCACGCAATA TGAAGCCGGC GTCAAATTCG ACCTGAACAA GCGGCTGTCC
GGCACGCTGG CGGTCTACGA CATCGACAAG AAGAACGTGC TGGTGTCGCA GTTCAACGCG
TCGACGGGGC TGAACGAATA TCGCGCCGCC GGCAAGGTGC GCTCGCGCGG CGTCGAATTC
GACGTCACCG GCCGGCTCGA CGATCATTGG AGCACGATCG CCAGCTACGG CTACACCGAC
GCCTATGTGA CCGAAGATCC GACGCTGGTC GGCAAGCGCC TGCAGAACGT CGCGATGAAC
ACCGCGTCGC TGTTCCTGGT GTATGATTTC GGCACCGCGC TGCCCGGGCG GCTCCGGCTC
GGCGGCGGCG CCCGCTATGT CGGCGACCGG CCCGGCGACA GCACCAATTC CTTCGTGCTG
CCGGCCTATA CGGTGGCCGA TGTCTTCGCC AGCTACGAGG TGAAACACGC CGGCATCCCG
GTGATCTATC AGCTCAACGT CAAGAACCTG TTCGATCAGG TGTACTACCC GTCGGCGGTC
AACACCCTGA ACGTCGCCCT CGGCGACGCC CGGCGGTTCT CGCTGTCGGC GACGGCGAAG
TTCTAG
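
The GC content field in the record can in principle be recomputed from the gene sequence. The sketch below is one way to do so; `gc_percent` is a hypothetical helper name, and the function assumes the sequence is pasted as printed here, in space-separated blocks of ten bases.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First block of ten bases from the gene sequence above: 7 of 10 are G or C.
print(gc_percent("ATGCCGACCG"))  # 70.0
```

Passing the full gene sequence to the same function gives a value that can be compared with the rounded figure listed in the record.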
 
Protein sequence
MPTARRWPIL SSPRLLRSAT ARLLLGAAIC LVSLPPSSRP AEAQGVAPAA GAELPSVTVD 
APAARPARPR SVAAPQRNPQ AVARRPNPQR NPAPAPPTPS ERAAAAAAVL NEQKLGYRAM
PSPTTLRTGA SPLESSQSVN VVPEQVLKDQ LPRNLDDALA NVSGVTQTNT LAGSFDAVIR
RGFGDNRDGS IMRNGMPLVQ GRALGATVES VEVLKGPASL LYGIQTPGGI VNTISRRPEL
YQHGSLTLLG SSFGGGKNGA NAIFDLTGPI GDTGLAYRFI GSGLDEDYWR NFGANREMLL
APSLAWYGER TTVQFSYEHR EFSYPFDRGT SFVNGAPLAI PATRRLDEAF NRTWGKSDLV
QGSVEHRLDD VWKLTAAYSY NSETYDANQL RITGVNAATG VETRSNDGTK GALAYSSYGT
SYLSGELWLG GLRNDVLIGG DAQRRVIYRQ NLIRQSTPSF NFYNPVYGLV QPGTTVSASD
SDQTDKLETR SLFVQDTLHL TDWFSLVGGL RWMEYDQLAG RGRPFTANTD LAGSKVLPLA
GAIFKLNQQV SLYASYTQSL QPSSTIAPLT GGVVIGSNIA PEEGTQYEAG VKFDLNKRLS
GTLAVYDIDK KNVLVSQFNA STGLNEYRAA GKVRSRGVEF DVTGRLDDHW STIASYGYTD
AYVTEDPTLV GKRLQNVAMN TASLFLVYDF GTALPGRLRL GGGARYVGDR PGDSTNSFVL
PAYTVADVFA SYEVKHAGIP VIYQLNVKNL FDQVYYPSAV NTLNVALGDA RRFSLSATAK
F
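
The protein sequence is the translation of the gene sequence under translation table 11. The sketch below illustrates that relationship using the standard codon assignments, which table 11 shares; it does not handle the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(cds.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGCCGACCG CCCGCCGCTG GCCGATCCTG"))  # MPTARRWPIL
```

The output for the first thirty bases matches the first ten residues of the protein sequence listed above.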