Gene RPB_4192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4192 
Symbol 
ID3912000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4763062 
End bp4765416 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content67% 
IMG OID637886096 
ProductTonB-dependent receptor 
Protein accessionYP_487795 
Protein GI86751299 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0530533 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACTCC CGTTCAAACG CGCGCTCCTG CTCGGAAGCG CTGCCGTCGT CGCCTTCGAA 
AGTCTGCCTG CCCCACAGGC CTTCGCGCAA ACCACCCTGC CCGAGGTGAC CGTCACCGCG
CCGAGCCCGA TCGTGCGCCG CCACACCACC CCGCCGGCCC GCCCCGCGAC CCGCGTGGCC
GCGCCCGCGC GGCAGCGCGG AGCCGCTCCG GCCGAAACGC AGCCCGTCGT CGCCCAGCCG
GCTCCGGCGC TGCCCGGTAC GCTGCCGATC GTCACCGACC AGTTCGCCAC CGTCACCGTG
GTGCCGAACG AGGAGATCCG GCGCAATGGC GGCGGCACGC TCGGCGATCT GCTGAACAAC
AAGCCGGGCA TCACCGGCTC GAGCTACGCA CCGGGCGGCG CCAGCCGGCC GATCATCCGC
GGTCTCGACG TCAATCGCGT CAACATCATC GAGAACGGCA TCGGCAGCAA CGGCGCCTCC
GATCTCGGCG AAGACCATTT CGTGCCGATC GACCCGCTCG CGACCAACCA GGTCGAGGTG
ATCCGCGGCC CGGCGACGCT GCGCTACGGC TCGACCGCGA TCGGCGGCGT GGTCAGCGCC
ACCAACAACC GGATTCCCGA CGCGTTGCCG CCCTGCGCGC AACCGTTCCA GAGCTACGGC
CTGCCGGTGA ACGCGCCCGC GGCGCTCGGC GGCTCGGCCG GCTGCATGAA CGCCGAGGTC
CGCAGCGCCG TGAGTTCGGT CGATCGCGGC GTCGAAGGCG CCGTGCTGCT GGACGCCGCC
GGCAACAATG TCGCGGTCCA TGCCGACGTC TACGGCCGCA ACACCCGCGA CTACAACGTA
CCGAGCTACC CGTATGCCGA TGCCGGCATT CCGTTCAACG GGCGCCAGAC CAACTCGGCC
TCGCAGGCGA GCGGGGCGTC GATCGGCGGC TCGTATCTGT TCCACGGCGG CTTCATCGGT
GCGTCGGTCA CGCAGAACAA CTCGATCTAC CACATCCCCG GCCCCGAGGG AGTGGAACTG
GGCACAAAGA TCGACGCCAA GCAGACCAAG TTCAACGCCA AGGGCGAGTA TCGTCCCGAC
GCCGCCGCGA TCGACGCGAT CCGGTTCTGG GTCGGCGCCA CCGACTACAA GCACAACGAG
ATCGGCCTCG CCGATGCCGC CGACCCGACC AGCGGCGGTG TGCGTCAGAC CTTCACCAAC
CGCGAGCAGG AAGGCCGGCT CGAAGTTCAG CTGACGCCGT TCAACGCCGG CTTCGCGGCG
GTGACCACGG CGGTCGGCGT CCAGGCCAGC CATCAGGAAC TGACTGCGCC CAGCCCCGAC
GATCCGACCA GCCCGCTGAA CGGACTGTTC GATCCCAACA AGAACACCAA GGTCGCCGGC
TACGTCTTCA ACGAACTGCA GTTCACCAAT ACCACCAAGG CGCAGGTCGC CGGCCGGATC
GAGCACGTCG AACTGTCGGG ATCATCACCC TCCTCGGTGC CGGAGATCTT CGACCTCAAC
ACCGATCCCA ATGCGATCGG CGCCGCCACC TCGCGCAACC TGTCCTTCAC GCCGAAGAGC
TTCAGCCTCG GCCTGATCCA GGCGCTGCCA TGGGGCCTGT CGGCCAGCAT CACCGGGCAA
TATGTCGAGC GCGCCCCGAA GCCCGCCGAA TTGTTCTCGC GCGGCGGCCA CGACGCCACC
GCGACCTTCG ACATCGGCAA TCCCAATCTG AAGATGGAGA CGGCGAAGTC GGTCGAAGTC
GGCCTGCGCC GGGCGGACGG CCCGTTCCGG TTCGAGATCA CCGGCTACTA CACCCAGTTC
AGCGGCTTCA TCTATCGCCG GCTGACCGGC AACACGTGCG AGGACGGCGC GTGTATCGTC
GGCACTGGCC TCGAACTGAA CCAGGCGATC TATTCGCAGC GCGACGCCAC CTTCAAGGGC
GGTGAATTCC AGAGCCAGCT CGACGTCGCG CAGTTCTACG GCGGCACCTG GGGCATCGAG
AACCAGGTCG ACGTGGTACG CGCCACCTTC GCCGACGGCA CCAACGTGCC GCGGATTCCC
CCGGTGCGCC TCGGCGGCGG CCTGTTCTGG CGCGACGCCA ACTGGCTGAT GCGGGTCAAC
CTGCTGCACG CCTTCGCGCA GAACAACGTC GCCGACATCG CCGAGACGAC GACGCCCGGC
TACAATCTGC TGAAGGCCGA GATCAGCTAC CGCACCAAGC TCAACCCCAA CGTCTGGGGC
GCACAGGAAA TGCTGGTCGG CCTGGTCGGC AACAATCTGC TCAACGAGGA CATCCGCAAC
TCGGTGTCCT ACAGCAAGGA CAACGTGCTG ATGCCCGGTA TCGGCGTGCG CGCGTTCGCG
AATCTGAAGT TCTGA
 
Protein sequence
MSLPFKRALL LGSAAVVAFE SLPAPQAFAQ TTLPEVTVTA PSPIVRRHTT PPARPATRVA 
APARQRGAAP AETQPVVAQP APALPGTLPI VTDQFATVTV VPNEEIRRNG GGTLGDLLNN
KPGITGSSYA PGGASRPIIR GLDVNRVNII ENGIGSNGAS DLGEDHFVPI DPLATNQVEV
IRGPATLRYG STAIGGVVSA TNNRIPDALP PCAQPFQSYG LPVNAPAALG GSAGCMNAEV
RSAVSSVDRG VEGAVLLDAA GNNVAVHADV YGRNTRDYNV PSYPYADAGI PFNGRQTNSA
SQASGASIGG SYLFHGGFIG ASVTQNNSIY HIPGPEGVEL GTKIDAKQTK FNAKGEYRPD
AAAIDAIRFW VGATDYKHNE IGLADAADPT SGGVRQTFTN REQEGRLEVQ LTPFNAGFAA
VTTAVGVQAS HQELTAPSPD DPTSPLNGLF DPNKNTKVAG YVFNELQFTN TTKAQVAGRI
EHVELSGSSP SSVPEIFDLN TDPNAIGAAT SRNLSFTPKS FSLGLIQALP WGLSASITGQ
YVERAPKPAE LFSRGGHDAT ATFDIGNPNL KMETAKSVEV GLRRADGPFR FEITGYYTQF
SGFIYRRLTG NTCEDGACIV GTGLELNQAI YSQRDATFKG GEFQSQLDVA QFYGGTWGIE
NQVDVVRATF ADGTNVPRIP PVRLGGGLFW RDANWLMRVN LLHAFAQNNV ADIAETTTPG
YNLLKAEISY RTKLNPNVWG AQEMLVGLVG NNLLNEDIRN SVSYSKDNVL MPGIGVRAFA
NLKF