Gene RPB_4526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4526 
Symbol 
ID3912343 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp5114862 
End bp5116484 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content73% 
IMG OID637886430 
Producthypothetical protein 
Protein accessionYP_488120 
Protein GI86751624 
COG category[S] Function unknown 
COG ID[COG2845] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACA AGCCGAAATC CCTGCTGACG GCCCTGACCC GACGCGGCCC GCTGCTGGCG 
ATCATGGCGC TGTTGCTGGT CGGCGTCGCC GGGCCGGCCT CGGCGCAGTT TTTCGGCTTC
GGCGGGCCGT CGCAACCGCC GCCGCGCCCA CAGCGCGGCA TCGGCAACGG CGGCGGCGGT
TACAACGGCG GCGGCGGCGG TTTCTTCGGC AGCGACGTGT TCGCGCCGTT CCAGCATCAG
GCGCCCAAGC GCGCCCCGGT GCGCGAGGAT TATTCCCGCG CACCCGCGCC GGACAAGCGC
GACGCGGCTT TGACGCCGGA GCGCAACGTC GTGGTGCTGG GCGACGCGAT GGCCGACTGG
CTCGCTTACG GCCTTGAGCA GGCCTATGCC GAGCAGCCCG ACATGGGGGT GATCCGCAAG
CACAAGACCG TCTCCGGCCT GCTGCGCTAC CAGCCCAAGG GCGAGCCGTC CGACTGGATC
GCCGCCGCCC GGGACATCCT CGCCGCCGAG AACCCGGACG CGATCGTGGT GATGCTCGGC
CTCAGCGACC GCGTCCCGAT CACCGAACCC GTGGCGGAGA AGGACAAGAA GAAGGACGGC
AAGGGCAAGC CCGAAGACGC CGACGCCAAG CCTAATGCGA AGCCGGACGA CAAGACCGCC
GACGGCGCCG CCGACGATGA CGACGAGGAC GACGACACGC CCCAGATCAT GACGCCGGAG
AAGGGCAAAC GCTCCGGCGT CGCCCAGTTC CGCGACGATC GCTGGATCGA GCTCTACACC
AAGAAGCTCG AGGACATGAT CGCCGTGCTC AAGACCAAGG GCGTGCCGGT GCTGTGGGTC
GGCCTGCCGG CGGTGCGCGG CACCAAATCG ACCTCGGACG CGCAGTTCCT CAACGCGCTG
TATCGCGACG CCGCCGCCAA GGCGGGAATC ACCTATGTCG ACGTCTGGGA CGGCTTCGTC
GACGAGGCCG GGCGCTATGT GCTGCAGGGC CCCGACTTCG AAGGCCAGAC CCGCCGGCTG
CGCGCCTATG ACGGGGTGTA TTTCACCAAA TCCGGCGCCC GCAAGCTGGC GCATTATGTC
GAGCGCGAGA TCGCCCGCCT GCTCGCCGCG CGATCCGGGC CGATCGCCCT GCCGACCGAT
CCCGGCACGC CGGACGCGAG CGCAAAGCCG GACGGCCCGG CGCCGCGGCC GATCGCCGGC
CCGATCATGC CGCTGGTGGC GTCCTCGGTC TCGACCGAGC GCCTGCTGGG CGGCCCCGGC
GTCGCGCCGG CGCCGGTCGA TGCGCTGGTG GCGCGCACGC TGGTGAAGGG CGAGCCGCTC
GCCGCCCCGG CCGGTCGCGC CGACGACTAC GCCTGGCCGC GCCGCGAGAT CGTGGTCGAG
CGCGCGCAGG AGCCGCCGCC GCCGAAGAGC GCGGCGCCGG TGGCGAGCAA CGCACCGGGC
AGCGCCGCGC CGGGTGCGAG CGGCCAGCCG CAACCGCAGA AGCGCATCGC CCGCGCCGCG
CCGCCACCGC CGCCCGCCGC CTCCGGCTTC TTCGGCTTCG CGCCGGCGCA GCCCCAGCCG
CAGATGCGCC GCGCGCCCCC GCCGCCGCCG CCCGCGTCGG GATTCTTCTC GATCTTCCGC
TGA
 
Protein sequence
MSDKPKSLLT ALTRRGPLLA IMALLLVGVA GPASAQFFGF GGPSQPPPRP QRGIGNGGGG 
YNGGGGGFFG SDVFAPFQHQ APKRAPVRED YSRAPAPDKR DAALTPERNV VVLGDAMADW
LAYGLEQAYA EQPDMGVIRK HKTVSGLLRY QPKGEPSDWI AAARDILAAE NPDAIVVMLG
LSDRVPITEP VAEKDKKKDG KGKPEDADAK PNAKPDDKTA DGAADDDDED DDTPQIMTPE
KGKRSGVAQF RDDRWIELYT KKLEDMIAVL KTKGVPVLWV GLPAVRGTKS TSDAQFLNAL
YRDAAAKAGI TYVDVWDGFV DEAGRYVLQG PDFEGQTRRL RAYDGVYFTK SGARKLAHYV
EREIARLLAA RSGPIALPTD PGTPDASAKP DGPAPRPIAG PIMPLVASSV STERLLGGPG
VAPAPVDALV ARTLVKGEPL AAPAGRADDY AWPRREIVVE RAQEPPPPKS AAPVASNAPG
SAAPGASGQP QPQKRIARAA PPPPPAASGF FGFAPAQPQP QMRRAPPPPP PASGFFSIFR