Gene Information |
Locus tag | RPB_4526 |
Symbol | |
ID | 3912343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5114862 |
End bp | 5116484 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637886430 |
Product | hypothetical protein |
Protein accession | YP_488120 |
Protein GI | 86751624 |
COG category | [S] Function unknown |
COG ID | [COG2845] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
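The Start bp, End bp, Gene Length and Protein Length fields above are arithmetically linked, and a short check can make that link explicit. The sketch below is only an illustration, not part of the record: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for RPB_4526.
length = gene_length_bp(5114862, 5116484)
print(length)                     # 1623
print(protein_length_aa(length))  # 540
```

Both results agree with the Gene Length (1623 bp) and Protein Length (540 aa) fields, which is consistent with the stated assumptions.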
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCCGACA AGCCGAAATC CCTGCTGACG GCCCTGACCC GACGCGGCCC GCTGCTGGCG ATCATGGCGC TGTTGCTGGT CGGCGTCGCC GGGCCGGCCT CGGCGCAGTT TTTCGGCTTC GGCGGGCCGT CGCAACCGCC GCCGCGCCCA CAGCGCGGCA TCGGCAACGG CGGCGGCGGT TACAACGGCG GCGGCGGCGG TTTCTTCGGC AGCGACGTGT TCGCGCCGTT CCAGCATCAG GCGCCCAAGC GCGCCCCGGT GCGCGAGGAT TATTCCCGCG CACCCGCGCC GGACAAGCGC GACGCGGCTT TGACGCCGGA GCGCAACGTC GTGGTGCTGG GCGACGCGAT GGCCGACTGG CTCGCTTACG GCCTTGAGCA GGCCTATGCC GAGCAGCCCG ACATGGGGGT GATCCGCAAG CACAAGACCG TCTCCGGCCT GCTGCGCTAC CAGCCCAAGG GCGAGCCGTC CGACTGGATC GCCGCCGCCC GGGACATCCT CGCCGCCGAG AACCCGGACG CGATCGTGGT GATGCTCGGC CTCAGCGACC GCGTCCCGAT CACCGAACCC GTGGCGGAGA AGGACAAGAA GAAGGACGGC AAGGGCAAGC CCGAAGACGC CGACGCCAAG CCTAATGCGA AGCCGGACGA CAAGACCGCC GACGGCGCCG CCGACGATGA CGACGAGGAC GACGACACGC CCCAGATCAT GACGCCGGAG AAGGGCAAAC GCTCCGGCGT CGCCCAGTTC CGCGACGATC GCTGGATCGA GCTCTACACC AAGAAGCTCG AGGACATGAT CGCCGTGCTC AAGACCAAGG GCGTGCCGGT GCTGTGGGTC GGCCTGCCGG CGGTGCGCGG CACCAAATCG ACCTCGGACG CGCAGTTCCT CAACGCGCTG TATCGCGACG CCGCCGCCAA GGCGGGAATC ACCTATGTCG ACGTCTGGGA CGGCTTCGTC GACGAGGCCG GGCGCTATGT GCTGCAGGGC CCCGACTTCG AAGGCCAGAC CCGCCGGCTG CGCGCCTATG ACGGGGTGTA TTTCACCAAA TCCGGCGCCC GCAAGCTGGC GCATTATGTC GAGCGCGAGA TCGCCCGCCT GCTCGCCGCG CGATCCGGGC CGATCGCCCT GCCGACCGAT CCCGGCACGC CGGACGCGAG CGCAAAGCCG GACGGCCCGG CGCCGCGGCC GATCGCCGGC CCGATCATGC CGCTGGTGGC GTCCTCGGTC TCGACCGAGC GCCTGCTGGG CGGCCCCGGC GTCGCGCCGG CGCCGGTCGA TGCGCTGGTG GCGCGCACGC TGGTGAAGGG CGAGCCGCTC GCCGCCCCGG CCGGTCGCGC CGACGACTAC GCCTGGCCGC GCCGCGAGAT CGTGGTCGAG CGCGCGCAGG AGCCGCCGCC GCCGAAGAGC GCGGCGCCGG TGGCGAGCAA CGCACCGGGC AGCGCCGCGC CGGGTGCGAG CGGCCAGCCG CAACCGCAGA AGCGCATCGC CCGCGCCGCG CCGCCACCGC CGCCCGCCGC CTCCGGCTTC TTCGGCTTCG CGCCGGCGCA GCCCCAGCCG CAGATGCGCC GCGCGCCCCC GCCGCCGCCG CCCGCGTCGG GATTCTTCTC GATCTTCCGC TGA
|
Protein sequence | MSDKPKSLLT ALTRRGPLLA IMALLLVGVA GPASAQFFGF GGPSQPPPRP QRGIGNGGGG YNGGGGGFFG SDVFAPFQHQ APKRAPVRED YSRAPAPDKR DAALTPERNV VVLGDAMADW LAYGLEQAYA EQPDMGVIRK HKTVSGLLRY QPKGEPSDWI AAARDILAAE NPDAIVVMLG LSDRVPITEP VAEKDKKKDG KGKPEDADAK PNAKPDDKTA DGAADDDDED DDTPQIMTPE KGKRSGVAQF RDDRWIELYT KKLEDMIAVL KTKGVPVLWV GLPAVRGTKS TSDAQFLNAL YRDAAAKAGI TYVDVWDGFV DEAGRYVLQG PDFEGQTRRL RAYDGVYFTK SGARKLAHYV EREIARLLAA RSGPIALPTD PGTPDASAKP DGPAPRPIAG PIMPLVASSV STERLLGGPG VAPAPVDALV ARTLVKGEPL AAPAGRADDY AWPRREIVVE RAQEPPPPKS AAPVASNAPG SAAPGASGQP QPQKRIARAA PPPPPAASGF FGFAPAQPQP QMRRAPPPPP PASGFFSIFR
|
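The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any computation. The following is a minimal sketch, with hypothetical helper names, of how one might strip the block spacing, compute a GC fraction to compare against the GC content field, and confirm that the gene sequence ends in a stop codon. It is demonstrated only on the first three and last two blocks of the gene sequence; the same calls would apply to the full string.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last codon is TAA, TAG or TGA (the stop codons of translation table 11)."""
    return strip_blocks(seq)[-3:] in {"TAA", "TAG", "TGA"}


# Excerpts copied from the Gene sequence field, not the full gene.
first_blocks = "ATGTCCGACA AGCCGAAATC CCTGCTGACG"
last_blocks = "GATCTTCCGC TGA"

print(len(strip_blocks(first_blocks)))
print(round(gc_fraction(first_blocks), 3))
print(ends_with_stop(last_blocks))
```

Running `gc_fraction` on the complete gene sequence gives a value that can be compared with the reported 73%; the excerpt alone is too short to be representative.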