Gene Information |
Locus tag | RPB_4246 |
Symbol | |
ID | 3912059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4829175 |
End bp | 4830803 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637886151 |
Product | NHL repeat-containing protein |
Protein accession | YP_487845 |
Protein GI | 86751349 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
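The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4829175
END_BP = 4830803


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1629, matching "Gene Length"
print(protein_length(length))  # 542, matching "Protein Length"
```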
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.359028 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.663745 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGACCA AACTTGCGCT CGGCGCGAGC GCGATGGCGC TCGGACTGCT GGCCGTTTCC GCCGCGCGCG CGGATGATTA CCACGTCACG AAATTGGTGC AGGGCTCGGC GTTTCACGGC GTGCATGGCC TCGGCATCGA CAAGGCCGGC CGGCTGTTCG CCGGCTCGGT CGCGGGCGCA GCGCTCTACG AGGTCGATCG CGACAAGGGC ACGGCGAAGA TCGCGGTGCC GACGCCGGAA GGCATGGCCG ACGACATCGC GTTCGCGCCC GACGGCACCA TGGCGTGGAC CGCGTTCCTC ACCGGCGACC TCTATGCGCG CAAGGGCGAC GGCCCGATCA AAAAGCTCGC CTCGGGTCTG CCCGGCATCA ACTCGCTCGC CTTCCGCAAG GACGGCCGGC TGTACGCCAC GCAGGTGTTT CTCGGCGACG CGCTGTACGA GATCGACGTC GCGGGCGAAA AGCCGCCGCG CAAGATCATG GAGAAGATCG GCGGGCTGAA CGGCTTCGAA TTCGGTCCCG ACGACAAGCT CTACGGCCCA TTGTGGTTCA AGGGCCAGGT CGCCGCGGTC GATGTCGACA AGGGCGAGCT CAACGTCGTC GCCGACGGTT TCAAGATTCC GGCGGCGGCG AATTTCGACA GCAAGGGCAA TCTCTACGTG CTCGACACCG CGCTCGGCCA GCTCGTCCGG GTCGATATCA AGACCGGCAA GAAACAGCTC GCGGCGCAGC TGAAGCCGTC GCTCGACAAT CTGGCGATCG ACGCGCAGGA CCGCATCTTC GTCTCCAACA TGGCCGACAA CGGCATCCAG GAGGTCGATC CGGCGACCGG CACCGCCAGG CAGGTGATCA TCGGCAAGCT GGCATTTCCG GGCGGCATCG GCGTGGTCTC CGACGGCGGC AAGGACACGA TCTATGTGGC GGATGTTTTC GCCTATCGCA GCGTCGATGG CGCCACCGGC GAGGTCCGCG AACTGGCGCG GATGCATGCC GACGGCGTCA CGCTGGAATA TCCGATGAGC GCCACCGCCA AGGGCAACGA GGTGATGCTG TCGAGCTGGT TCACCGGCAC CGTGCAGGTG ATAGACCGCG CGAGCGGCAA GACCATCGAG ATGCTGCACG ACTTCAAGGC GCCGTATGAT GCGATCCGGC TGGCGAACGG CAAGCTGGTG GTCGCCGAAC TCGGGACCAA ATCGCTGGTC GAAGTCGGCG GCGAGCACGG CAAGGACCGA AAAGCCATCG CCACCGACCT TAACGGCCCG GTCGGCCTCG CCGCGGCGCC CGACGGCGCG GTCTACGTCA CCGAGGCGTT CGCCGGACAA GTGACCAAGA TCGATCCGGC CACCGGCGCC AAGACCGTGG TGGCGAAGGA TCTGAAGATG CCGGAGGGGT TGGCGCTGGC GCCGTCCGGC AAGCTCGTCG TGGCAGAAGT CGGCGCCAAA CGCGTGGTTG AGATCGATCC TGCGACCGGC AAGGTCACCG AACTCGCCGG CAACCTCCCG ATCGGCCTCG TCCCCGCCCC CGGCCTGCCG CCGACCAACA TGCCGACCGG AATCGGCGTC GGCGCCAGCG GCACGATCTA CGTGTCGTCG GACATCGAGA ACGCGATCTA CAAGATCGCG AAGAAGTAG
|
Protein sequence | MRTKLALGAS AMALGLLAVS AARADDYHVT KLVQGSAFHG VHGLGIDKAG RLFAGSVAGA ALYEVDRDKG TAKIAVPTPE GMADDIAFAP DGTMAWTAFL TGDLYARKGD GPIKKLASGL PGINSLAFRK DGRLYATQVF LGDALYEIDV AGEKPPRKIM EKIGGLNGFE FGPDDKLYGP LWFKGQVAAV DVDKGELNVV ADGFKIPAAA NFDSKGNLYV LDTALGQLVR VDIKTGKKQL AAQLKPSLDN LAIDAQDRIF VSNMADNGIQ EVDPATGTAR QVIIGKLAFP GGIGVVSDGG KDTIYVADVF AYRSVDGATG EVRELARMHA DGVTLEYPMS ATAKGNEVML SSWFTGTVQV IDRASGKTIE MLHDFKAPYD AIRLANGKLV VAELGTKSLV EVGGEHGKDR KAIATDLNGP VGLAAAPDGA VYVTEAFAGQ VTKIDPATGA KTVVAKDLKM PEGLALAPSG KLVVAEVGAK RVVEIDPATG KVTELAGNLP IGLVPAPGLP PTNMPTGIGV GASGTIYVSS DIENAIYKIA KK
|
| |
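The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The sketch below shows one way to do that and to compute a GC percentage; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only a short excerpt of the gene sequence (its first two and last two display blocks) is used here, so the printed checks say nothing about the full-length GC content.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove the whitespace separating the display blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# Excerpts copied from the Gene sequence field above.
head = clean_sequence("ATGAGGACCA AACTTGCGCT")
tail = clean_sequence("CAAGATCGCG AAGAAGTAG")

print(head.startswith("ATG"))     # True: the CDS begins with an ATG start codon
print(tail[-3:] in STOP_CODONS)   # True: the CDS ends with TAG
```

To reproduce the reported GC content, pass the full cleaned gene sequence to `gc_percent`.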