Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2749 |
Symbol | |
ID | 3910542 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3132596 |
End bp | 3134113 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637884649 |
Product | proline-rich region |
Protein accession | YP_486362 |
Protein GI | 86749866 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.174945 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.312279 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC GATATCAGGA CCGACCCTAC CTGGCGGAAG GCGAGGCCCA CGCAGCTTAC GCCAAGCGCA ATCCCGAGAA TGACCCGCTC GCTGAACTCG CACGATTGAT CGGTCAGACC GATCCGTTCG GGCAGGAAGC ACCGCCATCG GGCCGGCCGT CGTCGCGTTC GACCGACTTT CGTCTGAGCA GCCCGCCGCC GATCGAAGAC GAGGTGCCGC CGCCGACCCC CTCATGGTTG CAGCATCGTC GTGCTGCCGA TCCAGCACCG TTGTCGCCGC CCGAACCCGA GCCTGACTTC AGCAGGCCGC CGTCCTTCGT CACAGCCGCA TCGCCCCGGC TGGCCGACCC CGTCTATGAT CCAGCGCCGT TCGATCGGCA ATCGTTCGAT CAGGTCGCCG AAGAGCCGAA TAATTACGAC CCTCACTATG CTGTGCAGCA GCCGTTGCCG TTGGAGCCGC CGCAATTCGT ACCCGGCCGC TACGACGATG CGCTGTATGG CCAGCTCGAT CCCGCCGACG TTCGTCCGGA TCCGAACTAT CCCGAGGCGC CTTATGGCTA CGATGACGGA TATGCCGACG AGCCGGACAG TCGGGCGTAC AAGCCCCGTC GCAACAACAT GATGACTGTG GCCGCTGTGT TGGCACTCGC CGTTGTCGGC ACCGGCGGCG CGTTCGCCTA TCGCAGCTTC ACCAGCGGCC CCCGCACCGG CGAACCGCCG GTGATCAAGG CCGACGCCAG CCCGACCAAG GTGATGGCTG CGCCGTCGGC CTCCGCAGAT GCTGCAGGCA AACCGATTCA GGACCGGCTC GCCGCCGGCA ACAACATCGA AGCACTGGTG TCCCGCGAAG AGCAGCCCGC CGACCCGTCG CGTGCTGGGC AGGGTACGCG CGTGGTGCTG CCGCAACTCA ACCAGAATCC GAATCCGCCC GCGGTGTCCG CCGTTGCGCC CGGGCCGAAG CCCAACCTTC CGCCGCCCAA CAACGGCACG ATTGCCGGCG AGGAGCCTCG GCGGATCAAG ACCTTCAGCA TCCGTCCCGA CCAGGGCGAT CCGGGCGCGG CGCCGGTCAA TGTGGCTGCT CCTGCGACCC GGCAGGCGTC CCGCGCGCCG GCTCCGACGG CGCCTGCGCA GCGCCCGGCA GCGCGCCAGC TTGAAGATGC CAATGCTTCC GCGGGCAATA CGCCGCTGTC GCTGGCGCCG AATTCAGGCG GGTCGCCAGC TGCCAACCAG CGCGTCGCAG CGCTGCCGCC CACGGAATCG GCCGGCGCGG GAGGCTATGT GGTGCAGGTG TCGTCGCAGC GCAGCGAGGC CGACGCGAAA TCGTCGTATC GGACGCTGCA GGGTAAGTTC CCGTCCGTGC TCGGCCAGCG CGCGCCGTTG ATCAAGCGCG CCGATCTCGG CAGCAAGGGC GTGTATTATC GCGCCATGGT CGGCCCATTC GGCAGTTCCG AAGAGGCGTC GAGACTCTGC GGCAACCTGA AAAGTGCCGG CGGACAGTGC GTCGTCCAGA GGAATTAA
|
Protein sequence | MTDRYQDRPY LAEGEAHAAY AKRNPENDPL AELARLIGQT DPFGQEAPPS GRPSSRSTDF RLSSPPPIED EVPPPTPSWL QHRRAADPAP LSPPEPEPDF SRPPSFVTAA SPRLADPVYD PAPFDRQSFD QVAEEPNNYD PHYAVQQPLP LEPPQFVPGR YDDALYGQLD PADVRPDPNY PEAPYGYDDG YADEPDSRAY KPRRNNMMTV AAVLALAVVG TGGAFAYRSF TSGPRTGEPP VIKADASPTK VMAAPSASAD AAGKPIQDRL AAGNNIEALV SREEQPADPS RAGQGTRVVL PQLNQNPNPP AVSAVAPGPK PNLPPPNNGT IAGEEPRRIK TFSIRPDQGD PGAAPVNVAA PATRQASRAP APTAPAQRPA ARQLEDANAS AGNTPLSLAP NSGGSPAANQ RVAALPPTES AGAGGYVVQV SSQRSEADAK SSYRTLQGKF PSVLGQRAPL IKRADLGSKG VYYRAMVGPF GSSEEASRLC GNLKSAGGQC VVQRN
|
| |