Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2041 |
Symbol | |
ID | 3909856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2320339 |
End bp | 2321835 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883934 |
Product | peptidase S1C, Do |
Protein accession | YP_485659 |
Protein GI | 86749163 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTTG CGATCACCGC CTTGAGCTTC CGGTTCAAGC CTCTGCTGAC GGCTTTGTGC CTCGGCGCGG CTGCCGCCTT GACCGCGGCG CCGGCGCAGG CCCGCGGCCC CGAGGGCATC GCGGACGTCG CCGAGAAGGT CATCGACGCC GTGGTCAACA TCTCGACCAG CCAGACCGTC GAGGCCAAGA GTGCGCCCAG CGAGGGCAAC AGCGCCAAGC CGAATCTGCC GCCGGGGTCG CCCTTCGAGG AGTTTTTCGA GGACTTCTTC AAGAACCGCC GCGGCGAGAA GGGCGGTGGC GGCCCGCGCA AGACCAACTC GCTCGGCTCG GGCTTCATCG TCGACACCGC CGGCATTGCC GTGACCAACA ATCACGTCAT CGCCGACGCC GACGAGATCA ACCTGATCAT GAACGACGGC ACCAAAATCA AGGCGGAGCT GGTCGGCGTC GACAAGAAGA CCGATCTGGC GGTGCTGAAG TTCAAGCCGC CGGCGAACAA GCCGCTGGTG GCGGTGAAGT TCGGCGACAG TGACAAGCTG CGGCTCGGCG AATGGGTGGT GGCGATCGGC AACCCGTTCT CGCTCGGCGG CACGGTCACC GCCGGCATCG TCTCGGCGCG CAACCGCGAC ATCAATTCGG GGCCGTATGA CAGCTACATC CAGACCGACG CCGCGATCAA TCGCGGCAAT TCCGGCGGCC CGCTGTTCAA CCTCGACGGC GAAGTCATCG GCGTCAACAC GCTGATCATC TCGCCGTCCG GCGGCTCGAT CGGCATCGGA TTCGCGGTGC CTTCGAAGAC CGTGGTCGGG GTGGTCGATC AGCTCCGCCA GTTCGGCGAG CTGCGCCGCG GCTGGCTCGG CGTGCGGATC CAGCAGGTCA CCGACGAGAT CGCCGAAAGC CTCAACATCA AGCCGGCGCG CGGCGCGCTG GTCGCCGGCA TCGACGACAA GGGCCCGGCC AAGCCCGCCG GCATCGAGCC CGGCGACGTC GTCGTCAAGT TCGACGGCAA GGACGTCAAG GAGCCGAAGG ATCTGTCGCG CGTGGTCGCC GACACGGCGG TCGGCAAGAC CGTCGACGTG GTGATCATCC GCAAGGGCAA GGAAGAGACC AAGCAGGTCA CGCTCGGCCG CCTCGACGAC GGCGCCAAGC CGCAGCCGGC CTCCGCGAAG TCGCAGCCGG AGCCGGAAAA GCCGGTGACA CAGAAGGCGC TCGGGCTCGA CCTCGCCGCG CTGTCGAAGG ACCTGCGCGG CAAGTACAAG ATCAAGGACA GCGTCAAGGG CGTCGTCGTG GTCGGCGTCG ACACCGGCTC CGATGCCGCC GAGAAGCGGC TGTCGGCCGG CGACGTGATC GTCGAAGTGG CGCAGGAAGC GGTCACCAGC GCCGCCGATA TCAAGAAGCG GATCGATCAG GTCAAGAAGG ACGGCAAGAA GTCGGTGCTG CTGCTGGTTT CGAACGGAGC CGGCGAACTG CGCTTCGTGG CGCTCAGCCT GCAATAG
|
Protein sequence | MPVAITALSF RFKPLLTALC LGAAAALTAA PAQARGPEGI ADVAEKVIDA VVNISTSQTV EAKSAPSEGN SAKPNLPPGS PFEEFFEDFF KNRRGEKGGG GPRKTNSLGS GFIVDTAGIA VTNNHVIADA DEINLIMNDG TKIKAELVGV DKKTDLAVLK FKPPANKPLV AVKFGDSDKL RLGEWVVAIG NPFSLGGTVT AGIVSARNRD INSGPYDSYI QTDAAINRGN SGGPLFNLDG EVIGVNTLII SPSGGSIGIG FAVPSKTVVG VVDQLRQFGE LRRGWLGVRI QQVTDEIAES LNIKPARGAL VAGIDDKGPA KPAGIEPGDV VVKFDGKDVK EPKDLSRVVA DTAVGKTVDV VIIRKGKEET KQVTLGRLDD GAKPQPASAK SQPEPEKPVT QKALGLDLAA LSKDLRGKYK IKDSVKGVVV VGVDTGSDAA EKRLSAGDVI VEVAQEAVTS AADIKKRIDQ VKKDGKKSVL LLVSNGAGEL RFVALSLQ
|
| |