Gene Information |
Locus tag | RPC_1724 |
Symbol | |
ID | 3972377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 1870473 |
End bp | 1871480 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637924837 |
Product | peptidase S41 |
Protein accession | YP_531602 |
Protein GI | 90423232 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
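The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene span includes the terminal stop codon. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the span ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1870473, 1871480)
print(length, protein_length_aa(length))
```

Under those assumptions the results match the Gene Length (1008 bp) and Protein Length (335 aa) fields.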
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0570735 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATGC TGGACCGTCG CTCGTCATTC CTGATCGCCT GCTTCGCGAC AGTTTCCACG CTTGTGGCGG CGCCGTCGAT GGCCGTTGCC GAGGTTGTTT CCGGTCCGAC GCAAGCCATG TCCTTGGATC AGACAACGGT CAGGAAGGTG ATTAGCGGCA TCGCTGTGGA GCTGCGCGAA GGTTATGTGT TCGCGGATAA GGGGGTGCAG GCTGCGGATG CGTTAGAGAA AGCATTGGCA GGGAACGCCT ATGCCGGCTT GACCGATAAA GCCCAGTTCG CTCTGCGACT GACTGAACAA CTCCGGGCGA TCACCAAGGA CAGTCATATG AGGGTGATCT TTGGCTCTCC GTTCCGCAAC CAGCCTCCGC AGGCCGCGCC TCAGGGTGCC GGTTTTGAGG TGAAACGATT GGACGGCAAT ATCGGATACA TCCACTTGTC GCGGTTCGTG CCCCCCGAAA TCTTCAATCC GGCGGCCAAC GACGCCATGC GCAGCGTCTC CGATACCGAC GCGCTTCTCA TCGACATCCG GGACAATGGC GGCGGACACC CGGCATCTGT CGCGTACTTT GTCAGTTTCT TCCTCGATCC GGACAAGCGC GTTCATATCA ACGACCTCAT CTGGCGCAAT CGTGGCACAT CGACTTTCAG AACGGAGTCG TTTTGGAGTT CGCCGACGCC GGTATCCTAT CTTGGCAAGC CCGTCTATGT GCTGGTGGGC TCAAAGACCT ATTCGGCGGG CGAGGAGCTT GCCTACGATT TGCAGGTGCT CAAGCGCGCG ACCGTGGTCG GCGAACAAAC CCGCGGGGGA GCGAACCCGG GCGGTCTGAC TGACCTCGGT TCCGACATCT TCGTTGTCGT GCCGACGGGC AGGGCTGAAA ACCCGACCAC GGGCAGCAAT TGGGGGGGCG TCGGCGTGCG TCCTGACGTT CAGGCCAAGC TTGAGACCAC ACAGGAAACT GCCGTGGCTC TGGCGAAGGG GCAGCCAGCA GCGCCGTTGG CGAGATAG
Protein sequence | MQMLDRRSSF LIACFATVST LVAAPSMAVA EVVSGPTQAM SLDQTTVRKV ISGIAVELRE GYVFADKGVQ AADALEKALA GNAYAGLTDK AQFALRLTEQ LRAITKDSHM RVIFGSPFRN QPPQAAPQGA GFEVKRLDGN IGYIHLSRFV PPEIFNPAAN DAMRSVSDTD ALLIDIRDNG GGHPASVAYF VSFFLDPDKR VHINDLIWRN RGTSTFRTES FWSSPTPVSY LGKPVYVLVG SKTYSAGEEL AYDLQVLKRA TVVGEQTRGG ANPGGLTDLG SDIFVVVPTG RAENPTTGSN WGGVGVRPDV QAKLETTQET AVALAKGQPA APLAR
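The sequences above are printed in space-separated blocks of ten characters, so the spaces need to be removed before any computation. As a minimal sketch, the helpers below join the blocks and compute a GC percentage of the kind reported in the GC content field. The function names are hypothetical, and the record does not say how its 61% figure was calculated or rounded.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the Gene sequence field, used here only as a short demo.
demo = "ATGCAGATGC TGGACCGTCG"
print(len(clean_sequence(demo)), gc_percent(demo))
```

Passing the full Gene sequence field to `gc_percent` gives the value to compare against the reported GC content.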