Gene Information |
Locus tag | Rru_A3569 |
Symbol | |
ID | 3837025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 4114465 |
End bp | 4115454 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637827693 |
Product | O-sialoglycoprotein endopeptidase |
Protein accession | YP_428650 |
Protein GI | 83594898 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
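The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; both are assumptions about the record's conventions, not statements taken from it. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4114465
END_BP = 4115454
GENE_LENGTH_BP = 990
PROTEIN_LENGTH_AA = 329


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length:", span_length(START_BP, END_BP))
    print("expected protein length:", expected_protein_length(GENE_LENGTH_BP))
```

Under those assumptions the span length agrees with the Gene Length field and the expected protein length agrees with the Protein Length field.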
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAAG CACTCCTCTC CCAGGTTGAG GAGCACCGCC CCTATGGCGG GGTCGTGCCC GAAATCGCCG CGCGCTCACA CCTTGACCAC GTGGACTCCC TGGTGATCCG CGCCCTTGGC GAGGCCGGGC TGACGGTTCA CGACATCGAC GCCGTCGCCG CCACCGGCGG ACCCGGGCTG ATCGGCGGGG TGATCGTCGG CGTGATGACG GCCAAGGCCA TCGCCCAGGT GGCGGGCAAG CCGTTCATCG CCGTCAACCA TCTGGAAGGC CACGCCCTGA CCGTGCGGAT GACCGCCGGC ATCGATTTCC CCTATCTGCT GCTTCTGGCC TCGGGCGGGC ATTGCCAGCT TCTGGCGGTC GAGGGGGTGG GGCGGGCCAA GCGCCTGGGC ACCACCATCG ACGACGCCGC CGGCGAGGCC TTCGACAAGG TGGCCAAGAT GCTGGGCCTG GGTTATCCGG GCGGACCGGC GGTGGAGCGG GCGGCCCGGC GCGGCGATCC CCGGCGCTTT CGCCTGCCGC GCCCCCTGCT CGACCGCCCG GGCTGCGACC TGTCTTTCTC GGGGCTGAAG ACCGCCGTGC GCCAGACCGT GGAAAAGCTG CCGCCCGGGC CGTTGAGCGA GGGCGATATC GCCGATCTCT GCGCCAGTTT CCAGGCCGCC GTCGCCGACT GTCTGGCCGA CCGCTGCCGG GTGGCCGCCG GGATCTTCAG CGCCCGCCAT GGCCGTGGCC GGCCGCTGGT GGTGGCCGGC GGCGTGGCGG CCAACGCCAG CTTGCGCGCC GCCCTGACCG AGGTCGCCCG CCAAGCCGAT ATGACCTTCG TCGCCCCGCC CTTGGCGCTG TGCACCGACA ACGCGGCGAT GATCGCCTGG GTCGGCGTCG AGCGCCTGCG CCTGGGGCTG GTCGACACCA TGGACTTCAA ACCCCGCCCG CGCTGGCCGC TCGACCCCGA CGCGCCCAAG GCGGCCGGAG CCGGAGGCGT AAAAGCTTAA
|
Protein sequence | MAEALLSQVE EHRPYGGVVP EIAARSHLDH VDSLVIRALG EAGLTVHDID AVAATGGPGL IGGVIVGVMT AKAIAQVAGK PFIAVNHLEG HALTVRMTAG IDFPYLLLLA SGGHCQLLAV EGVGRAKRLG TTIDDAAGEA FDKVAKMLGL GYPGGPAVER AARRGDPRRF RLPRPLLDRP GCDLSFSGLK TAVRQTVEKL PPGPLSEGDI ADLCASFQAA VADCLADRCR VAAGIFSARH GRGRPLVVAG GVAANASLRA ALTEVARQAD MTFVAPPLAL CTDNAAMIAW VGVERLRLGL VDTMDFKPRP RWPLDPDAPK AAGAGGVKA
|
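The relationship between the gene and protein sequences can be illustrated with a minimal translation sketch. It uses only the first and last 30 bases of the gene sequence as printed above, not the full sequence. The codon table is the standard one; translation table 11 assigns the same amino acids to codons and differs from the standard table only in which codons may act as starts. The helper names (clean, gc_fraction, translate) are hypothetical, and the GC helper is shown on a fragment only, so it does not reproduce the whole-gene GC content field.

```python
from itertools import product

# Standard codon table in TCAG order. Table 11 uses the same
# codon-to-amino-acid assignments; only the start codons differ.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Fragments copied from the record: first and last 30 bases of the gene.
GENE_PREFIX = "ATGGCCGAAG CACTCCTCTC CCAGGTTGAG"
GENE_SUFFIX = "GCGGCCGGAG CCGGAGGCGT AAAAGCTTAA"


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate(GENE_PREFIX))
    print(translate(GENE_SUFFIX))
    print(gc_fraction(GENE_PREFIX))
```

Applied to the two fragments, the translation reproduces the first ten and last nine residues of the protein sequence listed above, with the final TAA codon read as the stop.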
| |