Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2201 |
Symbol | |
ID | 3835628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 2555293 |
End bp | 2556819 |
Gene Length | 1527 bp |
Protein Length | 508 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637826303 |
Product | peptidase S1C, Do |
Protein accession | YP_427288 |
Protein GI | 83593536 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTCAAT CCGTGCTTGA TCGCACCGCT TCCAACGGAG CCGGGTCCGC GGGCCGCCTG CGCCGCGCCC TGGCGCCGGC CCTGGTCGCC TTCTCCCTGG CCATCGCGCC GTTCTCCGCC CAGGCCCGCG AGATCCCCGA AAGCTTCGCC GATCTCGCCG AGGGTTTGTT GCCCGCGGTG GTCAACATCT CCACCACCCA GACCATCGAT GCCGATCGTG GTCCGGAAAT GCCCCAGTTC CCGCCGGGGT CGCCCTTCGA GGAGTTCTTC AAGGATTTCT TCGAACGCCA TGGCGGGCCG GGCGGCGGCG TCCAGCCCAA GACCCCGCGC CGGGCGACCT CGCTGGGCTC GGGCTTCATC GTCGACGCCG CGGGCTATAT CGTCACCAAC AACCACGTCA TCCAGGATGC CGACGAGATC ACGGTGATCT TGCATGACGA TACGGCGATC AAGGCCGAGC TGGTCGGCAA GGACGAAAAG ACCGACGTCG CCCTGCTGCG CATCAAGACC GACAAGCCGC TGACCGCCGT GCCCTGGGGC AATTCGGAAG CCGCCCGGGT GGGTGATTGG GTGATGGCGA TCGGCAATCC CTTCGGCCTC GGCGGCACGG TGACCGCCGG CATCATCAGC GCCAAGACCC GCGATATCAA CGCCGGACCC TATGACAGCT TCATCCAGAC CGATGCGGCG ATCAATAAGG GCAATTCCGG TGGGCCGCTG TTCAACATGC ATGGCGAGGT GATCGGCATC AATACGGCGA TCTTCAGCCC CTCGGGCGGG TCGATCGGCA TCGGTTTCTC GGTGCCCTCC AACCTTGCCC ATCAGGTGAT CGACGATATC AAGAAGTTCG GCCGTACCCG TCGCGGCTGG ATTGGCGTGC GCATCCAGTC GGTGACCGAC GAGATCGCCG AGGGCCTGGG TCTTGAAAAA TCCGCCGGCG CCCTGATCGC CGCCGTTACC CCCGGTGGGC CTGCGGCCGC CGCCGGTCTC AAGGTCGGCG ATGTCATCGT GTCGTTCGAT GGCCGTCCGG TCCCCGATAT GCGGACCCTG CCGCGGATCG TCGCCGAAAC CGAAATCGGC AAGGACGCCG CCATCGGTGT CTGGCGCGAG GGCAAGCGCC AGGACCTGAA GATGAAGGTC GGTGAACTGG AGGTGGCCGA GGACGAGGGG TTGCTGAACG AACCCGAAAC CAGCGGCGTC ACCGATTCCC AGGGCGGCAC CGCCGTCGCC ACCATCGGCC TGACGGTGAC CAAGCTCGAT GACCGCCTGC GCAGCCAGTT CGGCTTCGAT GCCGCCTCGG AAGGCGTGGT GGTGACCGAT GTCACCAATG AAAGCGACGC CCAGGAAAAG GGTCTGGAGC CCGGGACGCT GATCGTCAAG ATCAACCAGA CCGAGGTCGC CTCGCCCGAA GACGTGGTCA AGGCGGTGGC CAAGGCCAAG GACGAGGGAC GCAAGACCGT TCTTCTGCTG GTCGAGCTGC GCGGAACCCG GACCTTCATT CCGGTCAAAA TGGCCGACAA GAAGTAG
|
Protein sequence | MFQSVLDRTA SNGAGSAGRL RRALAPALVA FSLAIAPFSA QAREIPESFA DLAEGLLPAV VNISTTQTID ADRGPEMPQF PPGSPFEEFF KDFFERHGGP GGGVQPKTPR RATSLGSGFI VDAAGYIVTN NHVIQDADEI TVILHDDTAI KAELVGKDEK TDVALLRIKT DKPLTAVPWG NSEAARVGDW VMAIGNPFGL GGTVTAGIIS AKTRDINAGP YDSFIQTDAA INKGNSGGPL FNMHGEVIGI NTAIFSPSGG SIGIGFSVPS NLAHQVIDDI KKFGRTRRGW IGVRIQSVTD EIAEGLGLEK SAGALIAAVT PGGPAAAAGL KVGDVIVSFD GRPVPDMRTL PRIVAETEIG KDAAIGVWRE GKRQDLKMKV GELEVAEDEG LLNEPETSGV TDSQGGTAVA TIGLTVTKLD DRLRSQFGFD AASEGVVVTD VTNESDAQEK GLEPGTLIVK INQTEVASPE DVVKAVAKAK DEGRKTVLLL VELRGTRTFI PVKMADKK
|
| |