Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A2662 |
Symbol | |
ID | 3836097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | - |
Start bp | 3092389 |
End bp | 3093864 |
Gene Length | 1476 bp |
Protein Length | 491 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637826768 |
Product | peptidase S1C, Do |
Protein accession | YP_427746 |
Protein GI | 83593994 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAAAG CGGGGGAGGA CCGGATGGAC CAATCGGGGC GGTGGGCCAG GGGCGTTCTG GCCGGGGCGA GCGTCGCGGT CTTGCTGGCG TTCGGGGGAC CGGCGGCGGG GGCGGCCGAA CAGGTGCCAG GGGCCGGGAC GGCCGAACAG GTGCCGATGG CGGCGGCCGA GATCAAGCTG ACCTTCGCGC CGGTGGTCCG CGCCGTCGCT CCGGCGGTGG TCAATATCTT CAGCCAGCGC GTCATCACCG AGTCGCAAGT TCCGCCGATG TTCGCCGACC CGCTGTTCCG CCGCTTCCTC GAGGAGCGCG GCATGCTGGG CAAGCCGCGC GAGCGGGTTC AGCGGTCCTT GGGCTCGGGG GTGATCTTGC GCTCCGAGGG CGTGGTGGTG ACCAATGCCC ATGTGGTCAA TGGCGCGAGC GAAATCACCG TGGCCCTCAA CGATCGCCGG GAATTCCCCG CCGAACTGGT CGGGCTTGAT CCGCGGGCCG ATCTGGCGGT GCTGCGGATC AAATCCGATA CGCCGCTGCC GTCGCTGGCC CTGGCCGACG GCGAGCCGCC CGAGGTTGGC GATCTGGTTT TGGCCATCGG CAATCCCTTT GGCGTCGGTC AAACGGTCAC CAGCGGCATC GTTTCGGCCC AGGCGCGGAC CACGGCGGGA ATTTCCGATT ACCGCTTCTT CATTCAGACC GACGCGGCGA TCAACCCCGG CAATTCCGGC GGCGCCCTGG TCGATCTGAG CGGCCGGCTG GTCGGCATCA ACACCGCCAT TTATTCGCGC GACGGCGGCA GCGTCGGCAT CGGCTTCGCC ATTCCGGTCG AAATGGTGCG CTCGGTGGTC GAGGGGATCT TGGAAGACGG CAAGGTCCGC CACCCCTGGC TGGGCGCCGA TGGCCAGTCG GTGACGACCG AACTGGCAAG CCACATGGGC TTGGATCGCC CGCTGGGCGT CGCCATCACC GATGTGGCCA AGGGCGGGCC GGCGGCCAAG GCCGGGCTTG CCGAAGGCGA TGTCATCCTG GCGCTTGATG GCCGGCCGGT GTTCGAGGGG GAAACCCTGC GCTATCGCAT CGCCACCCAC CGCCCCGGCG ACAAGGTGGT GTTGGGGATC CGCCGCGACG GCAAAGATAC CACCCTGGCC GTCACCCTGG AGGCGCCGCC GGAGGATCCG CCGCGCGATA CCACCTTGCT CAAAGGCTCC CATCCGTTCG ACGGGGCGGA GGTTTCCAAT ATGAATCCGG CCCTGGCCGA AGAACTGGGG TTGGAGAAGG CCTCCCGGGG GGTGACGGTC ACCGGCCTGG GCGATGGCGT CGCCGCGCGG ATCGGCATCC GCCCCGGGGA CCGGGTCATC GAGGTCAACG GCAAGGCCAT TGACGCGGTT AGGGACCTGA AGGCGGCGAT CAGCAAACCC AGCCGGGTGT GGCAGGTGGT GGTCGATCGC GACGGTCGGG CGGTGCGGCT GGTGCTCGGC GGCTGA
|
Protein sequence | MIKAGEDRMD QSGRWARGVL AGASVAVLLA FGGPAAGAAE QVPGAGTAEQ VPMAAAEIKL TFAPVVRAVA PAVVNIFSQR VITESQVPPM FADPLFRRFL EERGMLGKPR ERVQRSLGSG VILRSEGVVV TNAHVVNGAS EITVALNDRR EFPAELVGLD PRADLAVLRI KSDTPLPSLA LADGEPPEVG DLVLAIGNPF GVGQTVTSGI VSAQARTTAG ISDYRFFIQT DAAINPGNSG GALVDLSGRL VGINTAIYSR DGGSVGIGFA IPVEMVRSVV EGILEDGKVR HPWLGADGQS VTTELASHMG LDRPLGVAIT DVAKGGPAAK AGLAEGDVIL ALDGRPVFEG ETLRYRIATH RPGDKVVLGI RRDGKDTTLA VTLEAPPEDP PRDTTLLKGS HPFDGAEVSN MNPALAEELG LEKASRGVTV TGLGDGVAAR IGIRPGDRVI EVNGKAIDAV RDLKAAISKP SRVWQVVVDR DGRAVRLVLG G
|
| |