Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A0024 |
Symbol | |
ID | 3834125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 28454 |
End bp | 29599 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637824094 |
Product | peptidase |
Protein accession | YP_425116 |
Protein GI | 83591364 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.02023 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCGAA GGCGGTGCGG TGTCTGGTCC ATGCTTGGCG GTGCGGCCCT GGTTTTATCG ACGATCGGTC CGGCCGGCGC CCTGCCGCCC GCGCCAGCCG CCGCCGGGGC CGGTCTGGGG CCGTCTTTGG CGCTCACCCC CGTGTTCATG GTTGGCGAGG AGGGCGTGCT CACCCTGGCG CCGACGCTTG AGATCGTCAC TCCGGCGGTG GTCAATATCG CGGTGAAGGC CACGGTGGCG GCGCGGCCCA ATCCCTTGCT GTCCGATCCG CTGTTTCGCC AGTTCTTCGG CGTGCCGCCC GGGGCCGAAG GCCCGCGCGA GCGCACGGTG GTATCGGCCG GGTCGGGGGT GATCGTCGAT GCGGTGCGCG GCACCATCTT GACCAACCAC CATGTCGTCG ACGGCGCCGA GGATATCACC GTCACCCTCA AGGATCGCCG GGTGCTCAAG GCGACGCTGC TGGGCAGCGA TCCCGGCACC GACATCGCCG TGCTCCGCGT CAAGGCCGAT CGTCTGACCG CCTTGCATCT GGCCGATTCG GATCGGGCCC AGGTTGGCGA TCTGACCATC GCCATCGGCA ATCCCTTCGG TCTGGGCCAA ACGGTGACCA CCGGGGTGAT CAGCGCCAAG GGGCGCAGCG GCGTTATCCC CGACGGCTAC GAGGATTTCC TGCAGACCGA CGCGTCGATC AACCCGGGCA ATTCCGGGGG CGCCCTGGTC AATTCCCGGG GCGATCTGGT TGGCATCAAT ACCGCGATCT TGTCGTCGGG CGGCGGCAGC GTCGGCATCG GCTTTGCCAT TCCCAGCAAT ATCGCCCGCG CGGTGATGGA ACAGATCCTC AAGGACGGAA CGGTTCGGCG CGGTCATCTT GGCGTGTCGA TCCAGACCGT CAGTCCGGCC GTGGCCGAAA GCCTGGGCCT GCCCCGGGCG GCCGGGGTTA TCATCGCCGC GGTCGAGCGG GGATCGACCG CCGAAAAAGT CGGGCTGCGC ACCGGCGATG TGATCTTGGC GGTCGACGGC AGGCCTTCGG AAACCGCCGA GGTGCTGCGC CGCCAGATTG GCCTTGCCCA GATCGGCGAC CGGGTGAGGC TGACGGTGAT GCGCGAGGGC AAATCCTTCG ATCTTCAGGC CCGCATCGGC TCATGA
|
Protein sequence | MTRRRCGVWS MLGGAALVLS TIGPAGALPP APAAAGAGLG PSLALTPVFM VGEEGVLTLA PTLEIVTPAV VNIAVKATVA ARPNPLLSDP LFRQFFGVPP GAEGPRERTV VSAGSGVIVD AVRGTILTNH HVVDGAEDIT VTLKDRRVLK ATLLGSDPGT DIAVLRVKAD RLTALHLADS DRAQVGDLTI AIGNPFGLGQ TVTTGVISAK GRSGVIPDGY EDFLQTDASI NPGNSGGALV NSRGDLVGIN TAILSSGGGS VGIGFAIPSN IARAVMEQIL KDGTVRRGHL GVSIQTVSPA VAESLGLPRA AGVIIAAVER GSTAEKVGLR TGDVILAVDG RPSETAEVLR RQIGLAQIGD RVRLTVMREG KSFDLQARIG S
|
| |