Gene Rru_A2201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A2201 
Symbol 
ID3835628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp2555293 
End bp2556819 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content65% 
IMG OID637826303 
Productpeptidase S1C, Do 
Protein accessionYP_427288 
Protein GI83593536 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTTCAAT CCGTGCTTGA TCGCACCGCT TCCAACGGAG CCGGGTCCGC GGGCCGCCTG 
CGCCGCGCCC TGGCGCCGGC CCTGGTCGCC TTCTCCCTGG CCATCGCGCC GTTCTCCGCC
CAGGCCCGCG AGATCCCCGA AAGCTTCGCC GATCTCGCCG AGGGTTTGTT GCCCGCGGTG
GTCAACATCT CCACCACCCA GACCATCGAT GCCGATCGTG GTCCGGAAAT GCCCCAGTTC
CCGCCGGGGT CGCCCTTCGA GGAGTTCTTC AAGGATTTCT TCGAACGCCA TGGCGGGCCG
GGCGGCGGCG TCCAGCCCAA GACCCCGCGC CGGGCGACCT CGCTGGGCTC GGGCTTCATC
GTCGACGCCG CGGGCTATAT CGTCACCAAC AACCACGTCA TCCAGGATGC CGACGAGATC
ACGGTGATCT TGCATGACGA TACGGCGATC AAGGCCGAGC TGGTCGGCAA GGACGAAAAG
ACCGACGTCG CCCTGCTGCG CATCAAGACC GACAAGCCGC TGACCGCCGT GCCCTGGGGC
AATTCGGAAG CCGCCCGGGT GGGTGATTGG GTGATGGCGA TCGGCAATCC CTTCGGCCTC
GGCGGCACGG TGACCGCCGG CATCATCAGC GCCAAGACCC GCGATATCAA CGCCGGACCC
TATGACAGCT TCATCCAGAC CGATGCGGCG ATCAATAAGG GCAATTCCGG TGGGCCGCTG
TTCAACATGC ATGGCGAGGT GATCGGCATC AATACGGCGA TCTTCAGCCC CTCGGGCGGG
TCGATCGGCA TCGGTTTCTC GGTGCCCTCC AACCTTGCCC ATCAGGTGAT CGACGATATC
AAGAAGTTCG GCCGTACCCG TCGCGGCTGG ATTGGCGTGC GCATCCAGTC GGTGACCGAC
GAGATCGCCG AGGGCCTGGG TCTTGAAAAA TCCGCCGGCG CCCTGATCGC CGCCGTTACC
CCCGGTGGGC CTGCGGCCGC CGCCGGTCTC AAGGTCGGCG ATGTCATCGT GTCGTTCGAT
GGCCGTCCGG TCCCCGATAT GCGGACCCTG CCGCGGATCG TCGCCGAAAC CGAAATCGGC
AAGGACGCCG CCATCGGTGT CTGGCGCGAG GGCAAGCGCC AGGACCTGAA GATGAAGGTC
GGTGAACTGG AGGTGGCCGA GGACGAGGGG TTGCTGAACG AACCCGAAAC CAGCGGCGTC
ACCGATTCCC AGGGCGGCAC CGCCGTCGCC ACCATCGGCC TGACGGTGAC CAAGCTCGAT
GACCGCCTGC GCAGCCAGTT CGGCTTCGAT GCCGCCTCGG AAGGCGTGGT GGTGACCGAT
GTCACCAATG AAAGCGACGC CCAGGAAAAG GGTCTGGAGC CCGGGACGCT GATCGTCAAG
ATCAACCAGA CCGAGGTCGC CTCGCCCGAA GACGTGGTCA AGGCGGTGGC CAAGGCCAAG
GACGAGGGAC GCAAGACCGT TCTTCTGCTG GTCGAGCTGC GCGGAACCCG GACCTTCATT
CCGGTCAAAA TGGCCGACAA GAAGTAG
 
Protein sequence
MFQSVLDRTA SNGAGSAGRL RRALAPALVA FSLAIAPFSA QAREIPESFA DLAEGLLPAV 
VNISTTQTID ADRGPEMPQF PPGSPFEEFF KDFFERHGGP GGGVQPKTPR RATSLGSGFI
VDAAGYIVTN NHVIQDADEI TVILHDDTAI KAELVGKDEK TDVALLRIKT DKPLTAVPWG
NSEAARVGDW VMAIGNPFGL GGTVTAGIIS AKTRDINAGP YDSFIQTDAA INKGNSGGPL
FNMHGEVIGI NTAIFSPSGG SIGIGFSVPS NLAHQVIDDI KKFGRTRRGW IGVRIQSVTD
EIAEGLGLEK SAGALIAAVT PGGPAAAAGL KVGDVIVSFD GRPVPDMRTL PRIVAETEIG
KDAAIGVWRE GKRQDLKMKV GELEVAEDEG LLNEPETSGV TDSQGGTAVA TIGLTVTKLD
DRLRSQFGFD AASEGVVVTD VTNESDAQEK GLEPGTLIVK INQTEVASPE DVVKAVAKAK
DEGRKTVLLL VELRGTRTFI PVKMADKK