Gene Rru_A0024 details

Gene Information

Locus tag: Rru_A0024
Symbol:
ID: 3834125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 28454
End bp: 29599
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 68%
IMG OID: 637824094
Product: peptidase
Protein accession: YP_425116
Protein GI: 83591364
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID: [TIGR02037] periplasmic serine protease, Do/DeqQ family
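
The length fields in this record agree with its coordinates. The sketch below shows the arithmetic, assuming that Start bp and End bp are 1-based inclusive positions and that the terminal stop codon is not counted in Protein Length. The record states neither convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(28454, 29599)
print(length, protein_length(length))  # 1146 381
```

Both values match the Gene Length (1146 bp) and Protein Length (381 aa) fields.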


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.02023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCGAA GGCGGTGCGG TGTCTGGTCC ATGCTTGGCG GTGCGGCCCT GGTTTTATCG 
ACGATCGGTC CGGCCGGCGC CCTGCCGCCC GCGCCAGCCG CCGCCGGGGC CGGTCTGGGG
CCGTCTTTGG CGCTCACCCC CGTGTTCATG GTTGGCGAGG AGGGCGTGCT CACCCTGGCG
CCGACGCTTG AGATCGTCAC TCCGGCGGTG GTCAATATCG CGGTGAAGGC CACGGTGGCG
GCGCGGCCCA ATCCCTTGCT GTCCGATCCG CTGTTTCGCC AGTTCTTCGG CGTGCCGCCC
GGGGCCGAAG GCCCGCGCGA GCGCACGGTG GTATCGGCCG GGTCGGGGGT GATCGTCGAT
GCGGTGCGCG GCACCATCTT GACCAACCAC CATGTCGTCG ACGGCGCCGA GGATATCACC
GTCACCCTCA AGGATCGCCG GGTGCTCAAG GCGACGCTGC TGGGCAGCGA TCCCGGCACC
GACATCGCCG TGCTCCGCGT CAAGGCCGAT CGTCTGACCG CCTTGCATCT GGCCGATTCG
GATCGGGCCC AGGTTGGCGA TCTGACCATC GCCATCGGCA ATCCCTTCGG TCTGGGCCAA
ACGGTGACCA CCGGGGTGAT CAGCGCCAAG GGGCGCAGCG GCGTTATCCC CGACGGCTAC
GAGGATTTCC TGCAGACCGA CGCGTCGATC AACCCGGGCA ATTCCGGGGG CGCCCTGGTC
AATTCCCGGG GCGATCTGGT TGGCATCAAT ACCGCGATCT TGTCGTCGGG CGGCGGCAGC
GTCGGCATCG GCTTTGCCAT TCCCAGCAAT ATCGCCCGCG CGGTGATGGA ACAGATCCTC
AAGGACGGAA CGGTTCGGCG CGGTCATCTT GGCGTGTCGA TCCAGACCGT CAGTCCGGCC
GTGGCCGAAA GCCTGGGCCT GCCCCGGGCG GCCGGGGTTA TCATCGCCGC GGTCGAGCGG
GGATCGACCG CCGAAAAAGT CGGGCTGCGC ACCGGCGATG TGATCTTGGC GGTCGACGGC
AGGCCTTCGG AAACCGCCGA GGTGCTGCGC CGCCAGATTG GCCTTGCCCA GATCGGCGAC
CGGGTGAGGC TGACGGTGAT GCGCGAGGGC AAATCCTTCG ATCTTCAGGC CCGCATCGGC
TCATGA
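
The GC content field can be recomputed from a sequence printed in 10-base groups like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C among unambiguous bases. How the source database rounds the value or handles ambiguous bases is not stated, and `gc_percent` is a hypothetical helper.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G/C among A, C, G, T; whitespace and other symbols are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Works directly on text in the grouped layout used above.
print(gc_percent("ATGACGCGAA"))  # 50.0
```

Passing the full gene sequence, with its spaces and line breaks left in, gives the value to compare against the reported GC content.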
 
Protein sequence
MTRRRCGVWS MLGGAALVLS TIGPAGALPP APAAAGAGLG PSLALTPVFM VGEEGVLTLA 
PTLEIVTPAV VNIAVKATVA ARPNPLLSDP LFRQFFGVPP GAEGPRERTV VSAGSGVIVD
AVRGTILTNH HVVDGAEDIT VTLKDRRVLK ATLLGSDPGT DIAVLRVKAD RLTALHLADS
DRAQVGDLTI AIGNPFGLGQ TVTTGVISAK GRSGVIPDGY EDFLQTDASI NPGNSGGALV
NSRGDLVGIN TAILSSGGGS VGIGFAIPSN IARAVMEQIL KDGTVRRGHL GVSIQTVSPA
VAESLGLPRA AGVIIAAVER GSTAEKVGLR TGDVILAVDG RPSETAEVLR RQIGLAQIGD
RVRLTVMREG KSFDLQARIG S
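
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes the standard codon assignments, which translation table 11 shares with the standard code for internal codons. Table 11 differs mainly in its set of permitted start codons, which does not matter here because the gene begins with ATG. The `translate` helper is illustrative and not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in TCAG order (used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence.
print(translate("ATGACGCGAA GGCGGTGCGG TGTCTGGTCC"))  # MTRRRCGVWS
```

The output matches the first ten residues of the protein sequence. The final codons of the gene, `CGC ATC GGC TCA TGA`, translate to `RIGS` followed by a stop, which matches the end of the protein.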