Gene Rru_A3569 details

Gene Information

Locus tag: Rru_A3569
Symbol:
ID: 3837025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 4114465
End bp: 4115454
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 72%
IMG OID: 637827693
Product: O-sialoglycoprotein endopeptidase
Protein accession: YP_428650
Protein GI: 83594898
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
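
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4114465
END_BP = 4115454
GENE_LENGTH_BP = 990
PROTEIN_LENGTH_AA = 329


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)               # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)  # True
```

Under those assumptions, 4115454 - 4114465 + 1 = 990 bp, and 990 / 3 - 1 = 329 aa, which agrees with the listed lengths.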


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGAAG CACTCCTCTC CCAGGTTGAG GAGCACCGCC CCTATGGCGG GGTCGTGCCC 
GAAATCGCCG CGCGCTCACA CCTTGACCAC GTGGACTCCC TGGTGATCCG CGCCCTTGGC
GAGGCCGGGC TGACGGTTCA CGACATCGAC GCCGTCGCCG CCACCGGCGG ACCCGGGCTG
ATCGGCGGGG TGATCGTCGG CGTGATGACG GCCAAGGCCA TCGCCCAGGT GGCGGGCAAG
CCGTTCATCG CCGTCAACCA TCTGGAAGGC CACGCCCTGA CCGTGCGGAT GACCGCCGGC
ATCGATTTCC CCTATCTGCT GCTTCTGGCC TCGGGCGGGC ATTGCCAGCT TCTGGCGGTC
GAGGGGGTGG GGCGGGCCAA GCGCCTGGGC ACCACCATCG ACGACGCCGC CGGCGAGGCC
TTCGACAAGG TGGCCAAGAT GCTGGGCCTG GGTTATCCGG GCGGACCGGC GGTGGAGCGG
GCGGCCCGGC GCGGCGATCC CCGGCGCTTT CGCCTGCCGC GCCCCCTGCT CGACCGCCCG
GGCTGCGACC TGTCTTTCTC GGGGCTGAAG ACCGCCGTGC GCCAGACCGT GGAAAAGCTG
CCGCCCGGGC CGTTGAGCGA GGGCGATATC GCCGATCTCT GCGCCAGTTT CCAGGCCGCC
GTCGCCGACT GTCTGGCCGA CCGCTGCCGG GTGGCCGCCG GGATCTTCAG CGCCCGCCAT
GGCCGTGGCC GGCCGCTGGT GGTGGCCGGC GGCGTGGCGG CCAACGCCAG CTTGCGCGCC
GCCCTGACCG AGGTCGCCCG CCAAGCCGAT ATGACCTTCG TCGCCCCGCC CTTGGCGCTG
TGCACCGACA ACGCGGCGAT GATCGCCTGG GTCGGCGTCG AGCGCCTGCG CCTGGGGCTG
GTCGACACCA TGGACTTCAA ACCCCGCCCG CGCTGGCCGC TCGACCCCGA CGCGCCCAAG
GCGGCCGGAG CCGGAGGCGT AAAAGCTTAA
 
Protein sequence
MAEALLSQVE EHRPYGGVVP EIAARSHLDH VDSLVIRALG EAGLTVHDID AVAATGGPGL 
IGGVIVGVMT AKAIAQVAGK PFIAVNHLEG HALTVRMTAG IDFPYLLLLA SGGHCQLLAV
EGVGRAKRLG TTIDDAAGEA FDKVAKMLGL GYPGGPAVER AARRGDPRRF RLPRPLLDRP
GCDLSFSGLK TAVRQTVEKL PPGPLSEGDI ADLCASFQAA VADCLADRCR VAAGIFSARH
GRGRPLVVAG GVAANASLRA ALTEVARQAD MTFVAPPLAL CTDNAAMIAW VGVERLRLGL
VDTMDFKPRP RWPLDPDAPK AAGAGGVKA
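
The gene and protein sequences above can be related to each other, and to the reported GC content, with a short script. This is a minimal sketch, not the database's own method. It builds the codon table by hand rather than relying on a library, and it uses the standard amino acid assignments, which to my knowledge are shared by NCBI translation table 11 (table 11 differs in the start codons it permits). The helper names `clean`, `gc_fraction`, and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGGCCGAAG CACTCCTCTC CCAGGTTGAG"
print(translate(first_block))  # MAEALLSQVE
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed 329 aa protein and the reported 72% GC content.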