Gene RSc3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc3101 
Symbol 
ID1221965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp3338217 
End bp3340211 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content67% 
IMG OID637239519 
Productputative serine protease protein 
Protein accessionNP_521222 
Protein GI17547820 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.672072 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGT CCGAGGTCCT GCCGTCGAAG CCGGCCGGCT CGGCGGCGCC GAAGGTCGAC 
GGCATCCCGT CGGGCCTGTC GCCGAAGGCG CGGGCGGAGC ACAACGCCGC CAAGCTCGAC
GGCGCGTTGA TTCGCCTGCG GCGCAATACC GCGCCGCGCG CGACGGCCAC CCCCAACAGC
ATTCTCGAGA GTTCGGCGGT GGTGGCCCGC TCCGGCAACT GCGTCGACGT CGACGTCATG
ACGCAGGGCG ATGCCCAGCA GGCTGTCGCC CAGCTGGAGC GGCTGGGATT CCAGACGGCT
GCGGTGTATC GGAACGTCAT CAGCGGTTGC CTGCCCGTCG CGAAGATCGA CGAGGCCGCG
GCCATTCAGC AGATCCAGCG CATCAGCAAG GTGAGCCGGT CCACGCGCTC CGGCGTGGTC
CAGGGCCAGG GCGACTACGC CCAGCTGAGC CGCTCGCTGC GGCAAGCAGT GAAGGGCCTG
GGCATCGAGC TGACCGGCAA GGGCATTACC GTCGGCCTGA TCTCGGATTC GTTCAACTGC
AATAGCCAGC TGAACCAGGA TGCACGCTAC GTTGCACAGA ACGGCCGCCA GGACACCATG
GAGGACGATA TCGCACGCGG TGAGCTGCCG GGCAACGGCC GCATCCGGAT CGTGAAGGAG
CTGCGCGATT GCACCGACGG CACGGACGAA GGCCGTGCCA TGGCCGAGAT CATCCACGAC
GTGGCGCCGG GCGCGGACAT CGTCTTCTAT TCCGGGGGCG CGGGGATGGC CGATTTCGCG
CAAGGCATCG AGACGCTGGC CTTGCCGAAG AACAAGAGCA ACGCCCAAGG CGTTGCCGGC
GGCGGCGCGC AGGTCATCGT TGACGACCTC CAATATTCCT ATGAACCCGC TTTCCAGTCC
GGGATTGTCG GCGCGGCGAT CGACAACGTG GTCAAGAACC ACGGGATCGC GTACTTCACC
GCCGGCGGCA ACGATGGCGT AGGAGCGTCC CCTGTCTCGT ACATCAACAA CAACGCGCGC
TTTGCCGACC AGCCGATCGA TCCGAACGGC ACCGGCACGC CCGGCCGTCC GCTGAACTTC
GACCCCTCGG GCGCCAGCCA GGTGTTCTCC CTCCCGGTGC GGGCGACGCG CCAGATCGTC
GGCTTCTACC GGTTCAGCCT CCAGCTTTAC TGGGACCAGC CGTTCGACAA CAGCACCAGC
TCGCTCCAGG TGTGCCTGGC CGACAAGAAC GGCAAGCCGT TCAACGTCAT GCTCGACGGC
GAGCCATACC CCAGCTGCAC CGATGCCTCG GTGATCGGCC AGCAGGCCAT TGCATGGGGG
ACGCTGCTGG GAACCGAGCC GGAAGCCACC CTCCAGGTCT TCCTGGTGGA TGGCACGGCG
CCGCGGCGTG TCCGTCTGCA GACCAGCCGG ATCGTCATCG GCCAGTTCGG CACAGCGGAT
GCGGCCCTGT TCGGCCATGT CCTTTCGCCC AATGCGATCG CGACCGGCGC AGCCAATTAC
CTCGCGACGC CGATGTGCGA TCCGTCGCTC AAGACCGCGC AGCTGGAGCG CTTCTCGTCG
CACGGCGGCG GCCTGATGCT GTTCGACAAT GACGGCCGCG CGCTGCCGCG CCCCGTTCTC
GACGGCAAGC CGGACCTGGT CGGGCCGGAC GGTGCGAGCT CGGTGTTTTT CGGCATCCAG
GCCAAGGATG GCGACCGTGG CTTCGGCGTC TACAACCTGA ACTGCCGCTA CTACCCTGCC
TATCCGTATC AGTTCTACGG CACTTCGGCC GCGGCGCCGC ACGTCGCCGG CGTGGCGGCA
TTGATGCGCC AGGCGGTACC CAAGGCTACG CCGGAGCAGA TCTACAGCGC CCTGCGCAAG
ACGGCCGTGG ATATGGATGC GCCCGGCCAT GACAACGCCA CCGGCGCCGG CTTCGTCCAG
CCCGAGCGCG CGCTGCGCGA ACTGATCTGG CAGGCGCTGA ACCAGTATCG CTTCGCCAAT
CCGCCGGGAC GATGA
 
Protein sequence
MAQSEVLPSK PAGSAAPKVD GIPSGLSPKA RAEHNAAKLD GALIRLRRNT APRATATPNS 
ILESSAVVAR SGNCVDVDVM TQGDAQQAVA QLERLGFQTA AVYRNVISGC LPVAKIDEAA
AIQQIQRISK VSRSTRSGVV QGQGDYAQLS RSLRQAVKGL GIELTGKGIT VGLISDSFNC
NSQLNQDARY VAQNGRQDTM EDDIARGELP GNGRIRIVKE LRDCTDGTDE GRAMAEIIHD
VAPGADIVFY SGGAGMADFA QGIETLALPK NKSNAQGVAG GGAQVIVDDL QYSYEPAFQS
GIVGAAIDNV VKNHGIAYFT AGGNDGVGAS PVSYINNNAR FADQPIDPNG TGTPGRPLNF
DPSGASQVFS LPVRATRQIV GFYRFSLQLY WDQPFDNSTS SLQVCLADKN GKPFNVMLDG
EPYPSCTDAS VIGQQAIAWG TLLGTEPEAT LQVFLVDGTA PRRVRLQTSR IVIGQFGTAD
AALFGHVLSP NAIATGAANY LATPMCDPSL KTAQLERFSS HGGGLMLFDN DGRALPRPVL
DGKPDLVGPD GASSVFFGIQ AKDGDRGFGV YNLNCRYYPA YPYQFYGTSA AAPHVAGVAA
LMRQAVPKAT PEQIYSALRK TAVDMDAPGH DNATGAGFVQ PERALRELIW QALNQYRFAN
PPGR