Gene RoseRS_2451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_2451 
Symbol 
ID5209420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3028958 
End bp3032566 
Gene Length3609 bp 
Protein Length1202 aa 
Translation table11 
GC content62% 
IMG OID640596056 
Productpeptidase C1A, papain 
Protein accessionYP_001276778 
Protein GI148656573 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4870] Cysteine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.692064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGTCG TCGCTCACTT CCGCAGACTG GCAGGCATGC TCACCTTCCT GAGCATCTCC 
CTCGCGCTAC TCGTTCTTCC ATCGCTGGTG CAATCATCGT CGCACGATGT CATCGTCACG
CTGGATCATG ATGGAACCAG GATTACCCTG ACCGATGGGC AACGCCTGAT CGTCAAACTG
GAGGGACATC CCGCAACCGG CTACGTTTGG GATAGTGAAC AATCGTCGTC GTTATTGCAA
CCGCTGGGCG ACCCGGTATT CGAGGCGCCG TCTCCGACGG GAGGGGAAAC GGGCGCACCG
GCATTGCAGG TGTTGACGTT CCTGCCGGTA CGCGCCGGGG AAGAGACGCT CACCCTGGTC
TACCGCCGAC CGTGGGATCG CGCTGTTCAG CGCACCTTCA GCATTCGGGT CGAAACCCTC
GGTCGCTTCA CCGTCCTGTC GCCTGCGGCG CAACCGTCAT CCGGTTCCAC ACCTCCACCG
GTGGTGATGG GGGCACAGGA GGGTTTGCCC GCAGCCTTCA ACTGGTGTGA TCAGGGCGCC
TGCACGCCGG TGAAAGATCA GGGGGTATGC GGATCCTGCT GGGCATTCGC CACGACGGGG
GTGGTCGAGT CCGCGCTCAA GCGCATCGAT GGCGTCGAAC GCGATCTGTC GGAGCAGTAC
CTGATCTCGG CGGGAACACA CGGTACGTGT AATGGAGGGG GACCGGCATA TGACCTGTTC
ATCGGTGATC TTCCAGCGCA TCAAACAGAG GCGGGCGCCG TCTATGAAAG CGACCTTCCG
TACCTTGGTC AGGATGTGCC GCTCACCCGC GCACTGCCCC ACCACGAACG TCTCCTGGCG
TGGAACCAGG TCTTCAATGC CGACATCGCT ACCATCAAGC GCATCATCTA TGAGCACGGT
CCGGTGTCCG CCTATGTCTG CGCGGGTTCG CGCTTTATGT GGTACCGATC AGGAGTCTTC
GAGACCGATG AGTCAGCCGC ATGCAACGGA GGCATCAACC ATGCAGTGGT GCTGGTGGGA
TGGGACGATA GCAGAGGCAG TCGGGGCGCG TGGCGTCTGC GCAACTCGTG GGGCAGCATG
TGGGGTGAAG GCGGATATAT GTGGATCGGC TACGGGGTGT CGGGTATCGG TCGACGGATC
GACTATGCGT ACTATGATCG CCTGGCGCCG GGCACGTCCG CGATTTCGGG ACAGGTAACG
TCGCTGGGGA GTGGCATAGC GAATGTTGTG GTGTCTGATG GCGTCCGGAG TGCATTCACC
GATCAGTATG GCATGTACGT TGTGAAGCAT GTTCCTCCGG GCACATATAC CCTCACTCCA
TCGCGCAGCA GTCTCGTCTT TTCTCCATCC AGCCGCACGG TGACTATCAA CGCGGGCAGA
AATCTGAATC GGCAGGACTT TGCGATCCTT CCAACCTACA CCGTCAGCGG GCAGGTGACC
GATGGCGCAG GGAACGGCAT CGCAGGCGTG ACCATCTCTG ATGGAACGCG CAGCGCCACA
ACTGATGCCC AGGGGCGCTA TGCTCTGACG AATGTGCCGC AAGGCGGGTA TTGGCTCACT
CCATCCCACA ATACCTATGT GTTCAATCCG ACTCAACGCT GGATCACGGT TAATGGCGAT
CTCAACGGGC AGGATTTTGT CGCAACCTGT CTCTCCTGCA CCATCAGCGG GCGGGTGACC
GATGGCGCCG GAAACGGCAT CGCAGGCGTT ACCATCTCTA ATGGGACGCG CAGCGCCACG
ACCGATGCCC AGGGGCGCTA TGCCTTGAAC GTACCGCCGG GCGAGTATTG GCTCGTGCCG
TCTCGCAACG GCTACACGTT CAATCCAGAG CGGCGCCGGA TTACAGTCAA CCGCCATCTG
AGCGGGCAGG ATTTCACGGC GACCCTTGCG ACCTATGTCA TTCGCGGGCG GGTGACCGAT
AGCGCAGGGA ACGGCATCGC AGGCGTGACC ATCTCCGATG GAACGCGCAG CGCCACGACC
GATGCCCAGG GGCGCTATGC TCTGACGAAT GTGCCGCAAG GCGGGTATTG GCTCACTCCA
TCCCACAATA CCTATGTGTT CAATCCGACT CAACGCTGGA TCACGGTTAA TGGCGATCTC
AACGGACAGG ATTTCACGGC GACCCTCGTG ACCTATGTCA TTCGCGGGCG GGTGACCGAT
AGCACAGGGA ACGGCATCGC AGGCGTGACC ATCTCTGATG GAACGCGCAG CGCCACAACT
GATGCCCAGG GGCGCTATGC TCTGACGAAT GTGCCGCAAG GCGGGTATTG GCTCACTCCA
TCCCACAATA CCTATGTGTT CAATCCGACT CAACGCTGGA TCACGGTTAA TGGCGATCTC
AACGGACAGG ATTTCACGGC GACCCTCGTG ACCTATGTCA TTCGCGGGCG GGTGACCGAT
AGCACAGGGA ACGGCATCGC AGGCGTGACC ATCTCTGATG GAACGCGCAG CGCCACAACT
GATGCCCAGG GGCGCTATGC TCTGACGAAT GTGCCGCAAG GCGGGTATTG GCTCACTCCA
TCCCACAATA CCTATGTGTT CAATCCGACT CAACGCTGGA TCACGGTTAA TGGCGATCTC
AACGGGCAGG ATTTTGTCGC AACCTGTCTC TCCTGCACCA TCAGCGGGCG GGTGACCGAT
AGCGCCGGAA ACGGCATCGC AGGCGTTACC ATCTCTAATG GGACGCGCAG CGCCACGACC
GATGCCCAGG GGCGCTATGC CTTGAACGTA CCGCCGGGCG AGTATTGGCT CGTGCCGTCT
CGCAACGGCT ACACGTTCAA TCCAGAGCGG CGCCGGATTA CAGTCAACCG CCATCTGAGC
GGGCAGGATT TCACGGCGAC CCTTGCGACC TATGTCATTC GCGGGCGGGT GACCGATAGC
GCCGGAAACG GCATCGCAGG CGTGACCATC TCCGATGGAA CGCGCAGCGC CACGACCGAT
GCGCAGGGCT TCTACGCGCT GAGCGGCGTC CCGGCGGGCG CATACACGCT CACTCCATCC
CGCGACGGGT ACGCTTTCGC GCCCGCCTCG CGCACCGTGA CGGTCACCGG CGAAGTGAGC
GGGCAGGATT TCACGGCGAC CCTCGTGACC TACGCGATTC GCGGGCGGGT AACCGACGGC
GCGGGGAACG GCGTGGCAGG GGTGACCATC TCTGACGGCA CGCGCAGCGC CACGACCGAT
GCGCAGGGCT TCTACGCGCT GAGCGGCGTC CCAGCGGGCG CATACACGCT CACTCCATCC
CGCGACGGGT ACGCCTTTGC GCCCGCCTCG CGCACTGTGA CGGTCACCGG CGACCTGAGC
GGGCAGGATT TCACGGCGAC CCTCGTGACC TACGCGATTC GCGGGCGGGT AACCGACGGC
GCGGGGAACG GCGTGGCAGG GGTGACCATC TCTGACGGCA CGCGCAGCGC CACGACCGAT
GCGCAGGGCT TCTACGCGCT GAGCGGCGTC CCGGCGGGCG CATACACGCT TACCCCTTCG
CTTGACGGGT ACGCCTTCGC GCCCGCCTCG CGCACCGTGA CGGTCGCTGG CGATCTGAGC
GGGCAGGATT TCACGGTCTC TTCTTCAGCC GGGCAGTACC GGGTATTCCT GCCGCTGACC
GTTCGCTAG
 
Protein sequence
MLVVAHFRRL AGMLTFLSIS LALLVLPSLV QSSSHDVIVT LDHDGTRITL TDGQRLIVKL 
EGHPATGYVW DSEQSSSLLQ PLGDPVFEAP SPTGGETGAP ALQVLTFLPV RAGEETLTLV
YRRPWDRAVQ RTFSIRVETL GRFTVLSPAA QPSSGSTPPP VVMGAQEGLP AAFNWCDQGA
CTPVKDQGVC GSCWAFATTG VVESALKRID GVERDLSEQY LISAGTHGTC NGGGPAYDLF
IGDLPAHQTE AGAVYESDLP YLGQDVPLTR ALPHHERLLA WNQVFNADIA TIKRIIYEHG
PVSAYVCAGS RFMWYRSGVF ETDESAACNG GINHAVVLVG WDDSRGSRGA WRLRNSWGSM
WGEGGYMWIG YGVSGIGRRI DYAYYDRLAP GTSAISGQVT SLGSGIANVV VSDGVRSAFT
DQYGMYVVKH VPPGTYTLTP SRSSLVFSPS SRTVTINAGR NLNRQDFAIL PTYTVSGQVT
DGAGNGIAGV TISDGTRSAT TDAQGRYALT NVPQGGYWLT PSHNTYVFNP TQRWITVNGD
LNGQDFVATC LSCTISGRVT DGAGNGIAGV TISNGTRSAT TDAQGRYALN VPPGEYWLVP
SRNGYTFNPE RRRITVNRHL SGQDFTATLA TYVIRGRVTD SAGNGIAGVT ISDGTRSATT
DAQGRYALTN VPQGGYWLTP SHNTYVFNPT QRWITVNGDL NGQDFTATLV TYVIRGRVTD
STGNGIAGVT ISDGTRSATT DAQGRYALTN VPQGGYWLTP SHNTYVFNPT QRWITVNGDL
NGQDFTATLV TYVIRGRVTD STGNGIAGVT ISDGTRSATT DAQGRYALTN VPQGGYWLTP
SHNTYVFNPT QRWITVNGDL NGQDFVATCL SCTISGRVTD SAGNGIAGVT ISNGTRSATT
DAQGRYALNV PPGEYWLVPS RNGYTFNPER RRITVNRHLS GQDFTATLAT YVIRGRVTDS
AGNGIAGVTI SDGTRSATTD AQGFYALSGV PAGAYTLTPS RDGYAFAPAS RTVTVTGEVS
GQDFTATLVT YAIRGRVTDG AGNGVAGVTI SDGTRSATTD AQGFYALSGV PAGAYTLTPS
RDGYAFAPAS RTVTVTGDLS GQDFTATLVT YAIRGRVTDG AGNGVAGVTI SDGTRSATTD
AQGFYALSGV PAGAYTLTPS LDGYAFAPAS RTVTVAGDLS GQDFTVSSSA GQYRVFLPLT
VR