Gene Rcas_2326 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2326 
Symbol 
ID5539807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2997320 
End bp2999758 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content63% 
IMG OID640894459 
Productpeptidase C1A papain 
Protein accessionYP_001432427 
Protein GI156742298 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4870] Cysteine protease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.706763 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.190341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCATCG TCGCACTCTT CCGCAGACTG GCCAGCGCCT GCACATTCCT GGTCATTATC 
GCCGCCGGGG TGCTCACCCT TCCATCGGCG GTTCGATCAT CACCGAATGA CATTATTGTC
ACCCTGGCGC ACCACGGAAC CAGCGTCGCC GTCGCCGAGG GGCAGCGCCT GATCGTCAAA
CTCGAAGGGC AACCTGGAAC CGGCTACGGC TGGGAACTGA AAGAACCTTC ATCGTTGTTA
CAGACGAGCG ATCCGGTGTT CGAGCCACTG TCCGCTTCGT CCGCCGCCGC CACGAGCGCG
CCGATGGTTC AGGCATTCAG ATTCACGCCG GTGCGCGCCG GCAGCGAAAC CCTCACCCTC
GTCTACCGTC GTCCGTGGGA GCGCCAAGCC GCTCCGCTGC GCACCTTCAC CATCCAGGTC
GAGACCACTG GACGATTCGC TGCTCCGTCG CCATCCGCGC AGACTGCCGC CCTGCCGCAG
TCGCTTCCGC CAGTAGCGAT GGGATCAGAC GAAGGGTTGC CGGCGGCATT CAACTGGTGC
GAGCAGGGAA TCTGCACTCC TGTGAAGGAT CAGGGCGGAT GCGGCTCTTG CTGGGCATTC
GCCACTGCTG GCGTCGTCGA GTCGGCGATC AAGCGCATCG ATGGAGTCGA GCGCGACCTC
TCCGAACAGT ACCTGCTGTC GGCAGGGACC CACGGCGGCT GCGACGGCGG CATGCCCGCC
TATGATCTGT TCATCGGTGC GACTCCCGCG CATCAGACTG AAGCAGGCGT CGTGTACGAG
AGCGACCTGC CGTATACCGG GCAGGATGCG CCGCTTACCC GCGCATTGCC GCATCACGAG
CGCATGCTGG CGTGGAACTC GCTGTTCAAT GCCGATGTGG CGACGATCAA ACGCATCATC
CGCGAGCATG GTCCGGTATC CGCGTATGTG TGCGCCGGGT CGCGTTTTAT GTGGCACCGA
TCCGGCGTGT TCGAGACCGA CGAATCCGCC GCGTGCAACG GCAGCATCAA CCATGCCGTC
GTGCTGGTGG GTTGGGACGA CAGCAAAGGA ACCAGGGGGG CGTGGCGACT GCGCAACTCG
TGGGGGGCGA CATGGGGCGA GAACGGATAT ATGTGGATTG GTTACGGCAT ATCGGGTCTT
GGTCAGTGGA TCGATTATGT ATACTACGAC CGACTGGAAC CAGGCGCATA TGCTATCTCC
GGTCGTGTGC GGGAGCAGTG GCATGGAACC GCTGGCGTCA GCGTCTCGGA CGGTTCACGC
AGCGCCATTA CCGATCAGTA TGGCGTGTAT GTATTGAAGA ACGTATCGCC GGGAACGTAC
ACCCTCACTC CATCGCGCAG CAACGCCGCC TTCTCGCCCG CAACACGAAC CGTGACAGTT
GGGAACGGGA AAAATGCCGG CAATCAGAAC TTTGCGCTGC TGGCAACCTA CCGCATCAGC
GGACGAGTAA CCGGCAGTTT CGGCGAAGGT CTGCCCGGCG CGCGCGTCTC GGACGGAACG
CGCAGCGCCG TCACTGATGC GAATGGAAAC TACGTGATCG AAGGCGTGCT ATCGGGCGCA
TATGCTCTCA CTGCTTCACT GAGCGGCTAC ACCTTTACCC CCAACCCGCT CTGGGTGGTT
GTCAACAGCG ATGTCGGCGG ACAGGATTTC ACGGTGGTGT GCTCCTCCTG CACCATCAGC
GGCAGGGTGG TCGATAGCGC GGGCAATGGC GTGGCAGGGG CGACCATCTC CGACGGGATG
CGCAGCGCCA CAACGAACGC GCAGGGGTTC TATACCCTGA TCAGCGTTCC ACCGGGAACA
TACACCCTGA CCCCATCGCA CAGCGACTAC ATCTTCACGC CATCGGCGCG ATCAATCACG
GTTAACCGCC ATCTGAGCGA CCAAAACTTC ACCGCGATAT GCGCTTCCTG CTCCATCAAC
GGGCAGGTGA CCGACAACGC AGGGAACGGT GTCGCCGGAG TAACCATCTC CGACGGCGCG
CGCAGTGTGA TGACCGATGC TCAGGGACGC TATGCGCTGA CGAATGTGCC GCCAGGCGTC
GCCACACTCG TTGCTACCCG CAACGGCTAT GCCTTTAGCC CGTCAAGCCG CTCGCTCACG
GTCGACCGCC ATCTGAGCGG GCAGGATTTC ACGGCGATCC CTGCACCCTA CACCGTGAGT
GGACGGATCA CCGACAGCGC CGGCAATGGC ATCGGCGGCG TGACCGTCTC CGATGGCGCG
CGCAGCGTTG TCACCGATGG GAGCGGCGTC TTTACCCTGC GCAACATTCC AGCCGGGACG
TACACGCTCA CGCCGTCGCA TGGCAGCGAC ACATTTACAC CCGCAAACCG CACAGTTACC
GTCACCGGCG ACATAAGCGG ACAAGATTTC GCGCTTGCGT CGCCGGCGCC CGCAGCGCCT
GCGGACTACA CCGTCTTTCT GCCGCTGACC GTTCGCTAA
 
Protein sequence
MLIVALFRRL ASACTFLVII AAGVLTLPSA VRSSPNDIIV TLAHHGTSVA VAEGQRLIVK 
LEGQPGTGYG WELKEPSSLL QTSDPVFEPL SASSAAATSA PMVQAFRFTP VRAGSETLTL
VYRRPWERQA APLRTFTIQV ETTGRFAAPS PSAQTAALPQ SLPPVAMGSD EGLPAAFNWC
EQGICTPVKD QGGCGSCWAF ATAGVVESAI KRIDGVERDL SEQYLLSAGT HGGCDGGMPA
YDLFIGATPA HQTEAGVVYE SDLPYTGQDA PLTRALPHHE RMLAWNSLFN ADVATIKRII
REHGPVSAYV CAGSRFMWHR SGVFETDESA ACNGSINHAV VLVGWDDSKG TRGAWRLRNS
WGATWGENGY MWIGYGISGL GQWIDYVYYD RLEPGAYAIS GRVREQWHGT AGVSVSDGSR
SAITDQYGVY VLKNVSPGTY TLTPSRSNAA FSPATRTVTV GNGKNAGNQN FALLATYRIS
GRVTGSFGEG LPGARVSDGT RSAVTDANGN YVIEGVLSGA YALTASLSGY TFTPNPLWVV
VNSDVGGQDF TVVCSSCTIS GRVVDSAGNG VAGATISDGM RSATTNAQGF YTLISVPPGT
YTLTPSHSDY IFTPSARSIT VNRHLSDQNF TAICASCSIN GQVTDNAGNG VAGVTISDGA
RSVMTDAQGR YALTNVPPGV ATLVATRNGY AFSPSSRSLT VDRHLSGQDF TAIPAPYTVS
GRITDSAGNG IGGVTVSDGA RSVVTDGSGV FTLRNIPAGT YTLTPSHGSD TFTPANRTVT
VTGDISGQDF ALASPAPAAP ADYTVFLPLT VR