Gene Rcas_2773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2773 
Symbol 
ID5540259 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3585820 
End bp3588813 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content62% 
IMG OID640894899 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001432862 
Protein GI156742733 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0362893 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGTC GCGCTGCCAG TCTACGCCTG TTCATCGCTG CGTTTCTGTT GATTGCAGCG 
CTCATCGTTC CGACTGACAG ATTCACCATT CAGGCAACAG GCATTCTCAC AGCCTCTCCG
ATCACGATCA GCGAAACGTT GCCATTGGGT CAACAGGTCA CGCGACCACT GACGATCACC
AACCTCGGTT CGACGACTGT CACAGCGTTG CTCTATGAAG CGCGCGCGCA GCCGTTGTTG
GCGATGGCGC ATGCAATCGG TCCCGCAAGC GTCCCTCTGC CGCAGCAAGA TCACACGCTC
GATCCGCGTC TGGCGGCGCA ACTCGACGAA CCCGCAGCGC AGGGTTCGTT CATTATCTAT
TTGCGCGATC AGGCGGACCT GAGCAGCGCA TATGGCATCA CCGACTGGTC AGAACGTGGG
CGCTTCGTCT ACCGAACGCT GGTAGAACAC GCTGAACGCA CACAACGCAC CCTGCGCGCC
GAGTTGACGG CGCGCGGTCT GACCTATCGA CCGTTCTGGG TCGTCAACGC CATTCAGGTG
GAAGGTGCGC TCGCCGATGC GCAGGCGCTG GAGCAGCGCG CTGACGTTGC TCTGGTGCGC
GCCGACGCAA GTATCATGGT CGCGCTGCAA ACGCTGCCGT CCAGCCTCGA TACGCGCTGT
AGCGCAGACG GCAATCCGAT GTGCTGGAAT ATTCGCGCCA TCCGCGCAGA TCGTGTGTGG
AACGAGTTTG GCATCACCGG TCAGGGCGTG ACGGTCGCCT CCATCGATAC CGGCGGGCTT
TTCAGTCATC CGGCGCTGCG CGATCAGTAT CGCGGTGCGC TCGGCAACGG CGCATACGAC
CACAACTACA ACTGGTACGA TCCGCAAGGG GCATTCCCTG CGCCGAACGA TCAGAGCGGT
CATGGAACGC ATACCATCGG TATCATGGTC GGCAGGCGCA TCGGAGGCGA GCGGTTTGGC
GTTGCTCCCG GCGCGCGCTG GATTGCTGCG CAGGGGTGTG AAGGATCATT CTGCAACGAA
AGTGACCTGA TCGCCGCCGC GCAGTGGGTC CTGGCGCCGA CTGACCTCCA CGACCGCAAT
CCGCGCCCCG ATCTGCGCCC GTTGATCGTC AACAACTCGT GGGCGGGCGG CGGCAACGAC
CCGTGGTATG CCGGATATAC CGCCGCCTGG CGCGCGGCCG GCATCTTTCC CGTTTTTGCG
GCAGGCAACG GCATGGGCGT CTGTCGCTCA ATCGCATCCC CCGGCGACTA TGCCGATGTC
GTCGCTGTTG GCGCCACCGA CCGCAACGAC GCCATCGCCC CGTTCAGCCT GCGTGGTCCG
ACTGCCGATG GTCGCATGAA GCCGGACTTC GTCGCTCCGG GGGAGGGCGG CATCTACTCA
ACACACCTCA GTGACGGGTA TGCCACTCTG CGCGGCACAT CGATGGCAGC CCCCCACGTT
GCCGGGGTTG TGGCACTGCT CTATTCGGCA AACCCGGCTC TGATCGGCGA TTTTGAGTCG
ACCTACGCCA TCCTGCGCGA CACAGCGCGC AGAATAGCCG ATGAACAGTG TGGCGTCGTA
TCCGGCGGCG GCAATCATGT GTATGGTTGG GGCTTGATCG ACGCCCATGC GGCGGTTGCG
CGAGCGCGCG TCGATGTGCC GTGGCTGCGC CTTTCACCGA CGACGGTAAC GCTCAATCCT
GGTCAGAATG CAACGCTGGA CGTGACGTTC GACTCCAACG GCGTTGCTGC GCCGGGGACC
TATACTGCGC GCATTCAGAT ATATGCCGGT GATCTGACCC AACCGCCAGC GACCGTTGAG
GTGACCATGA ATGTGATCGC TTCGGGAACC ATCGTCGGCG GCATTGTGCG CGATGCCGAG
ACCGGCGAAG CGCTGCGCGC GACAGTCAGC GTCAGCGGCG GCGCCAGCAC GCCTACTTCC
AATGACGGAT CATACGCTCT CATCCTGCCG TCGGGCGTTT ACACGTTGAC GGCGTCCGCA
CTCTCGTATG CGCCGCAGCA GCGTGTGATT ACTGTACCGG TGAGCGGATC GGTCGATTTT
GGATTATTGC TCGATGCGCC GCATTTGACC CTTTCGACCG ACCATGTGAC GGCTACGCTC
GATTTCAACA CCACCGTCGA GCAAACCGTA ACGATAACCA ATACCGGTAC CCGTCCTTTG
ACTTTTGAAG CCAGCGTCGG ATATGCGCCA TTCGGGGTCT ATCGCAGTGA TGAGCCAGGC
GGACCGGTCT ATCAGTGGAT CGACCTGCCC GTTGATGCGC CGACGCTCGA ATTGACCGAT
ACAACCCGGA TCGACAACAT CCCCCTGGGC TTCGACTTTC CACTCTACAC CCTCACCGTC
ACTGAAACGT CGGTCACATC GGATGGGACG CTTTCGTTTG GTTGGCCCTC CTCATATACC
GGTCTGGTCG AACGTTGTTT GCCGGGGAGC GAAGCATTCT TCTACCTGCT GGCGCCCTTC
CGCGCCGACC TTGATCCCGC GCGTGGCGGG CAGGTGCGGT ACGGAACCGT CAACAGCAAC
GCAACATTCG TCGTCAGTTT CGAGGATGTG CCATTGGCGC AGGGTCCGCC GGATCAGAGA
TACACGTTTC AGGCGCTTCT CCATCGTGAT GGACGGATTG TGTTTCAGTA CGCCGACCTC
AGCGCGCTCC CGGAGCGCTT GAGCGTTGGC GTTCAGAAGA CCATGAATCA GGTTCAGCGG
ATCGGATGTG GCGCCGATAC CCCTGTCACA CCAGGGCTTG CCATCGAGTT CCGACCGCAG
TTCAGCCCGG AGGGGTGGCT GGAAGTGGCG CCGGATCGGG GAACCGTAGC GCCGGGCGAC
AGCGCCACGC TCAGGCTCGC CTATCGCTGG CAGGGTCCAC CGCAGGGCGC GCGCCTGCGC
ACCACGGTCA CAGTGATCAG CAGTGATCCT CGCCGCAGAA ACGCCACAAT TATGGCGGAG
GCGGCCATGC GTCCGGCGCC GTATGCCGTC TGGTTGGGGA TCGTGGCTCG GTAA
 
Protein sequence
MSGRAASLRL FIAAFLLIAA LIVPTDRFTI QATGILTASP ITISETLPLG QQVTRPLTIT 
NLGSTTVTAL LYEARAQPLL AMAHAIGPAS VPLPQQDHTL DPRLAAQLDE PAAQGSFIIY
LRDQADLSSA YGITDWSERG RFVYRTLVEH AERTQRTLRA ELTARGLTYR PFWVVNAIQV
EGALADAQAL EQRADVALVR ADASIMVALQ TLPSSLDTRC SADGNPMCWN IRAIRADRVW
NEFGITGQGV TVASIDTGGL FSHPALRDQY RGALGNGAYD HNYNWYDPQG AFPAPNDQSG
HGTHTIGIMV GRRIGGERFG VAPGARWIAA QGCEGSFCNE SDLIAAAQWV LAPTDLHDRN
PRPDLRPLIV NNSWAGGGND PWYAGYTAAW RAAGIFPVFA AGNGMGVCRS IASPGDYADV
VAVGATDRND AIAPFSLRGP TADGRMKPDF VAPGEGGIYS THLSDGYATL RGTSMAAPHV
AGVVALLYSA NPALIGDFES TYAILRDTAR RIADEQCGVV SGGGNHVYGW GLIDAHAAVA
RARVDVPWLR LSPTTVTLNP GQNATLDVTF DSNGVAAPGT YTARIQIYAG DLTQPPATVE
VTMNVIASGT IVGGIVRDAE TGEALRATVS VSGGASTPTS NDGSYALILP SGVYTLTASA
LSYAPQQRVI TVPVSGSVDF GLLLDAPHLT LSTDHVTATL DFNTTVEQTV TITNTGTRPL
TFEASVGYAP FGVYRSDEPG GPVYQWIDLP VDAPTLELTD TTRIDNIPLG FDFPLYTLTV
TETSVTSDGT LSFGWPSSYT GLVERCLPGS EAFFYLLAPF RADLDPARGG QVRYGTVNSN
ATFVVSFEDV PLAQGPPDQR YTFQALLHRD GRIVFQYADL SALPERLSVG VQKTMNQVQR
IGCGADTPVT PGLAIEFRPQ FSPEGWLEVA PDRGTVAPGD SATLRLAYRW QGPPQGARLR
TTVTVISSDP RRRNATIMAE AAMRPAPYAV WLGIVAR