Gene Rcas_3153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3153 
Symbol 
ID5540651 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4091527 
End bp4094799 
Gene Length3273 bp 
Protein Length1090 aa 
Translation table11 
GC content59% 
IMG OID640895274 
Productpeptidase S41 
Protein accessionYP_001433225 
Protein GI156743096 
COG category[M] Cell wall/membrane/envelope biogenesis
[S] Function unknown 
COG ID[COG0793] Periplasmic protease
[COG4946] Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGACG ATCGGTATGT TATCCGTCCC TACTTTCGCG CGCCCTCGAT TGATCCCGCC 
GGCAGTCGTA TCGCATTCGT CTATGCAGGC GACATCTGGC TTGTTGATGT CAGCGGCGGG
CAAGCCGAGC GACTGACCGC TCATCCTGCA AATCACATGC TGCCGCGCTG GTCGCCGGAT
GGTTCAGCCA TCGCCTACAC CTCATCTCGC ACCGGACAGG GCGACGTGTA TGTGCTGCCG
CTCAACGGCG GCGAGGTGCG ACGGATCACA TATCACGAGG CGCAGAGCGC CGTCGAGTGC
TGGTCGCCTG ACGGCGCTGC CATCTATTTC ACCTCTCAAC GCGAGCGCCA GGGCGCCGCG
ATCTATCGCA TCGCCGCATC AGGCGGTACG CCGGTTCGCT GGATTGCACA ACCATACGAG
CGCCTCGGCA CAGTCGCGGT TTCACCCTGT GGTCGCTGGC TGGCATTCAG CCTGATGCGC
GATCTGTGGT GGCGACGGGG ACCGAATCCG TATGGCGGCG CAGAACTCTG GCTGGTATCG
AATGCGCCTG ACGCCGATGA TTTCTTTCAA TTGAGCGACG CGCCCGGTAT GAATTACTGT
CCCATGTGGT CGCCCGATAG TCAGTTGATC TTCTTTGTTT CTGATCGCGA TGGAAGCGAA
AATCTGTGGG TGGCGCCGCG AGAGGGAGGC GTCGCAGAGC GCATCACGGC ATTCACCGAC
GGGCGCCTGC TCTGGCCCTC GATCAGCGCC GATGGCAGAA TGATCGCATT CGAGCGTGAT
TTTGGCATCT GGACGCTCGA TCTCGCTTCC GGTGCAACCA CGCGCGTCCC GATTCAGGTG
CGCCAGGATA CCAAAGTGAC GCCGGTGCGG GTGCAGACCT ACACGCGCGA TGCAGGTGAA
CTGGCGCTCT CGCCCGATGG AAAGAAAGTC GCCTTTACGG TGCGCGGGAA GATTTTCGCC
GATTTTGCCG AAAAGGAAAC GGAACGCGAT CAGCGCCACG GGCCGGCCTA TCGCATCTCG
CACACGGCAT TCCGCGATAG CGATATTGCC TGGTCGCCCG ACAGCCGGCG CCTGGTCTAT
GTTTCTGATC GTCACGGCGA CGAAGAAGTG TACTGCTACG ATTTCCTCGC GCGCAGCGAA
ACCCGCCTGA CGTTTGGAGC GAAACGTAAG AGCGCGCCTT GTTTCTCGCC TGATGGGCGC
TGGATTGCAT ATGCAAGCGG TGATGACGAA ATCCGGCTGA TCGAGGTCTC AAGCGGAGAG
GATCGCCCAT TTATTCGGGC ATACTTTGTC TTTGGCGCAT CGTTCGCCTG GTCTCCCGAT
AGTCGCTGGC TGGCATTCTG CGCTCAGGAC GAACGGTTTT TCAGCAATGT GTACGTGCAA
CATATCGAGG AAGAACAGGC GCATCAGATT TCGTTCCTCA GCAATCTCCA GGCGGCAGGA
CCGCTGTGGT CGCCTGATGG ACGCTTTCTT GTCTTCACGA CCGGACAGTA TCGCGCCGAG
TCGCAGATCG CGCGCATCGA TCTGCAACCT CAACCACCGG TGTTCCGTGA GACGGAGTTC
GAGCGGTTGT TTGAATTGCA CAAAAGTGAT TGTCGGGACA AACAGACGGC TGGACCGCCA
TCCACAGTCG GACCGCCCCA TACGCGCAGT GCAGACGCCG ATCGTGAGCC ACAACCGATG
GTTGTTCCTG TCATCGAACC GGATGATGAG TCCGACCGCG CTGCTCGTCA GCATAACGCG
CCATCCAGGT TATCCAGATC CACGTCACCG AGCGAGATAA CAATTGTGTT CAACGGTATC
GAACGCCGGT TGCGCCTGTT GACGCCACCG CAAATGGATG CCATTGCATT GTGTATCAGC
CCGGACGGAC GCGATCTCGT GTGCAGCGCC ACCATTGCGG GGCGGCAGAA TCTCTGGACA
TTGCCGCTTG ATGAGCCGCG CGCCGGTCAG GGACCGCGCC AGTTGACCAG CACTCCGGGC
ACGAAATCGT GTGCCTGCTT CACACCTGAT GGGAAACAGA TCTATTTCCT TGATGGTGGA
ACCATCGCGC TGCGAAAGTT TCCGAAGGGA GAACAGACGA CGTTGCCCGT ATCGGCTGAG
GTGGTTGTCG ATTTCACACA GGAAAAGCGG CAGATATTCG AAGAGTGCTG GCGGTTGCTG
CGCGATTGCT TTTACGATGA GCGCTTTCGC GGCATCGACT GGAAAACGAT ACACGACCGG
TATGCGCCAC TGATCCAGGG GGCGCAGACC CCTCCTGAAG TGTATCTCTT GATCAACCTG
ATGGTCGGAG AACTCCGTTC TTCACACGTT GGATTGTTCA GAAGCGACGG GAATGGCGGT
CAGGACGGCT ACCTCGGCAT CACGCTCGAT ACCATCGAAT ATCTGCGCAG CGGTCAATTC
CGCGTTGCCG GGGTTATTCC CGACGGCCCG GCGGCAGTCG CCCACAATGG TGCGCTTCAA
CCCGGTGATG TGTTGCTGGC AGTCAATGGC GTCGCATTGA AACCGGAAAC ATCGCTCGAT
GCGTTGTTGC AACGCACGGC GGGCCAGCGG GTTATTCTGC GTGTGGTCAA TCCTGCCGGC
GAGCAGCGCG ATGTCGAGGT GCGCCCCATC ACGGCGGAAC AGTATGACTG GTTGCGATAC
CGGGCATGGG TGCTTGAAAA TGAACAGATC GTTCATCGGG CGAGTAACGG GCGCATCGGC
TATGTGCACA TTCGTAAGAT GAGTTACGAT GCATACCAGC AGTTCCTGAC CGATCTCGAT
GTCGAAATGC ATAACAAAGA AGGCTTGGTC GTCGACATCC GCTTTAACTC AGGCGGGCAT
ACGGCAACGT TTATTCTCGA TGTTCTGATG CGTCGCAGCG TTCTGTTCAG CGCCTTTCGC
AATCGCTCTG TCGCCGATTC GTCGCATATC TTTGGCAATC GTGTGCTGAA CAAGCCGACG
GTGCTGGTGA CGAACGAGGC ATCGAGTTCT AATGCAGAAA CCTTCAGCGA ATCGTACCGT
CGCCAGGGTC TGGGGAAGGT GGTCGGAAAA CCAACGGCTG GCGCGGTTAT CGGCACGTTC
ACTCGACTCC TGATTGATGG GACATCGCTC CGGTTGCCGC AACTGCGGGT CACGACACCT
GAAGGCGAAG AGCTGGAAGG GCGCGGTCGA CCGGTAGACG TCGATGTGCC GCTGCGACTG
GGGGAGTGGC GCTATGGGCG TGATGCTCAA CTCGAGGCGG CAGTGCGCGT GTTGCTCGCC
GATCTCGATG GGCCGGACCA TCATATCGAA TAG
 
Protein sequence
MTDDRYVIRP YFRAPSIDPA GSRIAFVYAG DIWLVDVSGG QAERLTAHPA NHMLPRWSPD 
GSAIAYTSSR TGQGDVYVLP LNGGEVRRIT YHEAQSAVEC WSPDGAAIYF TSQRERQGAA
IYRIAASGGT PVRWIAQPYE RLGTVAVSPC GRWLAFSLMR DLWWRRGPNP YGGAELWLVS
NAPDADDFFQ LSDAPGMNYC PMWSPDSQLI FFVSDRDGSE NLWVAPREGG VAERITAFTD
GRLLWPSISA DGRMIAFERD FGIWTLDLAS GATTRVPIQV RQDTKVTPVR VQTYTRDAGE
LALSPDGKKV AFTVRGKIFA DFAEKETERD QRHGPAYRIS HTAFRDSDIA WSPDSRRLVY
VSDRHGDEEV YCYDFLARSE TRLTFGAKRK SAPCFSPDGR WIAYASGDDE IRLIEVSSGE
DRPFIRAYFV FGASFAWSPD SRWLAFCAQD ERFFSNVYVQ HIEEEQAHQI SFLSNLQAAG
PLWSPDGRFL VFTTGQYRAE SQIARIDLQP QPPVFRETEF ERLFELHKSD CRDKQTAGPP
STVGPPHTRS ADADREPQPM VVPVIEPDDE SDRAARQHNA PSRLSRSTSP SEITIVFNGI
ERRLRLLTPP QMDAIALCIS PDGRDLVCSA TIAGRQNLWT LPLDEPRAGQ GPRQLTSTPG
TKSCACFTPD GKQIYFLDGG TIALRKFPKG EQTTLPVSAE VVVDFTQEKR QIFEECWRLL
RDCFYDERFR GIDWKTIHDR YAPLIQGAQT PPEVYLLINL MVGELRSSHV GLFRSDGNGG
QDGYLGITLD TIEYLRSGQF RVAGVIPDGP AAVAHNGALQ PGDVLLAVNG VALKPETSLD
ALLQRTAGQR VILRVVNPAG EQRDVEVRPI TAEQYDWLRY RAWVLENEQI VHRASNGRIG
YVHIRKMSYD AYQQFLTDLD VEMHNKEGLV VDIRFNSGGH TATFILDVLM RRSVLFSAFR
NRSVADSSHI FGNRVLNKPT VLVTNEASSS NAETFSESYR RQGLGKVVGK PTAGAVIGTF
TRLLIDGTSL RLPQLRVTTP EGEELEGRGR PVDVDVPLRL GEWRYGRDAQ LEAAVRVLLA
DLDGPDHHIE