Gene Sare_4136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4136 
Symbol 
ID5705581 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4699833 
End bp4703165 
Gene Length3333 bp 
Protein Length1110 aa 
Translation table11 
GC content72% 
IMG OID641273564 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001538917 
Protein GI159039664 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0500799 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGCGC TGGCGTTGAC CACGCCGACC GCCGCTCCCG CCGCGCCAAC CGCCGTTCCC 
GCCGCGCCGA CCGCCGCGTC GGGTGCCACG GCAGGCGCCC CGAATGTCGA GTCCGGCAAG
TTGCACACCG TTACCCTGAT CACCGGGGAC CGAGTCACCG TGACCGCCGC CGGCAATGCC
ACGGTGCGTC CCGGCCCGGA CCGCAAGGAC ATGCGCTTCC TGATGGACCA CGAGCGCGGC
CAGCTCTCCG TCGTGCCACA GGACGCGGTC GCGCTGATCC AGGCCGGTCG GGTCGACCGT
CGGCTCTTCG ACATCACCGG GCTGATCGCC GCCGGCTACG ACGATGCCCG TCGGGACACA
CTGCCGTTAC TCGTGTCGTA CCTGGACGCC CCGGGCGGCC GCGGAGCGGT CGCGCCGGCT
GGGATGCGGG TGACCCGCGA CCTACCGGCG ATCAACGGCG CCGCGGTGAC CACCGGCAAG
TCCGACGCCG CTGCGGTCTG GTCCGCCCTC AACCAAGGCG GGGCCGACGC CCGGTTCGGC
ACGCAGGAGG GCGTCGAACG GCTCTGGCTC GACGGCCGTC GCACCATCAC CCTCGACCAC
AGCGCCAGCC AGATCGGGGC TCCCGCCGCC TGGTCGACGG GCCTCACCGG AACGGGGGTG
ACCGTGGCGG TGCTGGACAC CGGCATCGAC GCCACCCACC CCGACGTGAT CGGCAAGATC
GCCGAGGCGC GCAACTTCAC CGACACGGAT GCCCACGACA CCGTCGGGCA CGGAACCCAC
GTCGCCTCGA CCATCGCCGG CAGCGGTGCC GCGTCCGACG GTCGGTACAA GGGCGTGGCG
CCCGACGCGA CGCTGCTCGA TGGCAAGGTC TGCGGGGAGG TCGGCTGCGC CGAGTCGGCG
ATCCTGGCCG GCATGCAGTG GGCCGCGGTC GACAAACGCG CTGACGTCAT CAACATGAGT
CTGGGTGGGC ACAACGGCCC AGAGATCGAC CCGATGGAGG AGGCGGTCGG AGCGCTGACC
GCGCAGACCG GCGCGCTCTT CGTGATCTCG GCCGGCAACA CCGGCCGGGA CGGCACGGTG
GGCTCTCCGG CCAGCGCGGA CGCCGCCCTG GCCGTCGGCG CGGTCGACCG GGAGGACGAA
CTCGCCGTCT TCAGCAGCCG GGGGCCGCGG GCCGGTGACG ACGCGCTGAA GCCCGACATC
ACCGCCCCCG GGGTCGACAT CGTCGCCGCT CGGTCGGCCC ACAGCCGCAT CGGTGAACCG
GTAGGGGAGC GCTACGCCCG GCTTTCCGGT ACCTCCATGG CCGCCCCGCA CGTGGCCGGT
GCGGCGGCGT TGCTCGCCCA GCAGCATCCG GACTGGGCGG CGGAGCAGCT CAAGTCGACC
CTGATGGCGG CCGCCCGGCC GCATCCGGGG CAGACCGCGT ACCAGCAGGG GGCCGGGCGG
GTCGACCTGA CCCGGGCGAT CGAGCAGACG GTGACCAGCG ACCCGGTCAG CGTCTCCTTC
GGCCGGGCGC GCTGGCCACA CGACGACGAC AAGCCGGTCA CCCGTACCGT CTCCTGGCGC
AACTCGGGGT CCAGCCCGGT CCCGCTCGAC CTCACCGTCG AGGCGGCCGG CCCCGGCGGC
AGGCCGGCCC CCGCCGGCAT GTTCACCCTC GGCACGGACC AGGTCACCAT CCCGGCCGGC
GGCCGGGTCG AGACCACCGT CACCGTGGAC ACCCGGCTGG GCGACGCCGA CGGCTACTGG
ACCGGTCGCG TGCTGGCTCG CTCCGGCGAC ACCGTGGCGG TCACCCCGCT GGCGGTGAAC
CGTGAGGTGG AGAGCTACGA CGTCACCCTG ACCCACCGAG ATCGGGCGGG AGCCGCCGCG
GCGGAATACT GGACCAGCCT GATCGGGCTG GACTCGTTCG GTCTTCGGTA CGCCTACGAC
GCCGACGGGA CGGTGGACGT GCGGCTTCCG AAGGGCCGGT ACGGGCTGAA GTCGACGATC
TTCGAGCCGG CCGGGGAGGG GCCCGGCGGC GCGACCGATC TGGTCGCGCC CGAGTTGGTG
ATCGACCGCG AACGAGTCAT CTCCCTGGAC GCGCGGACCG CGAAGCCGGT CCGGGTAACC
GTCCCCCGGC GGGATGCGAC ACCAGCTGTG GTCGATATCG GCTCGCACTT CTACAGTGCC
GACGGCTTCG CCTACAGCCA CCTTCTGCTG GCGAACGACT TCGACGACAT CGCCATCGGT
CAGATCGGGA ACGCCAGCGT GTCCGACGAG ACATTCGTCG CCACCATCAA TAGTCAGTGG
GCAGACCTGG AGGCGGCACA CAGCCCTTAC CTGTACGCGC TCAGTGAGAC GATCCCGGGA
CGGATGCCCA CTGGATTCGC CCGGCACTAC CGCACGAGCG ACCTCGCCAC CGTGAAGCAC
CGGTTCCGCG GTGGCTACGA GGGGATGGCC GCGGAGCGGT TGGTCCTGCC CGTGCCGGAG
TATGACACCT TTGGGACATT TGTCAGGCTG CCCACCACTG TGCCCGGCCA GCGGGTCGAG
TACTACAACA CCAAGGGGGT GCGCTGGGTG TCCGTGGTCA ACTTTGTGGA ACCGGCGGCG
GCGTGGCGGG AGCCGACAGC CTGGCTCTCG TCCGAAGAGA CAGCGTACCG GGCCGGTCGG
ACCACCCAGG AGACCTGGAA CCAGGCACCG CACGGGCCGT CTTTCCCCGT GCAGACCGTG
AGCCACCCGC TCGACGTCAT TCACGTGAGT CGGTGGGGCG ACACCATTCA CGCCGGTATC
CCGGTCTTCA GCGACGCCGC CGGCCACCGC GGCAACTCTC TGGAAGAGAC CGAGCGGATG
CGGCTGTGGC GCGACGGGAA GCTGGTCGGC GAGTCGGAGC AGGCCCGCCT GGGCGAGTTC
CCGGTGCCAC CGGCCGAGGC GGACTACCGG CTCACCACCG AGGCGACCCG CAGCTTCACC
GACCTCAGCA CCGAGGTGGA GTCCACCTGG ACCTTCCGCT CGAAGCACGA GGCCGGTGCG
GAGCCCGTCC GGCTACCGCT GTCGTCGATC CGGTTCATCC CACCGCTGGC CGCCGACAAC
AGCGCCCGCA CGGGGCGGCT GCTGCCGATC CCGGTCGAGG TGCGGCGCCA GTCCGGTGCG
GGGACCGCGA CCGTCGCGAA GCTGACCGTT GACGCCTCGT ACGACGGCGG GACGACGTGG
CGGAAGGTGC CCGTGCGGCG TGCGGGCGAC GGCTGGACCG CCCTGGTCCG GCACCCCGAG
GCTCCGGGGT ACGTGTCGCT GCGGGCCCAC GCGCGGGACG CCGACGGCAA CACGGTGAGC
ACGCGAATCC TTCAGGCGTA CCGCCTGAAG TGA
 
Protein sequence
MLALALTTPT AAPAAPTAVP AAPTAASGAT AGAPNVESGK LHTVTLITGD RVTVTAAGNA 
TVRPGPDRKD MRFLMDHERG QLSVVPQDAV ALIQAGRVDR RLFDITGLIA AGYDDARRDT
LPLLVSYLDA PGGRGAVAPA GMRVTRDLPA INGAAVTTGK SDAAAVWSAL NQGGADARFG
TQEGVERLWL DGRRTITLDH SASQIGAPAA WSTGLTGTGV TVAVLDTGID ATHPDVIGKI
AEARNFTDTD AHDTVGHGTH VASTIAGSGA ASDGRYKGVA PDATLLDGKV CGEVGCAESA
ILAGMQWAAV DKRADVINMS LGGHNGPEID PMEEAVGALT AQTGALFVIS AGNTGRDGTV
GSPASADAAL AVGAVDREDE LAVFSSRGPR AGDDALKPDI TAPGVDIVAA RSAHSRIGEP
VGERYARLSG TSMAAPHVAG AAALLAQQHP DWAAEQLKST LMAAARPHPG QTAYQQGAGR
VDLTRAIEQT VTSDPVSVSF GRARWPHDDD KPVTRTVSWR NSGSSPVPLD LTVEAAGPGG
RPAPAGMFTL GTDQVTIPAG GRVETTVTVD TRLGDADGYW TGRVLARSGD TVAVTPLAVN
REVESYDVTL THRDRAGAAA AEYWTSLIGL DSFGLRYAYD ADGTVDVRLP KGRYGLKSTI
FEPAGEGPGG ATDLVAPELV IDRERVISLD ARTAKPVRVT VPRRDATPAV VDIGSHFYSA
DGFAYSHLLL ANDFDDIAIG QIGNASVSDE TFVATINSQW ADLEAAHSPY LYALSETIPG
RMPTGFARHY RTSDLATVKH RFRGGYEGMA AERLVLPVPE YDTFGTFVRL PTTVPGQRVE
YYNTKGVRWV SVVNFVEPAA AWREPTAWLS SEETAYRAGR TTQETWNQAP HGPSFPVQTV
SHPLDVIHVS RWGDTIHAGI PVFSDAAGHR GNSLEETERM RLWRDGKLVG ESEQARLGEF
PVPPAEADYR LTTEATRSFT DLSTEVESTW TFRSKHEAGA EPVRLPLSSI RFIPPLAADN
SARTGRLLPI PVEVRRQSGA GTATVAKLTV DASYDGGTTW RKVPVRRAGD GWTALVRHPE
APGYVSLRAH ARDADGNTVS TRILQAYRLK