Gene Sare_0314 details


Gene Information

Locus tag: Sare_0314
Symbol:
ID: 5703609
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 348670
End bp: 351087
Gene Length: 2418 bp
Protein Length: 805 aa
Translation table: 11
GC content: 75%
IMG OID: 641269840
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001535235
Protein GI: 159035982
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
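
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch makes explicit. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 348670
end_bp = 351087

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last one is a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 2418
print(protein_length)   # 805
```

Both results agree with the Gene Length and Protein Length fields of the record.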


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000378195
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGGCCCG GACACCCCCA AGACCCAGCC GTCCCCAGCC CGACCGGTCC GGAACGGAGC 
AGCCCCGGCC CGGTGGTCCT CGCCGTCCTC ATCGGCCTGT GGATCGCCGG GCTCACCGTG
GTCACCCAGT TCGGGGGCTG GCTGATCGAC CAGTTCCTGC TCCTGACCGG CCTGGATCGG
CTGGTCTGGC TCTGGCCGGT GGCCACCGTC GGGTTGGTGG CCCTGGCCGG CACACCGACC
CTGCTGCTCG CCCTGCTGCC CCGCTCGCGC ACGGTCCGCA CCGTCGGCCA GTTCTGGTTG
GCCGCCACGT TGGCATTCGG CGCCCTGAGC CTGCCGCGGG CGCTGCCGCC GGTGCAGCAC
GAGGCGTACC TCGCGACGCT CGCCGGCGTG GCCGGGCTCG CCATGCTCCT CGCGCACCGG
CTGGTCGGCC GGCGAGTCGT GCCGTCCGAC CCGGGCGACG GGTCGAACCG GTCCCGCGCG
GCCCGCCCCA CGCCGGTGAC GTTGCTCGGG CTCGCGGGCG GACTGGCCCT GCTACTGCCG
TGGGTCTGGG TCGGCGCCCT CGGCGGCCTG TTGGAGACAC TGCTCGCCGC GCTCGCCGCG
ACGGCTGTCG GTGGGCTGGC TGCCGCGCTG CTGCCGGCCG CCTTCTGGTC CCGCTTCGCG
TCCGGCACCC CACCCCGACC GGGCCGGTTG GTGCTGGTGG GCGGTCTGGT TGCCGGCGTC
GCGCTGCTGC TACTGGGGGC GGGGGTCGGG CAGTCCGGCG CCCAGTTGCC GCTGCTGCTG
ACGCTGCCGC CGGCCGGGTT CGCGCTGGCC GCCCTGGCGG CCGCCGCCCG GCGGACGGCG
ACCGGTGGTG CGGCCGCAAC CGCCTGGCTG GTCGGCCTGA GTGTGTTCGG TCCACTCGCC
TGGACCGATC CCGAGGAAGT CGGCCTGGTG CTGGCCGGCA CCCGGGAGAC GCCCTTCTGG
GTGGCGGTCG CCGCCGGTGC CGGGTTCGCC GTCGCGGCCG TACTCGCTGT CGCGTACGGG
GTGACACTTG CCCGACCGCA AGGTGCCCGG CCCGGGCGGG CGGTGGCCGG GGTGGCGGCG
GCCACCCTGC TGGTGACGAC CGGGGTGGTG GCCGTCGGGC TGGGACAACC CGGGCTGCAC
GGCGAACGGC TCTTCGTGGT GCTGCGGGAG CAGGCCGACC TGAGCGGCGT ACCCGCCACT
ACCGGGCGGG CCGGCCGCGA CAACCGAGCC ACCGAGGTCT ACCAACGGCT GGTGGAGACG
GCCGAGCGGA CGCAGGCCGA CCTGCGCCGG GAACTGCGCC GGTTCCGGCT CGACCACACG
CCCTACTACC TGGTCAACGC GATCGAGGTC GACGGCGGAC CGCTGGTGCG GACGTGGCTG
GCTGACCGCC CCGAGGTGGA CCGGGTGCTG GTCAGTCAGC GGGTCCGCCC GCTGCCCGTA
CCGGCCCCCG TCACCCAGGG GGACGCTCCG GCACCGCGCG GCGCGGATTG GAACGTTCGG
TTGATCGGCG CCGACCGAGT CTGGTCGGAG TTGGGGGTCA CCGGCGCGGG GGTGACCGTG
GGCAGTTCCG ACTCGGGTGT AGACGGCGGG CACCCGGCGC TGGCTGCAGG CTTCCGCGGC
GGTGACGACT CGTGGTTCGA CCCCTGGAAC GCCACCTCGG CCCCGAGTGA CCGCGGTGGG
CACGGCACCC ACACGCTCGG CAGCGCCGTC GGCCGTGACG GTGTCGGGGT GGCCCCCGGG
GCAGCGTGGA CCGGCTGCGT CAACCTGGAC CGCAACCTCG GCAACCCCGC GCTCTATCTG
GACTGCCTCC AGTTCATGCT GGCGCCCTTC CCGCCGGGCG GGGACCCGCT CACCGACGGG
CGCCCACAAC GCGCGCCGGA CATCCTCACC AATTCGTGGG GCTGCCCGGA GCTCGAGGGC
TGCGACCCCG GGGCGCTTCG CCCGGCCACC GCCGCGCTGG CCGCAGCCGG CATCCTGGTC
GTGTTCGCCG CCGGCAACAC CGGGCCGCGC TGCCGCTCGA TCGAGGATTC ACCGGCCTTC
CACCCGGACG TCCTCACCGT CGGCGCGGTG GACCGACAGC GACGGGTCAC CGACTTCTCC
TCGCGCGGCC CGGCGCCGGG CGGGGGCGCC AAGCCCGACC TGGTGGCTCC CGGTGCAGAC
GTGCTCAGCG CGATGCCGGG GGGCGGGTAC GGCGTACTCA GTGGCACGTC GATGGCGACT
CCGCAGGTGG CGGGTGTGGC CGCGCTGATG TGGTCGGCCA ATCCGGCGTT GATCGGGGAC
CTGCCCCGGA CGCGCCGGAT CCTGACCGCG ACCGCCAGCC CGGTCCCGGT CAGCGGCGAG
TCGTGCGGGG ACGAGGCGCA CGTCGTGGGA GCCGGCCTGG TTGACGCGTA CGCCGCCGTC
CGTGCCGCTC AGGGCTGA
 
Protein sequence
MRPGHPQDPA VPSPTGPERS SPGPVVLAVL IGLWIAGLTV VTQFGGWLID QFLLLTGLDR 
LVWLWPVATV GLVALAGTPT LLLALLPRSR TVRTVGQFWL AATLAFGALS LPRALPPVQH
EAYLATLAGV AGLAMLLAHR LVGRRVVPSD PGDGSNRSRA ARPTPVTLLG LAGGLALLLP
WVWVGALGGL LETLLAALAA TAVGGLAAAL LPAAFWSRFA SGTPPRPGRL VLVGGLVAGV
ALLLLGAGVG QSGAQLPLLL TLPPAGFALA ALAAAARRTA TGGAAATAWL VGLSVFGPLA
WTDPEEVGLV LAGTRETPFW VAVAAGAGFA VAAVLAVAYG VTLARPQGAR PGRAVAGVAA
ATLLVTTGVV AVGLGQPGLH GERLFVVLRE QADLSGVPAT TGRAGRDNRA TEVYQRLVET
AERTQADLRR ELRRFRLDHT PYYLVNAIEV DGGPLVRTWL ADRPEVDRVL VSQRVRPLPV
PAPVTQGDAP APRGADWNVR LIGADRVWSE LGVTGAGVTV GSSDSGVDGG HPALAAGFRG
GDDSWFDPWN ATSAPSDRGG HGTHTLGSAV GRDGVGVAPG AAWTGCVNLD RNLGNPALYL
DCLQFMLAPF PPGGDPLTDG RPQRAPDILT NSWGCPELEG CDPGALRPAT AALAAAGILV
VFAAGNTGPR CRSIEDSPAF HPDVLTVGAV DRQRRVTDFS SRGPAPGGGA KPDLVAPGAD
VLSAMPGGGY GVLSGTSMAT PQVAGVAALM WSANPALIGD LPRTRRILTA TASPVPVSGE
SCGDEAHVVG AGLVDAYAAV RAAQG
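
The sequences above are displayed in space-separated groups of ten characters, so they need to be cleaned before any analysis. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It assumes that translation table 11 uses the standard codon assignments and that an alternative start codon such as GTG is read as methionine when it is the first codon. The function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); table 11
# shares these assignments and differs in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(block):
    """Remove spaces and line breaks from a displayed sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # A start codon in the first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "GTGCGGCCCG GACACCCCCA AGACCCAGCC GTCCCCAGCC CGACCGGTCC GGAACGGAGC"
print(translate(first_line))  # MRPGHPQDPAVPSPTGPERS
```

The printed residues match the first twenty residues of the protein sequence listed above, including the leading M that corresponds to the GTG start codon.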