Gene Sare_0938 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0938 
Symbol 
ID5708049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1058778 
End bp1060016 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content73% 
IMG OID641270456 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001535844 
Protein GI159036591 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.980042 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCATC CCCGTCGGAT GGTGACGCCA CGGCGCCGAT CGGCCGGAAC CAGCGCCATC 
GTCTTCGTCT TTGCGCTCGT GCATGTGCTC TGTTTCGCCG GAACTGCGGG AGCGGATCCA
CTCGGGGACG GGCTGCCGCC GCTTCGGCAG CAGTCCGACG GTTGTGTGCC GGCCTCCGAC
GTCCGTATCC GTGACGTTCC CTGGGCCCAG CGACGCCTGA CGCCCGAGCG GGTCTGGCCG
CTGACCCGTG GGGCTGGCCA GGTGGTAGCG GTGATCGACT CCGGTGTGGC CCGGGTGCCC
CAACTCGCCG GCGGACGGCG CGACCAGGTG GAGATCATCG GCGGCCGGAC CGGCGTGGAC
GACGACTGCC CCGGGCACGG CACGTTCGTC GCTGGTCTGA TCGCAGCCCG CCCGGCGAGG
GACACGGGCT TCAGCGGAGT CGCCCCGGCG AGTACGATCC TGCCGATCCG GCAGACGCGC
AACGGACGCG ACGGCACCGC CGACGGTCTG GCCAAGGCCA TCCGGGTGGC CGCCGACCAG
GGGGCGGACA TCATCAACGT CTCCTCGGCG TCGCTGTTTC CCGACGACAC GCTGCGCCGG
GCCGTCGAGT ACGCCACCAG CAAGGACGTG CTGATCGTGG CGGCGGTCGC CAACGAGCTC
GGCAACGGCA ACGCCCACCC GTACCCCGCC GCGTACCCGC AGGTTCTCGC GGTCGGAGCG
ATCGGCTCCG ACGGTGCCGC CGCCGACTTC TCCGGCGCCG GAGAGTTCGT CGACCTGGTG
GCTCCGGGAA GCAGCATCGT CAGTGTCGGG CCGCGCGGTG GCGGTCACCT GACCGCCACC
GGCACCAGCT ACGCCGCACC GCTGGTCGCC GGCGCCGCCG CACTCGTGCG GGCCTACCAT
CCGCAGTTGA CAGCCGCGCA GGTAAAACAC CGGCTGCAGG TGACCGCCGA CCCGCCGAGC
AGTACGGTGC CCGACCCGCG ACTTGGTTGG GGTGTCGTCA ACCCGTACGC GGCGGTGACG
TCCATCCTGC CGAACGAGGC CGGTGCCACG CCGGCTGTCG CTCCGCCGGC CACTGTCTCA
GGCCCGACGT GGCCGAGCGG CGGCCTCTCG GGCCGCCGGT CGGCGTTCAT CATCACGGTG
GTTGCCACCG TGCTGGTTGC CGCGGTGGTG GTGGCCCGGG CGGTCGTGCC GCGCGGACGG
CGGCGGCGCT GGCGGCCGGC CGGATGGACG GGCCGGTGA
 
Protein sequence
MSHPRRMVTP RRRSAGTSAI VFVFALVHVL CFAGTAGADP LGDGLPPLRQ QSDGCVPASD 
VRIRDVPWAQ RRLTPERVWP LTRGAGQVVA VIDSGVARVP QLAGGRRDQV EIIGGRTGVD
DDCPGHGTFV AGLIAARPAR DTGFSGVAPA STILPIRQTR NGRDGTADGL AKAIRVAADQ
GADIINVSSA SLFPDDTLRR AVEYATSKDV LIVAAVANEL GNGNAHPYPA AYPQVLAVGA
IGSDGAAADF SGAGEFVDLV APGSSIVSVG PRGGGHLTAT GTSYAAPLVA GAAALVRAYH
PQLTAAQVKH RLQVTADPPS STVPDPRLGW GVVNPYAAVT SILPNEAGAT PAVAPPATVS
GPTWPSGGLS GRRSAFIITV VATVLVAAVV VARAVVPRGR RRRWRPAGWT GR