Gene Sare_0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0044 
Symbol 
ID5707324 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp51562 
End bp53118 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content68% 
IMG OID641269569 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001534971 
Protein GI159035718 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.61029 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTAT CCCGCAGGTC CGTATTCGTC GGACTGGCCA CGATGGCCAT GGTGGCGTCC 
GCAACCCCCG CCATGGCCGC CGAGCCGGTC GGCACGATCA GAAGTGCCGG CGGCGCCACC
GCCGTCGCCG ACAGCTACAT CGTGGTCTTC AAGGACAGCC GGGTCAGCCG TGGCGCTGTC
GAGCAGTCCG TCGACCGCCT GCTTGACCGG CACGGTGGCC AGATGGCCCG GATGTACACC
GCAGCACTCC GCGGAGCAGA GGTGCGGGTG GACGCCAGCG CCGCCGCCCG AATCGCGGCC
GACCCGGCCG TAGCCTACGT CGAGCAGAAC CACACCGTCT CGATCGCCGG TACCCAGCCC
AACCCCCCGT CCTGGGGCCT GGACCGAGTC GACCAGCGAA ACCTGCCGCT GGACAGTTCC
TACACGTACC CGAACACCGC CAGTGACGTG ACCGCCTACA TCATCGACAC CGGAATCCGC
ACCACTCACA CGGACTTCGG TGGTCGGGCC ACGTGGGGCA CCAACACGGC CGACAACAAC
GACACCGACT GCAACGGGCA CGGCACGCAC GTCGCCGGCA CCGTCGGTGG CTCGGCGTAC
GGCATCGCCA AGGAAGCCAA ACTGGTCGCG GTCAAGGTGC TGAACTGCGC CGGCAGCGGC
AGCTACGCCG GGGTCATCGC CGGCGTCGAC TGGGTCACCG CGAACGCGGA CAAGCCGGCC
GTGGCGAACA TGAGCCTCGG TGGCGGTGCG AACAGCTCGG TGGACAACGC GGTGACCAAC
TCGATCAACT CCGGTGTCAC CTACGCGCTG GCGGCGGGGA ACAGCAACGC CAACGCCTGT
AACTACTCGC CGGCCCGTAC CCCTGCGGCG ATCACCGTCG GGTCGACGAC CAGCACCGAC
GGACGGTCCT GGTTCTCCAA CTACGGCACC TGCCTGGACC TCTTCGCACC GGGCTCGTCG
ATCACCGCGC CGTGGAACGA CAGCGACAAC GGCACGAACA CGATCAGCGG CACGTCGATG
GCCTCGCCGC ACGCCGCGGG TGCCGCGGCG CTGGTCCTCT CGGCCAACCC GTCGTACACC
CCGCAACAGA TTCGGGACGC TCTGGTCGAC AACGCCACGG ACAACGTGGT GGGCGGCCCG
GGCAGTGGCT CGCCGAACAA GCTCCTCTAC ATCGGTGACG GCGGCACCCC GCCGCCCCCG
CCCCCGCCCG GCTGCACCGG CACCAACGAC ACCGACGTAG CGATCCCGGA CGCCGGTGCC
GCGGTGACCA GCTCGATCAC CATCACCGAC TGTGACGGAA ACGCCTCGGC GGCCTCGACC
GTGGCAGTGG ACATCCCCCA CACCTGGCGT GGTGACCTCG TGATCGACCT GATCGCGCCG
GACGGCTCGT CCTACCGGCT CAAGACCAAC AACCTGTCCG ACTCCGCCGA CAACGTCAAC
GAGACCTACA CGGTGAACCT CTCCAGCGAG GTAGCGGACG GCACCTGGAA ACTCCAGGTC
CAGGACGTCT ACCGCGCGGA CACCGGCTAC ATCAACACCT GGACCCTGAC GGTCTGA
 
Protein sequence
MGLSRRSVFV GLATMAMVAS ATPAMAAEPV GTIRSAGGAT AVADSYIVVF KDSRVSRGAV 
EQSVDRLLDR HGGQMARMYT AALRGAEVRV DASAAARIAA DPAVAYVEQN HTVSIAGTQP
NPPSWGLDRV DQRNLPLDSS YTYPNTASDV TAYIIDTGIR TTHTDFGGRA TWGTNTADNN
DTDCNGHGTH VAGTVGGSAY GIAKEAKLVA VKVLNCAGSG SYAGVIAGVD WVTANADKPA
VANMSLGGGA NSSVDNAVTN SINSGVTYAL AAGNSNANAC NYSPARTPAA ITVGSTTSTD
GRSWFSNYGT CLDLFAPGSS ITAPWNDSDN GTNTISGTSM ASPHAAGAAA LVLSANPSYT
PQQIRDALVD NATDNVVGGP GSGSPNKLLY IGDGGTPPPP PPPGCTGTND TDVAIPDAGA
AVTSSITITD CDGNASAAST VAVDIPHTWR GDLVIDLIAP DGSSYRLKTN NLSDSADNVN
ETYTVNLSSE VADGTWKLQV QDVYRADTGY INTWTLTV