Gene Sare_4440 details

Gene Information

Locus tag: Sare_4440
Symbol:
ID: 5705918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5017358
End bp: 5018554
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 76%
IMG OID: 641273856
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001539205
Protein GI: 159039952
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
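
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5017358
end_bp = 5018554
gene_length_bp = 1197
protein_length_aa = 398


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions, 5018554 - 5017358 + 1 gives 1197 bp, and 1197 / 3 - 1 gives 398 aa, which agrees with the listed gene and protein lengths.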


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.223031
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00412922
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTACGAA GCCGGTCCCG CCCCCTCCTC GCCGCTGTTT CGGCGGCGAT GCTGACCGCC 
GTGGTGTCGT CGTTGACCGC CGCCCCCGCA GGCGCGACAC CGCGGTGCGC GTCCCCCCTC
GCCCCGGCTC CCCCGATCGC CACCATGCCG TGGCCGCAGC AACGGTACGC GTCGGAACAG
CTACTGTCCC TCGCCACCGG GGCAGGGGTG ACCATCGCGG TGGTCGACTC CGGGGTGGAC
CGCTCGCACC CACAACTCGC CGGTCGAGTT CTCCCCGGCG CGGACCTGCT CGATCCCGGC
GGAGACGGCA CCCGGGACTG CGCCGGACAC GGCACGGGCG TGGCGAGCAT CATCGTGGCC
GCCCGCCACG ATGGGGTGGC CTTCCACGGC CTCGCGCCGC AGGCCCGGAT CCTGCCGGTA
CGCGTCAGCG AACAGCAGGT GGTCGACGGG CGACAGTCCG GACGCACGGT GGGCGTGGCC
GACTTCGCCG CGGCGATCCG ATGGGCGGTC GACAACGGCG CCGAGGTGCT CAACCTCTCC
GTCGTGCTGC ACGTCGACGC CCCGGCCGTG CGGGCGGCCA TCGCCCACGC GCTGGCCCGG
GACGTGGTCG TGGTGGCAGC CGCCGGCAAC CTCCACGACC AGGGCGACCC CCGCTCCTAC
CCGGCCGCGT ACGACGGGGT GCTGGGGGTG GGCGCGATCG GTGCGGACGG GGTGCGCGCC
TCCTTCTCCC AGGACGGTCC GGAGGTGGAT CTGGTAGCGC CCGGTGCCGA CGTGGTGACC
GCCGCGCCCG GCCAGGGGCA CCACCGGGCC GAGGGCACCA GCTACGCGGC GCCCTTCGTG
GCGGCCACCG CCGCCCTGCT GCGCGGGCAC CGGCCGGAGC TGACGGCGGA GCAGGTGGTA
CGACGAATCC TGGTCAGCAC CGATCCCGCC CCCGGAGGCG GATACGGCGC GGGCGTGCTG
AACCCGTATC GGGCGCTCAC CGAGAGCGGG GGTGCGGCGG CGCCGGCCCG ACCGGCCACC
GCGCTGCTCG ACGACCGGGC CGACCCGGAC CGGATCGCCG AACAGGCCCG TCGGGCGGCG
GCCCAGGATA GGGCGCTGGT GGTGGCCCTG GTGGGCGGGG CGTTGGTCAC GGTGGCGGTG
CTGCTCGCCC TCGTGCTGCC GCGCGGCATC CGCCGTCGCT GGCGGCCTCC GGCGTGA
 
Protein sequence
MLRSRSRPLL AAVSAAMLTA VVSSLTAAPA GATPRCASPL APAPPIATMP WPQQRYASEQ 
LLSLATGAGV TIAVVDSGVD RSHPQLAGRV LPGADLLDPG GDGTRDCAGH GTGVASIIVA
ARHDGVAFHG LAPQARILPV RVSEQQVVDG RQSGRTVGVA DFAAAIRWAV DNGAEVLNLS
VVLHVDAPAV RAAIAHALAR DVVVVAAAGN LHDQGDPRSY PAAYDGVLGV GAIGADGVRA
SFSQDGPEVD LVAPGADVVT AAPGQGHHRA EGTSYAAPFV AATAALLRGH RPELTAEQVV
RRILVSTDPA PGGGYGAGVL NPYRALTESG GAAAPARPAT ALLDDRADPD RIAEQARRAA
AQDRALVVAL VGGALVTVAV LLALVLPRGI RRRWRPPA
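
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before their length or base composition can be compared with the Gene Length, Protein Length, and GC content fields. A minimal sketch of that cleanup follows; the helper names are hypothetical, and the example runs on only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(displayed: str) -> str:
    """Remove spaces and line breaks from a sequence shown in blocks of 10."""
    return "".join(displayed.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene sequence shown in this record.
first_blocks = "ATGTTACGAA GCCGGTCCCG"
dna = clean_sequence(first_blocks)

print(len(dna))          # number of bases after joining the blocks
print(gc_fraction(dna))  # GC fraction of this short excerpt only
```

Pasting the complete gene sequence into `clean_sequence` gives a string whose length and GC fraction can be compared with the values listed under Gene Information.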