Gene Sare_4661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4661 
Symbol 
ID5705718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5281539 
End bp5283227 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content69% 
IMG OID641274059 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001539405 
Protein GI159040152 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCAGAC GCTTCACCGC TGGTGCCGTC GCCACCGTGA CGGGCCTGGC GCTGACCGTA 
GCTGGGCTGG GTGTCCCGGC GGGCGCCGCA CCGAGCAGCA CCCAAACCTT CACTGTGGTC
GCTGAGGACG GCGTTACCGC TGATGTGGCC CTTGCGGAGA TAGCGGCGGC CGGAGGCACC
GTCGTATCCC GGATCGACGA TGTCGGCGTG TTCCAGGTGA CCAGCGACCA GGCGGACTTC
GCCGCCCGGA CCGCCGCCGC CGGCGCCCTG GTCGGCGCCG TCGAGCAGAA GGCCATCGGT
CACAAGCCCA GGCTGGACCC GGTCGAGCAG GAGGCGCTGC TGGCCGCCGC TACCGGCAAG
GGCTCCGGCG CCCGCAAGTC CAAGCGGATG GACCCGCTGG ACGACAAGCT GTGGGGCCTG
GACATGATCA GGGCCGACCG CGCCCGCAAG GTGGAGCCTG GCGACCGGCG AGTCACCGTC
GGCGTCCTGG ACACCGGCCT CGACGCCAGC CACCCGGACA TCGCGCCGAA CTTCAACTGG
GCGTTGTCCC GTAACTTCGC GCCGGACATG CCCGAGGTGG ACGGCGAGTG CGAGGTGGCG
AGCTGCCTCG ACCCGGTCGG CACCGATGAC GGCGGCCACG GCACCCACGT GGCGGGCACC
ATCGGCGCCG CCGCAAACGG ATTCGGCCTC TCGGGCGTCG CGCCGAAGGT CTCGTTGGTG
GAGCTGAAGG GCGGCCAGGA CTCCGGCTAC TTCTTCCTGG AGCCGGTGGT CCAGTCGCTG
ATGCACGCCG GTAGGGCGGG CCTGGACGTG GTGAACATGT CCTTCTACGT CGACCCGTGG
CTCTACAACT GCACCGCCAA CCCGGCCGAC TCCCCCGAGC ACCAGGCCGA GCAGCGGGCC
ATCATCAAGG CGATGAAGCG GGCGCTGAAC TTTGCCCACA AGCGGGGCGT GACGCTGGTC
GGCTCACTCG GCAACAACCA CGAGGACCTG GGCGACCCCC GGATCGACAC GTCCAGCCCG
GACTTCGGCG ACACCCCGCC GTACCCGCGC GAGATCGACA ACGACAGCTG CTGGGACCTT
CCGGTCGAAG GCCCGCACGT CATCGGCGTC TCCGCCATCG GCCCCTCCGG CAAGAAGGCC
GCCTACTCCA ACTACGGCAC CGAGCAGATC GGCATCGCCG CTCCCGGGGG CTGGTTCCGC
GACGGTTTCG GCACCGACAC CTTCCGCACC TACGGCAACC TGATCCTCTC CACCTACCCC
GAGAAGGTGC TCAAGGAAGA CGGTCTGGTG GACGCGGACG GCAACATCGA TCCGAGCGCC
GAAGGGCTCG TGTTCAAGGA ATGCAAGAGC AACGGTGAGT GCGGCTACTA CCGCTACCTC
CAGGGCACCT CGATGGCGTC GCCGCACGCC TCGGGTGTGG CCGCGCTGAT CGTCAGCAAG
CATGGCAAGA AGCAGGGCCG GGCCGGTTAC GGCCTGGACC CGGACCTGGT CGAGCGGCAC
CTCTACCGCA CCGCCACCGA GCAGGCGTGC CCGAACCCGC GCCTGCAGCA GTACCGCGAC
GAAGGCCGCG ACGAGACCTA CGACGCGTAC TGCGCCGGTG GGCGCAACTT CAACGGCTTC
TACGGGTACG GCGTCATCGA CGCGTACGCG GCGGTAGCCA CCCCACTCAA GTCACACGGC
CGACCGTAG
 
Protein sequence
MSRRFTAGAV ATVTGLALTV AGLGVPAGAA PSSTQTFTVV AEDGVTADVA LAEIAAAGGT 
VVSRIDDVGV FQVTSDQADF AARTAAAGAL VGAVEQKAIG HKPRLDPVEQ EALLAAATGK
GSGARKSKRM DPLDDKLWGL DMIRADRARK VEPGDRRVTV GVLDTGLDAS HPDIAPNFNW
ALSRNFAPDM PEVDGECEVA SCLDPVGTDD GGHGTHVAGT IGAAANGFGL SGVAPKVSLV
ELKGGQDSGY FFLEPVVQSL MHAGRAGLDV VNMSFYVDPW LYNCTANPAD SPEHQAEQRA
IIKAMKRALN FAHKRGVTLV GSLGNNHEDL GDPRIDTSSP DFGDTPPYPR EIDNDSCWDL
PVEGPHVIGV SAIGPSGKKA AYSNYGTEQI GIAAPGGWFR DGFGTDTFRT YGNLILSTYP
EKVLKEDGLV DADGNIDPSA EGLVFKECKS NGECGYYRYL QGTSMASPHA SGVAALIVSK
HGKKQGRAGY GLDPDLVERH LYRTATEQAC PNPRLQQYRD EGRDETYDAY CAGGRNFNGF
YGYGVIDAYA AVATPLKSHG RP