Gene Sare_1426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1426 
Symbol 
ID5704815 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1649511 
End bp1653236 
Gene Length3726 bp 
Protein Length1241 aa 
Translation table11 
GC content68% 
IMG OID641270935 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001536316 
Protein GI159037063 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.123666 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00136869 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTTCCG ATCTAGGACC CTTCGCCGCT GGCGGGTCGC GGCGGCGGCT GATCGCCGCC 
GTCGTGGCGG CCGGGATGGT CGCGGCTTCG GCCGCCGCGA CCCATCCGGC CGCCGCGACC
GCCGGCCAGC CCCCGACAGC CGCCCCCGTC GCCACGCCCG TCACCGCCCC GGAGACGCAC
ACCGTCACCC TGGTCACCGG CGACGTCGTC ACGGTCCGGA CCCTCGCCAA CGGCAAGACC
ATCACCGAGG TCGACCAGCC AGACGACGCC ACCGGCGGCT TCAGTGTCCA ACAGAGCGGC
GACGACCTGT ACGTCCTGCC CGACGAAGCG ATACCCCTCC TGACCAACGA CCAACTCGAC
CGACGACTGT TCAACGTCAC CGACCTCATC GAGATGGGCT ACGACGACGC CCACACCACC
GAACTACCCC TGATCGCCAC CTACCCCAGA ACGACAGCCC GCACCACCGC CGCACTACCC
GGCACAGCCC TGACCCACGA CCTACCCGCC ATCAACGGCC ACGCCCTCAC CGCCGACAAG
GAACAGACCC GCACCGTCTG GTCCACCATC ACCGCCAGCA TCGCCGCCGG TCAGAGTCGC
GGCTCCGGCG ACGCGCACGA CAGCGGTGTC CCCAAGCTCT GGCTGGACGG TCAGGTGCGC
ACCGCCCTGG CCGACAGCGT CCCGCAGGTC GGCGCCCCCG AGGCCTGGGA CGCCGGCTAC
GACGGCGACG GCGTCACCGT CGCCGTCCTC GACACCGGCA TCGACCCCAC CCACCCCGAC
CTCGCCGACC AGATCACCGA AAAGGTCAGC TTCGTCCCCG ACCAGGACGC CTCCGACCGG
CAGGGGCACG GCACCCACGT CGCCTCGATC ATCGCCGGGA CCGGCGCGGC CTCCGACGGC
GACAACACCG GCGTCGCACC GGGAGCCGAT CTGATCATCG GCAAAGTCCT CAACAACAAC
GGCATCGGCT ACGACTCCTG GATCATCGCC GGCATGCAGT GGGCCGCCGA ATCCGGCGCC
GACGTGGTCA ACATGAGCCT CGGCCACGCG GCGCGCACCG ATGTCCTCGA TCCGTTGACC
CTCGCGGTGG ACGCCCTCTC CGCGCAACAC GACACCCTGT TCGTCATGGC CGCCGGCAAC
AGCGGAACGA CCATCGCCAC ACCCGGCAAC GCGGAAAGCG CCCTCACCGT CGGCGCGGTG
GACAAACAGG ACCGACTCGC CGGGTTCTCC AGCGTCGGAC CACTGGCCTA CAGCGGAGCA
ATCAAGCCCG ACATCACCGC CCCCGGCGTC GCCGTCACCG CCGCCCGCTC CCAACAAAGC
TCCGGCGACG GCATGTACGT AGGCAAGACC GGCACCTCGA TGGCAGCTCC ACACGTGGCC
GGAGCGGCGG CCATCCTCGC CCAACAACAC CCCGACTGGA CCAACACCCA ACTCAAGAAC
GCCCTCATGA GCAGTGCCGA GGCGCTGAGC GACAGCTACA ACGCGTTCCA GGTGGGCACC
GGCCGGCTGG ATGTGGCGGC GGCGGTGGGT AGCACCGTAC GCGCCACCGG CTCGGCGTTC
GTCGGCTACT TCGAATGGCC GCACCAGCCC ACCGACGCCC CCGTCACGGA GCCGGTGACG
TTCACCAACA GCGGCACCAC CGCCGTCACC CTCGACCTGA CCACCACTGG CAGCGACGCG
TTCACCCTCG ACACGTCCAA GGTGACCGTG CCAGCGGGCG GGCAGGTGGA CGTCCCCGTG
ACCGCCGACC CGGGTGGGAT CACCATCGGC TCGCACACCG GCTACCTCGT CGGCACCGAC
CCGACCACCG GTGAGACCGT CACCCGCACG GCCCTCGGGC TCCTCAAGGA AGACGAACGC
TACGGCCTGA CCATCAAGGT CCGCGACCGC GACGGCCAGC CCACCGAGGC ATTCGTCGTG
GTACGGAAGG CCGGCGATTG GTTCCCCCGG TTCATCAGCA TCGACGGTGA GCGGACGCTG
CGCCTCCCAC CAGGGACCTA CACGATGGAG ACGAAGGTGG AGGTCCCGGG TGAACGAGCT
GACTCCCTCG GCCTCACCCT GCTGGCCGCC CCGGCAACCG TGCTCGACAA ACCCACCGAG
GTGATGCTGG ACGCCAGTCA GGCCCGACTC CTCGAGTTCA CCACACCCCG GCGCACGGAA
GATCGACAGC GCCTGCTCGG GTACACCGTC GACTACGGCA CCGGCCCCAC CTACTACTAC
CATTCCATCG CGCCGGCATA CGACGACCTG TACGTCCTAC CGACCGAGAA GAGCACCGAG
ACCGCGTTCG CCATGGCAGC CCAGTGGCGC AAGGGTGAGC CCGTCCTGAG CCTACGCGCC
TTCGGCCTAC TTCCCATCGA CGCCGCAGTC CAACCAGGCA GCACCATCAC CACCGGCACA
CAATGGCTAC GCCCCGTCTA CGCCGGCACC GGCACCCCCG AGGACTACGC CAACCTCAAC
GCCAGGGGCA AAATCGCCAT CGTCACCCAC AGCGACAACG TCAGCCCACC CGACCGCGCC
GCCGCCGCAG CAGCAGCCGG AGCAACCCTC CTACTCGTCG TCAACGACAA CCCTGGAATC
CTCCACGAAC ACGTCGGCGA CTCACCCATC CCCGTCGCCA GCATCCACCG CGACATCGGC
AACCTCCTCA CCAAACTCGC CGAACACGGC ATACCAAAAC TGAAAGTCAG CCAAGAGCAA
TACCCCGACA CCATCTACGA CCTGACACAA GTCTGGCGAA AGCAGGTACC AGACCAGCCA
CTCACCTACC ACCCGAGCCA CCAGGACCTG GCCCGCATCG ACGCCCGCTA CCACGCCGAC
CAGGACGCCG AAGGATCCGG CTACCGCGCT CACACGATCC TCGGCCCTGC GCTCGGTGCA
CGCGAGCCGG AGTGGCACCC GGGCATTCGC ACCGAGTGGG TCACCCCGGA CGTCCCCTGG
GTCGAAAACC ACGTGCAGCG TGGTCTGGAT TGGGGAGTCG TGGCGGACGA GCACACCTAC
GCCAAGGGCA CGACCAGCAG GGTGGACTGG TTCGCACCCG CCATCCGCCC AGCCTTCATC
CAGTCAGCCC GGCTTAAGAA CAGTCGATAC CAGGATCGCA TGACCGTCTC CGTAGGGGCC
TGGAGCCCGT CGGACACTGT GCTCGACTCC AGCGGAGGGA TTCCCTCGGC ACAGCAACAC
ATCAAGCTCT ATCAGGGTGA CACCCTGCTG CACGAGGACC CGAACTACAG CTTCCTCTCC
AACCGGGAGG TGCCGGCCGG CACGCTGCCG TATCGGCTGG TGTTGGATGG GTCGCGCTCG
GCTGACGAGT GGCGGCTGTC CACCCGCACC CACACCGAAT GGGACTTCAT CTCCAGCACC
AACCAGGCCG ACCCGTCCGA CGCCGTCCCG ATCACGCTGC TCCAACTCGA CTACGAGATG
GAAACCGATC TCCGAGGCGA CGTCGAGGCC GGCACCGACC AGGCGATCAG CGTTACCGCG
CGACCGCAAC CCGGCGGCTC CGACATTGGC ACCGGCACCA TCACCACAGT TGAACTCGAC
GTCTCCTACG ACGATGGCAC CACCTGGCAA CGAGTAACCC TCAACCAGGG CGACAACAAC
CGCTACACCG GCACCCTGAC ACTGCCGACA CAACCCGACG GCTTCATCTC CATCCGAGCC
GCCGCCGAAA CCGACACCGG ATTCGCCATC CGACAGGAAA TCACTCGCGC CTACGGCCTG
CGATGA
 
Protein sequence
MSSDLGPFAA GGSRRRLIAA VVAAGMVAAS AAATHPAAAT AGQPPTAAPV ATPVTAPETH 
TVTLVTGDVV TVRTLANGKT ITEVDQPDDA TGGFSVQQSG DDLYVLPDEA IPLLTNDQLD
RRLFNVTDLI EMGYDDAHTT ELPLIATYPR TTARTTAALP GTALTHDLPA INGHALTADK
EQTRTVWSTI TASIAAGQSR GSGDAHDSGV PKLWLDGQVR TALADSVPQV GAPEAWDAGY
DGDGVTVAVL DTGIDPTHPD LADQITEKVS FVPDQDASDR QGHGTHVASI IAGTGAASDG
DNTGVAPGAD LIIGKVLNNN GIGYDSWIIA GMQWAAESGA DVVNMSLGHA ARTDVLDPLT
LAVDALSAQH DTLFVMAAGN SGTTIATPGN AESALTVGAV DKQDRLAGFS SVGPLAYSGA
IKPDITAPGV AVTAARSQQS SGDGMYVGKT GTSMAAPHVA GAAAILAQQH PDWTNTQLKN
ALMSSAEALS DSYNAFQVGT GRLDVAAAVG STVRATGSAF VGYFEWPHQP TDAPVTEPVT
FTNSGTTAVT LDLTTTGSDA FTLDTSKVTV PAGGQVDVPV TADPGGITIG SHTGYLVGTD
PTTGETVTRT ALGLLKEDER YGLTIKVRDR DGQPTEAFVV VRKAGDWFPR FISIDGERTL
RLPPGTYTME TKVEVPGERA DSLGLTLLAA PATVLDKPTE VMLDASQARL LEFTTPRRTE
DRQRLLGYTV DYGTGPTYYY HSIAPAYDDL YVLPTEKSTE TAFAMAAQWR KGEPVLSLRA
FGLLPIDAAV QPGSTITTGT QWLRPVYAGT GTPEDYANLN ARGKIAIVTH SDNVSPPDRA
AAAAAAGATL LLVVNDNPGI LHEHVGDSPI PVASIHRDIG NLLTKLAEHG IPKLKVSQEQ
YPDTIYDLTQ VWRKQVPDQP LTYHPSHQDL ARIDARYHAD QDAEGSGYRA HTILGPALGA
REPEWHPGIR TEWVTPDVPW VENHVQRGLD WGVVADEHTY AKGTTSRVDW FAPAIRPAFI
QSARLKNSRY QDRMTVSVGA WSPSDTVLDS SGGIPSAQQH IKLYQGDTLL HEDPNYSFLS
NREVPAGTLP YRLVLDGSRS ADEWRLSTRT HTEWDFISST NQADPSDAVP ITLLQLDYEM
ETDLRGDVEA GTDQAISVTA RPQPGGSDIG TGTITTVELD VSYDDGTTWQ RVTLNQGDNN
RYTGTLTLPT QPDGFISIRA AAETDTGFAI RQEITRAYGL R