Gene Strop_3754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3754 
Symbol 
ID5060232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4302612 
End bp4306073 
Gene Length3462 bp 
Protein Length1153 aa 
Translation table11 
GC content70% 
IMG OID640476012 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001160563 
Protein GI145596266 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.338541 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACGTT ATCTGCCGGC TGCCGGAGCG TGCGCTGGGA ATGACATTCT GCCCTTCGTC 
AATGCGACTC CCGTAGATGT GACCCTCGAC GAGGAGGCCC CGTTGCACCG GAGAAGAACA
ACGGCCGTCA TCGGCCTCGT GCTCACGCTG GCGTTGACCG CGCCGACCGC CGCTTCCGCC
GCGCCGACCG CCGCTTCCGC CGCCACGGCG GGAGCCCCGA AGGTCGAGCC CAACAGGCTA
CACACCGTTA CCTTGATCAC CGGGGACCGA GTCACGGTGA CCGCCGCCGG CAACACCGAG
GTGCGTCCCG GCCCGGACCG GAAGGACATG CGCTTCCTGA TCGACCACGA GCGCGGCGGC
CAGCTCTCCG TCGTGCCACA GGACGCGGTC GCGCTGATCC AGGCCGGTCG GGTCGACCGT
CGGCTCTTCG ACATCACCGG GTTGATCGAC GCCGGCTACG ACGATGCCCG TCGGGACACG
CTGCCGTTGC TCGTGTCGTA CTCGGGGGAA CCGGGCAGCC GTGGTGCGGG CGTGTCGGCC
GGCGTGCGGG TGACCCGCGA CCTGCCAGCG ATCAACGGTG CCGCGGTGAC CGCCGGCAAG
TCTGACGTCG CCGCGGTCTG GTCCGCCCTC AACGTGGGCG CGACCGACGT CCGGTTCGGC
GCGGAGGAGG GCGTCGAGCG GCTCTGGCTC GACGGCCGTC GCACGATCAC CCTTGACCAC
AGCGTCAGCC AGATCGGGGC TCCCACCGCC TGGTCGGCGG GCCTCACCGG AACGGGGGTG
ACCGTGGCGG TACTGGACAC CGGCGTCGAC GCCACCCACC CCGACCTGAT CGGCAAGATC
GCCGAGGCGC GCAACTTCAC CGAGACGCCC GATGCCCACG ACACCGTCGG GCACGGAACC
CACGTCGCCT CGACCATCGC CGGCAGCGGT GTCGCGTCCG ACGGTCGGTA CCAGGGTGTG
GCTCCCGACG CGACGCTGCT CGATGGCAAG GTCTGCCAGG ACGTCGGCTG TCCCGAGTCG
GCGATCCTGG CCGGCATGCA GTGGGCCGCG GTCGACAAGC GAGCTGATGT GGTGAACATG
AGTCTGGGCG GGTGGGACAG CCCAGAGATC GACCCGCTGG AGGAGGCGGT CGGAGCGTTG
ACCGCGCAGA CCGGCGCGCT CTTCGTGGTC TCGGCCGGCA ATGCCGGCGG AGACGGTACG
GTGGGCTCTC CGGCCAGCGC GGACGCTGCC CTGGCTGTCG GCGCCGTCGA CCGGGAGGAC
GAACTCGCCG ACTTCAGCAG TCGGGGGCCA CGCGCAGGTG ACGACGCGCT GAAGCCCGAC
ATCACCGCCC CCGGGGTCGA CATCGTTGCC GCTCGGTCGG CCCACGGTCG CATCGGTGAA
CCGGTAGGAG AGCACTACGC CCGGCTCTCC GGTACCTCAA TGGCCGCCCC GCACGTGGCC
GGTGCGGCGG CACTGCTCGC CCAACAGCAT CCGGACTGGA CGGCGGAGCA GCTCAAGTCG
ACTCTGATGG CGGCCGCCCG GCCGCATCCG GCGCAGACCG CGTACCAGCA GGGAGCCGGG
CGGGTCGACC TGACCCGGGC GATCGGGCAG ACGGTGACCA GCGATCCGGT GAGCGTCTCC
TTCGGCCTGG TTCGCTGGCC ACACGACGAC GACCAGCCGG TCACCCGTGC CGTCTCCTGG
CGCAACTCGG GATCCAGCCT GGTCGCGCTC GACCTCACGG TCGAGGCGGC CGGCCCTGGC
GGCCGGGCCG CGCCCGTCGG CATGTTCATC CTCGGCACGG ACCGGGTCAC CATCCCCGCC
GGCGGGCGGG CCGAGACCAC CGTCACCGTG GACACCCGGC TGGGTGACGT CGACGGCTAC
TGGACCGGTC GCGTGGTGGC CCGCTCCGGT GACATCGTGT CGGTCACCCC GTTGGCGGTG
AACCGTGAGG TGGAGAGCTA CGACCTCACC CTGACCCACC GAGATCGGGC GGGAGCGGCG
ACGGCGGAAT ACTGGACCAA CCTGGTCGGG CTGGATTCGC TCGGTCTTTG GTCCGCCTAC
GACGCCGACG GGACGGTGGA GGTGCGGCTT CCGAAGGGCC GGTACGGGCT GAAGTCGACG
ATCTTCGAGC CGGCCGGGGA GGGGCCTGTC GGCGTGACCG ATCTGGTTGC GCCGGAGTTG
GTGATCGACC GTGAACGAGA TATCACCGTG GACGCGCGGA CCGCGAAGCC GATCCGGGTA
ACCGTCCCCC GACGGGATGC GACCCCGGCG GTGGTCGGCA TCGGCTCGTC CTTCTACAGC
GCCGACGGCG ACTCCTACAA CCTCTTCCTG CGGGCGGACG ACTTCGACGA CATCACCATC
GGTCAGATCG GAAACGGCAG CTTCTCCGAC GAGATATTCA TCGCCACCAT CAGTAGTCAG
TGGGCCGACC TGGAGGCGGC ACACAGCCCC TACCTGTACG CGCTCAGTGA GACGATCCCC
GGACGGGCGC CCACCGGCTT CGTCCGGGAG TACCACAAGA GGGATCTCGC CACCGTGAAG
CACCGGTTCC ACGGTGGCTA CGAGGGGATG GCCGCGGAGC GGTACGTCCT GGCCACACTG
GAGCAGCCCA TTTTTGGAGC ATTCGTCAGG CTGCCCACCA CGGTGCCCGG CCAACGAGTC
GAGTACTACA ACACCAGGGG GGTCCGCTGG AGCGGCGTGA TCAATTTTGC TGCACCGGAC
CTGACGCTGG CGTGGCAGGA GCCGACGGCC TGGCTGGTGT CCGAGCCGAC AGCGTACCGG
GCCGGCCGGA CCACCCGGGA AACCTGGAAC CAGGCACCGC ACGGTCCCTC GTTCCCCGTG
CAGACGGTGA GCGACCCGGG GGACTTCATT CATGTGAGCC GGTTGGGCGA CACCATTCAC
GCCGGGATCC CGGTCTTCAG CGACGCCACC GGCCACCGCG GCAACTCTCT GGAAGAGACC
GAGCGGATGC GGCTGTGGCG CGACGGGAAG CTGGTCGGCG AGTCGGAGGT GGCACGTTTG
GGCGAGTTCC CGGTGCCGCC CGCGGAGGCG GACTACCGGC TTACCGTCGA GGCGACCCGC
GGCTTCACCG ACCTCAGCAC CGAGGTGGAG TCCACCTGGA CCTTCCGCTC GAAACACGAG
ACCGGTCCGG AGCCGGTCCG GCTACCGCTG TCGTCGATCC GGTTCAACCC GCCGTTGGCC
GCCGACAACA GTGCCCGCGC CGGTCGGCTG CTGCGGATCC CGGTCGAGAT GCGGCGACAG
TCCGGCGCGG GGACCGCGAC CGTCGCGAAG CTGACCGTAG ACGCCTCGTA CGACGGTGGG
AAGACGTGGC GGACGGTGCC CGTACGGCGT ACGGGTGACG GCTGGACCGC CCTGGTCCGG
CACCCCGACA CCCCGGGACA CGTGTCGCTG CGGGCTCACG CCCGGGACAC CGACGGCAAC
ACGGTGAGCA CGCGAATCCT GCAGGCGTAC CGCCTGAAGT GA
 
Protein sequence
MERYLPAAGA CAGNDILPFV NATPVDVTLD EEAPLHRRRT TAVIGLVLTL ALTAPTAASA 
APTAASAATA GAPKVEPNRL HTVTLITGDR VTVTAAGNTE VRPGPDRKDM RFLIDHERGG
QLSVVPQDAV ALIQAGRVDR RLFDITGLID AGYDDARRDT LPLLVSYSGE PGSRGAGVSA
GVRVTRDLPA INGAAVTAGK SDVAAVWSAL NVGATDVRFG AEEGVERLWL DGRRTITLDH
SVSQIGAPTA WSAGLTGTGV TVAVLDTGVD ATHPDLIGKI AEARNFTETP DAHDTVGHGT
HVASTIAGSG VASDGRYQGV APDATLLDGK VCQDVGCPES AILAGMQWAA VDKRADVVNM
SLGGWDSPEI DPLEEAVGAL TAQTGALFVV SAGNAGGDGT VGSPASADAA LAVGAVDRED
ELADFSSRGP RAGDDALKPD ITAPGVDIVA ARSAHGRIGE PVGEHYARLS GTSMAAPHVA
GAAALLAQQH PDWTAEQLKS TLMAAARPHP AQTAYQQGAG RVDLTRAIGQ TVTSDPVSVS
FGLVRWPHDD DQPVTRAVSW RNSGSSLVAL DLTVEAAGPG GRAAPVGMFI LGTDRVTIPA
GGRAETTVTV DTRLGDVDGY WTGRVVARSG DIVSVTPLAV NREVESYDLT LTHRDRAGAA
TAEYWTNLVG LDSLGLWSAY DADGTVEVRL PKGRYGLKST IFEPAGEGPV GVTDLVAPEL
VIDRERDITV DARTAKPIRV TVPRRDATPA VVGIGSSFYS ADGDSYNLFL RADDFDDITI
GQIGNGSFSD EIFIATISSQ WADLEAAHSP YLYALSETIP GRAPTGFVRE YHKRDLATVK
HRFHGGYEGM AAERYVLATL EQPIFGAFVR LPTTVPGQRV EYYNTRGVRW SGVINFAAPD
LTLAWQEPTA WLVSEPTAYR AGRTTRETWN QAPHGPSFPV QTVSDPGDFI HVSRLGDTIH
AGIPVFSDAT GHRGNSLEET ERMRLWRDGK LVGESEVARL GEFPVPPAEA DYRLTVEATR
GFTDLSTEVE STWTFRSKHE TGPEPVRLPL SSIRFNPPLA ADNSARAGRL LRIPVEMRRQ
SGAGTATVAK LTVDASYDGG KTWRTVPVRR TGDGWTALVR HPDTPGHVSL RAHARDTDGN
TVSTRILQAY RLK