Gene Information |
Locus tag | Sare_1426 |
Symbol | |
ID | 5704815 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1649511 |
End bp | 1653236 |
Gene Length | 3726 bp |
Protein Length | 1241 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641270935 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001536316 |
Protein GI | 159037063 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
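The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1649511
END_BP = 1653236
GENE_LENGTH_BP = 3726
PROTEIN_LENGTH_AA = 1241


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Codon count minus one, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the interval spans 3726 bp, which is 1242 codons, or 1241 amino acids once the stop codon is excluded, matching the listed Gene Length and Protein Length.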
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.123666 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00136869 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTTCCG ATCTAGGACC CTTCGCCGCT GGCGGGTCGC GGCGGCGGCT GATCGCCGCC GTCGTGGCGG CCGGGATGGT CGCGGCTTCG GCCGCCGCGA CCCATCCGGC CGCCGCGACC GCCGGCCAGC CCCCGACAGC CGCCCCCGTC GCCACGCCCG TCACCGCCCC GGAGACGCAC ACCGTCACCC TGGTCACCGG CGACGTCGTC ACGGTCCGGA CCCTCGCCAA CGGCAAGACC ATCACCGAGG TCGACCAGCC AGACGACGCC ACCGGCGGCT TCAGTGTCCA ACAGAGCGGC GACGACCTGT ACGTCCTGCC CGACGAAGCG ATACCCCTCC TGACCAACGA CCAACTCGAC CGACGACTGT TCAACGTCAC CGACCTCATC GAGATGGGCT ACGACGACGC CCACACCACC GAACTACCCC TGATCGCCAC CTACCCCAGA ACGACAGCCC GCACCACCGC CGCACTACCC GGCACAGCCC TGACCCACGA CCTACCCGCC ATCAACGGCC ACGCCCTCAC CGCCGACAAG GAACAGACCC GCACCGTCTG GTCCACCATC ACCGCCAGCA TCGCCGCCGG TCAGAGTCGC GGCTCCGGCG ACGCGCACGA CAGCGGTGTC CCCAAGCTCT GGCTGGACGG TCAGGTGCGC ACCGCCCTGG CCGACAGCGT CCCGCAGGTC GGCGCCCCCG AGGCCTGGGA CGCCGGCTAC GACGGCGACG GCGTCACCGT CGCCGTCCTC GACACCGGCA TCGACCCCAC CCACCCCGAC CTCGCCGACC AGATCACCGA AAAGGTCAGC TTCGTCCCCG ACCAGGACGC CTCCGACCGG CAGGGGCACG GCACCCACGT CGCCTCGATC ATCGCCGGGA CCGGCGCGGC CTCCGACGGC GACAACACCG GCGTCGCACC GGGAGCCGAT CTGATCATCG GCAAAGTCCT CAACAACAAC GGCATCGGCT ACGACTCCTG GATCATCGCC GGCATGCAGT GGGCCGCCGA ATCCGGCGCC GACGTGGTCA ACATGAGCCT CGGCCACGCG GCGCGCACCG ATGTCCTCGA TCCGTTGACC CTCGCGGTGG ACGCCCTCTC CGCGCAACAC GACACCCTGT TCGTCATGGC CGCCGGCAAC AGCGGAACGA CCATCGCCAC ACCCGGCAAC GCGGAAAGCG CCCTCACCGT CGGCGCGGTG GACAAACAGG ACCGACTCGC CGGGTTCTCC AGCGTCGGAC CACTGGCCTA CAGCGGAGCA ATCAAGCCCG ACATCACCGC CCCCGGCGTC GCCGTCACCG CCGCCCGCTC CCAACAAAGC TCCGGCGACG GCATGTACGT AGGCAAGACC GGCACCTCGA TGGCAGCTCC ACACGTGGCC GGAGCGGCGG CCATCCTCGC CCAACAACAC CCCGACTGGA CCAACACCCA ACTCAAGAAC GCCCTCATGA GCAGTGCCGA GGCGCTGAGC GACAGCTACA ACGCGTTCCA GGTGGGCACC GGCCGGCTGG ATGTGGCGGC GGCGGTGGGT AGCACCGTAC GCGCCACCGG CTCGGCGTTC GTCGGCTACT TCGAATGGCC GCACCAGCCC ACCGACGCCC CCGTCACGGA GCCGGTGACG TTCACCAACA GCGGCACCAC CGCCGTCACC CTCGACCTGA CCACCACTGG CAGCGACGCG TTCACCCTCG ACACGTCCAA GGTGACCGTG CCAGCGGGCG GGCAGGTGGA CGTCCCCGTG ACCGCCGACC CGGGTGGGAT CACCATCGGC TCGCACACCG GCTACCTCGT CGGCACCGAC 
CCGACCACCG GTGAGACCGT CACCCGCACG GCCCTCGGGC TCCTCAAGGA AGACGAACGC TACGGCCTGA CCATCAAGGT CCGCGACCGC GACGGCCAGC CCACCGAGGC ATTCGTCGTG GTACGGAAGG CCGGCGATTG GTTCCCCCGG TTCATCAGCA TCGACGGTGA GCGGACGCTG CGCCTCCCAC CAGGGACCTA CACGATGGAG ACGAAGGTGG AGGTCCCGGG TGAACGAGCT GACTCCCTCG GCCTCACCCT GCTGGCCGCC CCGGCAACCG TGCTCGACAA ACCCACCGAG GTGATGCTGG ACGCCAGTCA GGCCCGACTC CTCGAGTTCA CCACACCCCG GCGCACGGAA GATCGACAGC GCCTGCTCGG GTACACCGTC GACTACGGCA CCGGCCCCAC CTACTACTAC CATTCCATCG CGCCGGCATA CGACGACCTG TACGTCCTAC CGACCGAGAA GAGCACCGAG ACCGCGTTCG CCATGGCAGC CCAGTGGCGC AAGGGTGAGC CCGTCCTGAG CCTACGCGCC TTCGGCCTAC TTCCCATCGA CGCCGCAGTC CAACCAGGCA GCACCATCAC CACCGGCACA CAATGGCTAC GCCCCGTCTA CGCCGGCACC GGCACCCCCG AGGACTACGC CAACCTCAAC GCCAGGGGCA AAATCGCCAT CGTCACCCAC AGCGACAACG TCAGCCCACC CGACCGCGCC GCCGCCGCAG CAGCAGCCGG AGCAACCCTC CTACTCGTCG TCAACGACAA CCCTGGAATC CTCCACGAAC ACGTCGGCGA CTCACCCATC CCCGTCGCCA GCATCCACCG CGACATCGGC AACCTCCTCA CCAAACTCGC CGAACACGGC ATACCAAAAC TGAAAGTCAG CCAAGAGCAA TACCCCGACA CCATCTACGA CCTGACACAA GTCTGGCGAA AGCAGGTACC AGACCAGCCA CTCACCTACC ACCCGAGCCA CCAGGACCTG GCCCGCATCG ACGCCCGCTA CCACGCCGAC CAGGACGCCG AAGGATCCGG CTACCGCGCT CACACGATCC TCGGCCCTGC GCTCGGTGCA CGCGAGCCGG AGTGGCACCC GGGCATTCGC ACCGAGTGGG TCACCCCGGA CGTCCCCTGG GTCGAAAACC ACGTGCAGCG TGGTCTGGAT TGGGGAGTCG TGGCGGACGA GCACACCTAC GCCAAGGGCA CGACCAGCAG GGTGGACTGG TTCGCACCCG CCATCCGCCC AGCCTTCATC CAGTCAGCCC GGCTTAAGAA CAGTCGATAC CAGGATCGCA TGACCGTCTC CGTAGGGGCC TGGAGCCCGT CGGACACTGT GCTCGACTCC AGCGGAGGGA TTCCCTCGGC ACAGCAACAC ATCAAGCTCT ATCAGGGTGA CACCCTGCTG CACGAGGACC CGAACTACAG CTTCCTCTCC AACCGGGAGG TGCCGGCCGG CACGCTGCCG TATCGGCTGG TGTTGGATGG GTCGCGCTCG GCTGACGAGT GGCGGCTGTC CACCCGCACC CACACCGAAT GGGACTTCAT CTCCAGCACC AACCAGGCCG ACCCGTCCGA CGCCGTCCCG ATCACGCTGC TCCAACTCGA CTACGAGATG GAAACCGATC TCCGAGGCGA CGTCGAGGCC GGCACCGACC AGGCGATCAG CGTTACCGCG CGACCGCAAC CCGGCGGCTC CGACATTGGC ACCGGCACCA TCACCACAGT TGAACTCGAC GTCTCCTACG ACGATGGCAC CACCTGGCAA CGAGTAACCC TCAACCAGGG CGACAACAAC CGCTACACCG 
GCACCCTGAC ACTGCCGACA CAACCCGACG GCTTCATCTC CATCCGAGCC GCCGCCGAAA CCGACACCGG ATTCGCCATC CGACAGGAAA TCACTCGCGC CTACGGCCTG CGATGA
|
Protein sequence | MSSDLGPFAA GGSRRRLIAA VVAAGMVAAS AAATHPAAAT AGQPPTAAPV ATPVTAPETH TVTLVTGDVV TVRTLANGKT ITEVDQPDDA TGGFSVQQSG DDLYVLPDEA IPLLTNDQLD RRLFNVTDLI EMGYDDAHTT ELPLIATYPR TTARTTAALP GTALTHDLPA INGHALTADK EQTRTVWSTI TASIAAGQSR GSGDAHDSGV PKLWLDGQVR TALADSVPQV GAPEAWDAGY DGDGVTVAVL DTGIDPTHPD LADQITEKVS FVPDQDASDR QGHGTHVASI IAGTGAASDG DNTGVAPGAD LIIGKVLNNN GIGYDSWIIA GMQWAAESGA DVVNMSLGHA ARTDVLDPLT LAVDALSAQH DTLFVMAAGN SGTTIATPGN AESALTVGAV DKQDRLAGFS SVGPLAYSGA IKPDITAPGV AVTAARSQQS SGDGMYVGKT GTSMAAPHVA GAAAILAQQH PDWTNTQLKN ALMSSAEALS DSYNAFQVGT GRLDVAAAVG STVRATGSAF VGYFEWPHQP TDAPVTEPVT FTNSGTTAVT LDLTTTGSDA FTLDTSKVTV PAGGQVDVPV TADPGGITIG SHTGYLVGTD PTTGETVTRT ALGLLKEDER YGLTIKVRDR DGQPTEAFVV VRKAGDWFPR FISIDGERTL RLPPGTYTME TKVEVPGERA DSLGLTLLAA PATVLDKPTE VMLDASQARL LEFTTPRRTE DRQRLLGYTV DYGTGPTYYY HSIAPAYDDL YVLPTEKSTE TAFAMAAQWR KGEPVLSLRA FGLLPIDAAV QPGSTITTGT QWLRPVYAGT GTPEDYANLN ARGKIAIVTH SDNVSPPDRA AAAAAAGATL LLVVNDNPGI LHEHVGDSPI PVASIHRDIG NLLTKLAEHG IPKLKVSQEQ YPDTIYDLTQ VWRKQVPDQP LTYHPSHQDL ARIDARYHAD QDAEGSGYRA HTILGPALGA REPEWHPGIR TEWVTPDVPW VENHVQRGLD WGVVADEHTY AKGTTSRVDW FAPAIRPAFI QSARLKNSRY QDRMTVSVGA WSPSDTVLDS SGGIPSAQQH IKLYQGDTLL HEDPNYSFLS NREVPAGTLP YRLVLDGSRS ADEWRLSTRT HTEWDFISST NQADPSDAVP ITLLQLDYEM ETDLRGDVEA GTDQAISVTA RPQPGGSDIG TGTITTVELD VSYDDGTTWQ RVTLNQGDNN RYTGTLTLPT QPDGFISIRA AAETDTGFAI RQEITRAYGL R
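The sequences above are displayed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence shown in this record.
    example = "ATGTCTTCCG ATCTAGGACC"
    seq = clean_sequence(example)
    print(len(seq), gc_fraction(seq))
```

Applying the same two functions to the complete gene sequence would give a value that can be compared against the GC content field listed under Gene Information.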
|
| |