Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2355 |
Symbol | |
ID | 5055818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2104907 |
End bp | 2108683 |
Gene Length | 3777 bp |
Protein Length | 1258 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640469906 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001154550 |
Protein GI | 145592548 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.945162 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAATAA AACCCATACT TACAGCATTG GTAATAACAA TGGTAATCCT TTATGCCCAG TCTCTGAAAA CTACAGGCAC AAACCCGGCG ATCTTTGACA CAAACGCGCC GGTGGTTGCC AAACTCGTTG TTTCTGACTT AAACTCGTTT GCATCTCAGT GGAAGGCGGC CCCGACTGTA CGATTGATGA AAATAGCAAA TAGAGATGTA AATGTCGTAT CGCTTTCCAT AAACGGCGTC CAGTTGTCTG GTAGAGTGGA TGGCAATGTG GCATCGCTTT ACTACAAGGG GCCAGCCGCG GCTTTGAAAA AAGTTGTGGA ATCTCCATAT GTCCAGTCCG TATACGTGAA GGCAATGCCG GAGATACCGC CTAAGGACTT CTTTACGGAG GTGCGCGATT TTGTGGAAAG GGGGGCGGGC ACTCCGCAAC CCACGCTCCC GCTCATGAGG GAGATAATCG GCGCCAGCAA AGTTGAACAG CTATTCGGCG TAAACGGCAC CGGCGTGGTG ATCGCCATAG TGGACACAGG CGTCGACTAC GGTCACCTAG ACCTCCAAGA CGCACTGGCA TGGCTGATAA AGACTAAGGA CAACAAGGAG ATTATAGCCG CCTCAATAGC GATCTCCGGC ACCACTCTCC AGTACAAGAC GCTGAGGGGT CAGACAGCCT CGGTACCCCT TGATCAAGTA GCCTCAGTAG AGCCGCTTGT CCTCGACGCA GACGAGTCGC AGGTTATTCT CTTCCAGGGC TTTACGGCCT CTGGAGGTTA CTTGCCCACC AGCGGCAAGG CATTTAATGT AGTAGACGGA CCCAGCCTCT ACGACGTCAC CGCATCTTGC AACTACGCAG TGGCGGGTCT GACAAGTAGG AGCGGCGTGT ATAAATTTGG CATGACGTAC TTGTACATTC CGTGGTACGG CGGCTATGTA AGTGTCGGCG TTGTGATGTA TGACCCAGAA CAGCCGGGCG TCTACACCGC GGCCCGTGTA GATATCAACA ACAACTGCAA CTTCTCTGAC GACACGGAGC TGAGATACTT CGGCAATAGG CTTATTATAG ACAACCCCAC GGCGCCTACT AGGAGCCTCG GCGTCGCCGG AGGCTACTTC TTCGACCGAG GCCTTTGGTT TGACTTCTAC GCCAAGTTCT ACCCCGGGTG GGACCTCTCC GGCAACTACC TCAGCATCTT CTACGACTTC AACGGCCACG GCACGGCCTG TAGCTCCGTG GCCGCGGGTA GGGGCAAGGC CACGTACAAC CTGGGCTTGC TGGGTCCACA AAAGCTGAGG GGGATCGCCC CCGGCGCCAA GGTGCTCGGC GTTAAGGGAC TGTGGTCGGG CATGGTGGAG CCCGGCATGA TGTGGGCTGC CGGCTTCGAC GTCGATAGCA ACGGCCAGTG GTACTGGACT GGGCAGAAGA GGGCCCACGT CATTAGCAAC AGCTGGGGCA TCTCGACGTT CACATACGAC TATGCCGGCT TCGGCTACGA TTTCGAGTCT GCAGTGGTCA ACGCCTTAGC CGCGCCGAGG TTCCTCGACA GAAACTACCC GGGCATTGTA ATAGTACATG CCGGCGGCAA CGGCGGCTAC GGCTTCGGCA CCATCACAAG CCCCGGCGCC GCCGTAGGGG CCATCACCGT AGGCGCCGCC ACCAGCGGAC ACTTCTGGCT AGCGATCGGC ACTCCGTATA ACGGCTTTAG GTGGGGCGAC ATAATCAGCT GGTCGCTGAG GGGGCCAACG CCAGCCGGCT ACGTTAAGCC CGATGTAGTC AACGTAGGCG CCTTCGGCAT TGCAACCTAT CCAGTGGGCT GGGGCAGATA CTACTACGGC ACTCCCGAAG ATTGGGATAC CTTTGGCGGC ACCTCGCAGG CGACGCCGCT AACAGCAGGC GTAGTTGCTC TTGTGCTCAG CGCGGTGGCC GACAGGGTTG ATCCCGCCTC TGTAGATCCC TATCTGGTTA GGCAGTTCAT CACAAGCACC GCGGTCGACA TAGGCTACAC GCCGTTTACC GCAGGCCACG GCTTTGTAAA CGCCACAGCA ACCGTTATCG CGGCGCGCTC CTACTACGGC TTGCCGGCGC CAATCGCGCC CGTGGCGTTG ATCCGTACGA ATTCTGTATT CAACCCAGAA AACTCCTGGG ATTTCCAGTG GAGGGTAAAT ATACCGCTAT ATTTCGGTTA CTTATTGAAC AACTTATTGA CCACTCAGTG GACGTCTTAT ATACAGACCT ATATCCCGCA ACCTAACATC GGTATGACAA GCCTCTACCT AAGCACGACG CCAGGCGGCC AAGCAGTGGG GCAACTTGTG GTCACTTCGA CAAATTGGGC GCTTTCTGTT AAAGCAGAGG CCTTTACGCT AACGCCGGTA TATAAGAAGT CTACGGTAAT TAGCATACCG GTTAGGGCAC TCGGAGGTTA TTTCACAATG CAACAACTCG GCTTTGATGA AAACGTGTTG AAGCAGGCCG ATCTTGTGGT GTTTAGGATG TCTTACAGCT TCTCGGCATT CGACCCAGAG TTTAACTACG CACTTAACAC GAGACCTGTT TTGTGGGTCT TCGGCTGGGC CGATCTCAAC AACGATGGAC AAATCTCCAC TAACGAGCTC ACGTGGCTGA ACTACGGCTA CCAACGCCAT ACGGTTACTG AGGTGCCAGT CTCGAAGCTA TCTGCAAAGC TCTTGCCTAA TCAGAGGCTT GTGCTAAGAG TGGATGCACG CCCGGTCGCA CCTCCATATC CCGCTACTGT GCCTGTAAAT GTTGAGGTTG TTGCGTACAA GAGGACTCCG GCACCAGATG TACAGATAAC GCCGACAAGC ATAGTGCTGA AACCGCGCCA GAGCTTCACC TTTACAGTAG TTGTTAGAGC GCCTCCAAAC GTCGCTCCCA CTGCCTACGA AAGGCTTGTT GTCCTCACGA TAAACGGCAC GAGCTACGTA GTGCCTCTCA GCTATGTTGT GAGAGCAAGC GTCCCAGTGA ACGCGGAGTA TGTCCTAACC GCCGGGAGGT CTGACAGCTG GTACAACGTC AGCGAGGTGA GGGGTGGCAA CGACTGGAGC TGGAGGTACG AGTCTGGGGA CTGGCGCGTC TACTTCATCT CCACTCCGAC ACTTGCCAGA GGGCTTTACG TGGACTTCGC GTGGAGCTGT ACCAACACGT CTATGATCGT CTACACATTG ACAGACAGCG GGTTCTTTGC AGGTTATTTA TGGAACCAGG GTGTCAGCTA CCACCAGTAT CTAGGCTCCG GGATATTTAC ATGGACAAGC ACCGGTGGCC AGGCTAGGAT TGTGGCTCTC CCCTCCGCAT CTTTCGCAGT GCCCATAAGC GTAAGCGGAT TTGCCCACAC AACTATGTCA GCCTCTTACC CCATCCGCGA GAGTAGGAGT TTTATCATTC TGGCGCGGAC TTCGCTATAC GGCGGCTGTG GCACCAGCGA GCCGATCAAG GGTGTGGTCA AGCCGTTTAT AGAGGGTAGA GATGCGCCGG TTTTAGCCAC CACGTCGCCG TTTATAAGAG TTGCGCTTAG CAAGCCGCCC ATAGACTACG ACTTCGCCGT CAAGTTCGCT TCTGTACTCG GCGGTGGGTT CGTAGTACCT ATGGCAACGA TCAGCGGCGA TTTAACCTTT AACATGTACT TCATAAGGAC CCCGTTGTTT GTAGACTACG TCGCATTGCT CTACTCATCC AGATACGCGA CGTGGTATAG GACTGGTGGG AATAACTTTG GCGGATATCC GTGGTACGCC GTTGAGGGGG TGTCTGTAAT TAGCTAA
|
Protein sequence | MRIKPILTAL VITMVILYAQ SLKTTGTNPA IFDTNAPVVA KLVVSDLNSF ASQWKAAPTV RLMKIANRDV NVVSLSINGV QLSGRVDGNV ASLYYKGPAA ALKKVVESPY VQSVYVKAMP EIPPKDFFTE VRDFVERGAG TPQPTLPLMR EIIGASKVEQ LFGVNGTGVV IAIVDTGVDY GHLDLQDALA WLIKTKDNKE IIAASIAISG TTLQYKTLRG QTASVPLDQV ASVEPLVLDA DESQVILFQG FTASGGYLPT SGKAFNVVDG PSLYDVTASC NYAVAGLTSR SGVYKFGMTY LYIPWYGGYV SVGVVMYDPE QPGVYTAARV DINNNCNFSD DTELRYFGNR LIIDNPTAPT RSLGVAGGYF FDRGLWFDFY AKFYPGWDLS GNYLSIFYDF NGHGTACSSV AAGRGKATYN LGLLGPQKLR GIAPGAKVLG VKGLWSGMVE PGMMWAAGFD VDSNGQWYWT GQKRAHVISN SWGISTFTYD YAGFGYDFES AVVNALAAPR FLDRNYPGIV IVHAGGNGGY GFGTITSPGA AVGAITVGAA TSGHFWLAIG TPYNGFRWGD IISWSLRGPT PAGYVKPDVV NVGAFGIATY PVGWGRYYYG TPEDWDTFGG TSQATPLTAG VVALVLSAVA DRVDPASVDP YLVRQFITST AVDIGYTPFT AGHGFVNATA TVIAARSYYG LPAPIAPVAL IRTNSVFNPE NSWDFQWRVN IPLYFGYLLN NLLTTQWTSY IQTYIPQPNI GMTSLYLSTT PGGQAVGQLV VTSTNWALSV KAEAFTLTPV YKKSTVISIP VRALGGYFTM QQLGFDENVL KQADLVVFRM SYSFSAFDPE FNYALNTRPV LWVFGWADLN NDGQISTNEL TWLNYGYQRH TVTEVPVSKL SAKLLPNQRL VLRVDARPVA PPYPATVPVN VEVVAYKRTP APDVQITPTS IVLKPRQSFT FTVVVRAPPN VAPTAYERLV VLTINGTSYV VPLSYVVRAS VPVNAEYVLT AGRSDSWYNV SEVRGGNDWS WRYESGDWRV YFISTPTLAR GLYVDFAWSC TNTSMIVYTL TDSGFFAGYL WNQGVSYHQY LGSGIFTWTS TGGQARIVAL PSASFAVPIS VSGFAHTTMS ASYPIRESRS FIILARTSLY GGCGTSEPIK GVVKPFIEGR DAPVLATTSP FIRVALSKPP IDYDFAVKFA SVLGGGFVVP MATISGDLTF NMYFIRTPLF VDYVALLYSS RYATWYRTGG NNFGGYPWYA VEGVSVIS
|
| |