Gene Pars_2355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2355 
Symbol 
ID5055818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2104907 
End bp2108683 
Gene Length3777 bp 
Protein Length1258 aa 
Translation table11 
GC content55% 
IMG OID640469906 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001154550 
Protein GI145592548 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.945162 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATAA AACCCATACT TACAGCATTG GTAATAACAA TGGTAATCCT TTATGCCCAG 
TCTCTGAAAA CTACAGGCAC AAACCCGGCG ATCTTTGACA CAAACGCGCC GGTGGTTGCC
AAACTCGTTG TTTCTGACTT AAACTCGTTT GCATCTCAGT GGAAGGCGGC CCCGACTGTA
CGATTGATGA AAATAGCAAA TAGAGATGTA AATGTCGTAT CGCTTTCCAT AAACGGCGTC
CAGTTGTCTG GTAGAGTGGA TGGCAATGTG GCATCGCTTT ACTACAAGGG GCCAGCCGCG
GCTTTGAAAA AAGTTGTGGA ATCTCCATAT GTCCAGTCCG TATACGTGAA GGCAATGCCG
GAGATACCGC CTAAGGACTT CTTTACGGAG GTGCGCGATT TTGTGGAAAG GGGGGCGGGC
ACTCCGCAAC CCACGCTCCC GCTCATGAGG GAGATAATCG GCGCCAGCAA AGTTGAACAG
CTATTCGGCG TAAACGGCAC CGGCGTGGTG ATCGCCATAG TGGACACAGG CGTCGACTAC
GGTCACCTAG ACCTCCAAGA CGCACTGGCA TGGCTGATAA AGACTAAGGA CAACAAGGAG
ATTATAGCCG CCTCAATAGC GATCTCCGGC ACCACTCTCC AGTACAAGAC GCTGAGGGGT
CAGACAGCCT CGGTACCCCT TGATCAAGTA GCCTCAGTAG AGCCGCTTGT CCTCGACGCA
GACGAGTCGC AGGTTATTCT CTTCCAGGGC TTTACGGCCT CTGGAGGTTA CTTGCCCACC
AGCGGCAAGG CATTTAATGT AGTAGACGGA CCCAGCCTCT ACGACGTCAC CGCATCTTGC
AACTACGCAG TGGCGGGTCT GACAAGTAGG AGCGGCGTGT ATAAATTTGG CATGACGTAC
TTGTACATTC CGTGGTACGG CGGCTATGTA AGTGTCGGCG TTGTGATGTA TGACCCAGAA
CAGCCGGGCG TCTACACCGC GGCCCGTGTA GATATCAACA ACAACTGCAA CTTCTCTGAC
GACACGGAGC TGAGATACTT CGGCAATAGG CTTATTATAG ACAACCCCAC GGCGCCTACT
AGGAGCCTCG GCGTCGCCGG AGGCTACTTC TTCGACCGAG GCCTTTGGTT TGACTTCTAC
GCCAAGTTCT ACCCCGGGTG GGACCTCTCC GGCAACTACC TCAGCATCTT CTACGACTTC
AACGGCCACG GCACGGCCTG TAGCTCCGTG GCCGCGGGTA GGGGCAAGGC CACGTACAAC
CTGGGCTTGC TGGGTCCACA AAAGCTGAGG GGGATCGCCC CCGGCGCCAA GGTGCTCGGC
GTTAAGGGAC TGTGGTCGGG CATGGTGGAG CCCGGCATGA TGTGGGCTGC CGGCTTCGAC
GTCGATAGCA ACGGCCAGTG GTACTGGACT GGGCAGAAGA GGGCCCACGT CATTAGCAAC
AGCTGGGGCA TCTCGACGTT CACATACGAC TATGCCGGCT TCGGCTACGA TTTCGAGTCT
GCAGTGGTCA ACGCCTTAGC CGCGCCGAGG TTCCTCGACA GAAACTACCC GGGCATTGTA
ATAGTACATG CCGGCGGCAA CGGCGGCTAC GGCTTCGGCA CCATCACAAG CCCCGGCGCC
GCCGTAGGGG CCATCACCGT AGGCGCCGCC ACCAGCGGAC ACTTCTGGCT AGCGATCGGC
ACTCCGTATA ACGGCTTTAG GTGGGGCGAC ATAATCAGCT GGTCGCTGAG GGGGCCAACG
CCAGCCGGCT ACGTTAAGCC CGATGTAGTC AACGTAGGCG CCTTCGGCAT TGCAACCTAT
CCAGTGGGCT GGGGCAGATA CTACTACGGC ACTCCCGAAG ATTGGGATAC CTTTGGCGGC
ACCTCGCAGG CGACGCCGCT AACAGCAGGC GTAGTTGCTC TTGTGCTCAG CGCGGTGGCC
GACAGGGTTG ATCCCGCCTC TGTAGATCCC TATCTGGTTA GGCAGTTCAT CACAAGCACC
GCGGTCGACA TAGGCTACAC GCCGTTTACC GCAGGCCACG GCTTTGTAAA CGCCACAGCA
ACCGTTATCG CGGCGCGCTC CTACTACGGC TTGCCGGCGC CAATCGCGCC CGTGGCGTTG
ATCCGTACGA ATTCTGTATT CAACCCAGAA AACTCCTGGG ATTTCCAGTG GAGGGTAAAT
ATACCGCTAT ATTTCGGTTA CTTATTGAAC AACTTATTGA CCACTCAGTG GACGTCTTAT
ATACAGACCT ATATCCCGCA ACCTAACATC GGTATGACAA GCCTCTACCT AAGCACGACG
CCAGGCGGCC AAGCAGTGGG GCAACTTGTG GTCACTTCGA CAAATTGGGC GCTTTCTGTT
AAAGCAGAGG CCTTTACGCT AACGCCGGTA TATAAGAAGT CTACGGTAAT TAGCATACCG
GTTAGGGCAC TCGGAGGTTA TTTCACAATG CAACAACTCG GCTTTGATGA AAACGTGTTG
AAGCAGGCCG ATCTTGTGGT GTTTAGGATG TCTTACAGCT TCTCGGCATT CGACCCAGAG
TTTAACTACG CACTTAACAC GAGACCTGTT TTGTGGGTCT TCGGCTGGGC CGATCTCAAC
AACGATGGAC AAATCTCCAC TAACGAGCTC ACGTGGCTGA ACTACGGCTA CCAACGCCAT
ACGGTTACTG AGGTGCCAGT CTCGAAGCTA TCTGCAAAGC TCTTGCCTAA TCAGAGGCTT
GTGCTAAGAG TGGATGCACG CCCGGTCGCA CCTCCATATC CCGCTACTGT GCCTGTAAAT
GTTGAGGTTG TTGCGTACAA GAGGACTCCG GCACCAGATG TACAGATAAC GCCGACAAGC
ATAGTGCTGA AACCGCGCCA GAGCTTCACC TTTACAGTAG TTGTTAGAGC GCCTCCAAAC
GTCGCTCCCA CTGCCTACGA AAGGCTTGTT GTCCTCACGA TAAACGGCAC GAGCTACGTA
GTGCCTCTCA GCTATGTTGT GAGAGCAAGC GTCCCAGTGA ACGCGGAGTA TGTCCTAACC
GCCGGGAGGT CTGACAGCTG GTACAACGTC AGCGAGGTGA GGGGTGGCAA CGACTGGAGC
TGGAGGTACG AGTCTGGGGA CTGGCGCGTC TACTTCATCT CCACTCCGAC ACTTGCCAGA
GGGCTTTACG TGGACTTCGC GTGGAGCTGT ACCAACACGT CTATGATCGT CTACACATTG
ACAGACAGCG GGTTCTTTGC AGGTTATTTA TGGAACCAGG GTGTCAGCTA CCACCAGTAT
CTAGGCTCCG GGATATTTAC ATGGACAAGC ACCGGTGGCC AGGCTAGGAT TGTGGCTCTC
CCCTCCGCAT CTTTCGCAGT GCCCATAAGC GTAAGCGGAT TTGCCCACAC AACTATGTCA
GCCTCTTACC CCATCCGCGA GAGTAGGAGT TTTATCATTC TGGCGCGGAC TTCGCTATAC
GGCGGCTGTG GCACCAGCGA GCCGATCAAG GGTGTGGTCA AGCCGTTTAT AGAGGGTAGA
GATGCGCCGG TTTTAGCCAC CACGTCGCCG TTTATAAGAG TTGCGCTTAG CAAGCCGCCC
ATAGACTACG ACTTCGCCGT CAAGTTCGCT TCTGTACTCG GCGGTGGGTT CGTAGTACCT
ATGGCAACGA TCAGCGGCGA TTTAACCTTT AACATGTACT TCATAAGGAC CCCGTTGTTT
GTAGACTACG TCGCATTGCT CTACTCATCC AGATACGCGA CGTGGTATAG GACTGGTGGG
AATAACTTTG GCGGATATCC GTGGTACGCC GTTGAGGGGG TGTCTGTAAT TAGCTAA
 
Protein sequence
MRIKPILTAL VITMVILYAQ SLKTTGTNPA IFDTNAPVVA KLVVSDLNSF ASQWKAAPTV 
RLMKIANRDV NVVSLSINGV QLSGRVDGNV ASLYYKGPAA ALKKVVESPY VQSVYVKAMP
EIPPKDFFTE VRDFVERGAG TPQPTLPLMR EIIGASKVEQ LFGVNGTGVV IAIVDTGVDY
GHLDLQDALA WLIKTKDNKE IIAASIAISG TTLQYKTLRG QTASVPLDQV ASVEPLVLDA
DESQVILFQG FTASGGYLPT SGKAFNVVDG PSLYDVTASC NYAVAGLTSR SGVYKFGMTY
LYIPWYGGYV SVGVVMYDPE QPGVYTAARV DINNNCNFSD DTELRYFGNR LIIDNPTAPT
RSLGVAGGYF FDRGLWFDFY AKFYPGWDLS GNYLSIFYDF NGHGTACSSV AAGRGKATYN
LGLLGPQKLR GIAPGAKVLG VKGLWSGMVE PGMMWAAGFD VDSNGQWYWT GQKRAHVISN
SWGISTFTYD YAGFGYDFES AVVNALAAPR FLDRNYPGIV IVHAGGNGGY GFGTITSPGA
AVGAITVGAA TSGHFWLAIG TPYNGFRWGD IISWSLRGPT PAGYVKPDVV NVGAFGIATY
PVGWGRYYYG TPEDWDTFGG TSQATPLTAG VVALVLSAVA DRVDPASVDP YLVRQFITST
AVDIGYTPFT AGHGFVNATA TVIAARSYYG LPAPIAPVAL IRTNSVFNPE NSWDFQWRVN
IPLYFGYLLN NLLTTQWTSY IQTYIPQPNI GMTSLYLSTT PGGQAVGQLV VTSTNWALSV
KAEAFTLTPV YKKSTVISIP VRALGGYFTM QQLGFDENVL KQADLVVFRM SYSFSAFDPE
FNYALNTRPV LWVFGWADLN NDGQISTNEL TWLNYGYQRH TVTEVPVSKL SAKLLPNQRL
VLRVDARPVA PPYPATVPVN VEVVAYKRTP APDVQITPTS IVLKPRQSFT FTVVVRAPPN
VAPTAYERLV VLTINGTSYV VPLSYVVRAS VPVNAEYVLT AGRSDSWYNV SEVRGGNDWS
WRYESGDWRV YFISTPTLAR GLYVDFAWSC TNTSMIVYTL TDSGFFAGYL WNQGVSYHQY
LGSGIFTWTS TGGQARIVAL PSASFAVPIS VSGFAHTTMS ASYPIRESRS FIILARTSLY
GGCGTSEPIK GVVKPFIEGR DAPVLATTSP FIRVALSKPP IDYDFAVKFA SVLGGGFVVP
MATISGDLTF NMYFIRTPLF VDYVALLYSS RYATWYRTGG NNFGGYPWYA VEGVSVIS