Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0864 |
Symbol | |
ID | 5054947 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 767660 |
End bp | 768535 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640468424 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001153101 |
Protein GI | 145591099 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGATATAA GCGGGCTAGT GGAGAAAGTA GCTGAGTCAG TGGTCGCGGT GGTTACAAGA CCCTACGAGG CCTTCCCAGG CGACTTCGGC TTCGGCACCG CCTTTGCAAT AGACACAAGG TTCTTCGCCA CGGCATACCA CGTGGTGGTC TCAGCGGAGG AGATGGCGCT GGTAACTCCC GAGGGAGAAA AGGCAGAGGG GCGTGTCGTC GTGGCTGACC CCGCTGAGGA CGTGGCGTTG ATATATTCAG AACTCCGCGC CCCGCCGCTC CGCATGGGTA GCGCCCTGAG GCTCAAGGTA GGGCAAGGCG TCGTGGCCAT AGGCTACCCC CTAGCCTTGC TGGATAAGCC CACCGCCACC TTCGGCATAG TAAGCGCGGT GGGCAGAACG CTGAGGGCGG GGGACCGGGT TTTTGAGTAC CTCATACAGA CAGACGCCGC TATCAACCCC GGCAACTCCG GCGGCCCCCT CGTGAATATG GAAGGCGAGG CTGTGGGGGT AAACTCCGCC ATAATAGCCG GCGCCCAGAA CTTAGGCTTC GCAGTCCCGA TAGACATCGT TAGGATAGCA TATGAAATGT ACAGAAAATA CGGCAAGTAT GTGCGGCCAG CGCTGGGCAT CTACCTGGCA ACATTGAACA AAGCTGCGGC ATCTCTCTAC GGCATTCCCG TAGAGAAGGG CCTTTTAGTA GTCGACGTAG TGCCGGGGTC GCCCGCCGAG GAGATCGGCA TTGAGAGAGG AGACGTGATA ATTAGGGTGG ACGGGAGGGA AGTGCACAAC GTCTTCGAGC TAAGGCTCCA CGTGGCGGAG GCCGTGATAA ACAGGAGGAG GCCGTCGTTT GAGGTCTGGC GCCGGGGCAG GAGAGTAGAG CTCTAG
|
Protein sequence | MDISGLVEKV AESVVAVVTR PYEAFPGDFG FGTAFAIDTR FFATAYHVVV SAEEMALVTP EGEKAEGRVV VADPAEDVAL IYSELRAPPL RMGSALRLKV GQGVVAIGYP LALLDKPTAT FGIVSAVGRT LRAGDRVFEY LIQTDAAINP GNSGGPLVNM EGEAVGVNSA IIAGAQNLGF AVPIDIVRIA YEMYRKYGKY VRPALGIYLA TLNKAAASLY GIPVEKGLLV VDVVPGSPAE EIGIERGDVI IRVDGREVHN VFELRLHVAE AVINRRRPSF EVWRRGRRVE L
|
| |