Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1805 |
Symbol | |
ID | 4617719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | + |
Start bp | 1637236 |
End bp | 1638426 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639784889 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_931297 |
Protein GI | 119873290 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0411901 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.0459712 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAAGGA GGGTGTTGGT AATTGTAGTA TTGGCATTGG TTATATTTAT ACAAGCCGCG AGAGTGGTAG TAGGTTATGA AGACCCTACA TCTTTAGGGG CTTTAGGCGA GTTAAATAAG ACAGGCGATA TAAAGATGTT AAAACATATC AAGGAGATAA AAGCAGTTGT GTTAAATCTC CCCGATAGCA AACTGGGGGA GTTAAAAGAG AAGTTAAAGG GGGTGAGATA TATCGAGGAG GATAAGGTAG CTTGGGCGAT AGGCTTTGCA GACTATGCAG ACGTGCAGTG GAATATCAAA ATGGTGAACG CCCCTCTTGT GTGGGATACA TACTTTGTGA CAATTGGCGA TGCGGCGTTT GGCTACGGCG TAACTGTCGC CGTGTTAGAC ACAGGCATAG ACTATACACA CCCAGAGCTC TACGGGAAGG TTGTATACTG CATATATACA GTGGGGGTTC GCTTATATAA AGGCACAAAT CTCAAGAACT GTGCAGATAG AAACGGCCAC GGGACACATG TAGCTGGTAT AATCGCCGCC TCGCTGGATA ACGTGGGCGT GGCTGGAGTT GCGCCAAAGG TAAGGCTGAT AGCTGTAAAA GTTCTAAACG ACGCGGGCTC TGGCTACTAC AGCGATATCG CCGAAGGCAT TGTCGAAGCC GTTAAAGCAG GCGCCAGGAT ACTTTCTATG TCTCTCGGCG GCCCTACAGA CTCCTCAGTG TTGAGAGACG CATCGTATTG GGCGTATCAA CAGGGCGTGG TGCAGGTGGC TGCGGCTGGG AATTCTGGTG ATGGGGACTC TGCTATTGAC AACGTGGCGT ATCCGGCTAG GTACAGCTGG GTTATTGCTG TCGCCGCTGT TGACCAAAAC TACGCGGTCC CCACTTGGTC GAGCGACGGC CCTGAGGTAG ACGTGGCTGC CCCCGGCGTG GATATCCTAT CTACATACCC CGGCGGGAGA TATGCATATA TGTCAGGAAC CTCCATGGCA ACTCCACACG TAACCGGCGT CGTGGCGTTA ATCCAAGCAG TTAGGACAGC ATACGGCCTT AGGCCTCTGA CGCCGGACGA GGTATACCAA GTTTTGACCT CTACTGCCAA AGATATAGGC CCGCCGGGCT TCGACGTCTA CAGCGGCTAT GGGCTTGTCG ATGCATACGC GGCTGTCACT GCCGCGCTGA AAATAGGATA G
|
Protein sequence | MTRRVLVIVV LALVIFIQAA RVVVGYEDPT SLGALGELNK TGDIKMLKHI KEIKAVVLNL PDSKLGELKE KLKGVRYIEE DKVAWAIGFA DYADVQWNIK MVNAPLVWDT YFVTIGDAAF GYGVTVAVLD TGIDYTHPEL YGKVVYCIYT VGVRLYKGTN LKNCADRNGH GTHVAGIIAA SLDNVGVAGV APKVRLIAVK VLNDAGSGYY SDIAEGIVEA VKAGARILSM SLGGPTDSSV LRDASYWAYQ QGVVQVAAAG NSGDGDSAID NVAYPARYSW VIAVAAVDQN YAVPTWSSDG PEVDVAAPGV DILSTYPGGR YAYMSGTSMA TPHVTGVVAL IQAVRTAYGL RPLTPDEVYQ VLTSTAKDIG PPGFDVYSGY GLVDAYAAVT AALKIG
|
| |