Gene Pisl_1805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1805 
Symbol 
ID4617719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1637236 
End bp1638426 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content51% 
IMG OID639784889 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_931297 
Protein GI119873290 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0411901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0459712 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAGGA GGGTGTTGGT AATTGTAGTA TTGGCATTGG TTATATTTAT ACAAGCCGCG 
AGAGTGGTAG TAGGTTATGA AGACCCTACA TCTTTAGGGG CTTTAGGCGA GTTAAATAAG
ACAGGCGATA TAAAGATGTT AAAACATATC AAGGAGATAA AAGCAGTTGT GTTAAATCTC
CCCGATAGCA AACTGGGGGA GTTAAAAGAG AAGTTAAAGG GGGTGAGATA TATCGAGGAG
GATAAGGTAG CTTGGGCGAT AGGCTTTGCA GACTATGCAG ACGTGCAGTG GAATATCAAA
ATGGTGAACG CCCCTCTTGT GTGGGATACA TACTTTGTGA CAATTGGCGA TGCGGCGTTT
GGCTACGGCG TAACTGTCGC CGTGTTAGAC ACAGGCATAG ACTATACACA CCCAGAGCTC
TACGGGAAGG TTGTATACTG CATATATACA GTGGGGGTTC GCTTATATAA AGGCACAAAT
CTCAAGAACT GTGCAGATAG AAACGGCCAC GGGACACATG TAGCTGGTAT AATCGCCGCC
TCGCTGGATA ACGTGGGCGT GGCTGGAGTT GCGCCAAAGG TAAGGCTGAT AGCTGTAAAA
GTTCTAAACG ACGCGGGCTC TGGCTACTAC AGCGATATCG CCGAAGGCAT TGTCGAAGCC
GTTAAAGCAG GCGCCAGGAT ACTTTCTATG TCTCTCGGCG GCCCTACAGA CTCCTCAGTG
TTGAGAGACG CATCGTATTG GGCGTATCAA CAGGGCGTGG TGCAGGTGGC TGCGGCTGGG
AATTCTGGTG ATGGGGACTC TGCTATTGAC AACGTGGCGT ATCCGGCTAG GTACAGCTGG
GTTATTGCTG TCGCCGCTGT TGACCAAAAC TACGCGGTCC CCACTTGGTC GAGCGACGGC
CCTGAGGTAG ACGTGGCTGC CCCCGGCGTG GATATCCTAT CTACATACCC CGGCGGGAGA
TATGCATATA TGTCAGGAAC CTCCATGGCA ACTCCACACG TAACCGGCGT CGTGGCGTTA
ATCCAAGCAG TTAGGACAGC ATACGGCCTT AGGCCTCTGA CGCCGGACGA GGTATACCAA
GTTTTGACCT CTACTGCCAA AGATATAGGC CCGCCGGGCT TCGACGTCTA CAGCGGCTAT
GGGCTTGTCG ATGCATACGC GGCTGTCACT GCCGCGCTGA AAATAGGATA G
 
Protein sequence
MTRRVLVIVV LALVIFIQAA RVVVGYEDPT SLGALGELNK TGDIKMLKHI KEIKAVVLNL 
PDSKLGELKE KLKGVRYIEE DKVAWAIGFA DYADVQWNIK MVNAPLVWDT YFVTIGDAAF
GYGVTVAVLD TGIDYTHPEL YGKVVYCIYT VGVRLYKGTN LKNCADRNGH GTHVAGIIAA
SLDNVGVAGV APKVRLIAVK VLNDAGSGYY SDIAEGIVEA VKAGARILSM SLGGPTDSSV
LRDASYWAYQ QGVVQVAAAG NSGDGDSAID NVAYPARYSW VIAVAAVDQN YAVPTWSSDG
PEVDVAAPGV DILSTYPGGR YAYMSGTSMA TPHVTGVVAL IQAVRTAYGL RPLTPDEVYQ
VLTSTAKDIG PPGFDVYSGY GLVDAYAAVT AALKIG