Gene Pisl_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPisl_1541 
Symbol 
ID4618015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum islandicum DSM 4184 
KingdomArchaea 
Replicon accessionNC_008701 
Strand
Start bp1405105 
End bp1408203 
Gene Length3099 bp 
Protein Length1032 aa 
Translation table11 
GC content48% 
IMG OID639784624 
ProductHD superfamily hydrolase 
Protein accessionYP_931039 
Protein GI119873032 
COG category[R] General function prediction only 
COG ID[COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) 
TIGRFAM ID[TIGR02578] CRISPR-associated protein, Csm1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.980767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.454227 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGACC TAAAAAAGGT ATTGGGGGCG CTCCTTCACG ACGTAGGTAA GCCAATAAGG 
AGAGCTGGCT TAGCAGACAG AGACAAGCTA TTGGAAAACA TAAAAGAAAT TGAAAAATCA
CAAAATATTG AACTTAGAAA TTTACATGAT ACAGCAAGTA AAATCGGAAA GAGATCTGTT
GCACATACAG AGTTCACACA ACTAATATTG AACTTGCTAC TTCCGCCAGA GGAGGCAAGT
AGAATAAAAC CTGAAAAGGA TTGCACTAAG AAGGCCGATC GGAAAGCCGC GATGGAAAGA
GTGGGGTCTG GGGATGTAAT GGATGAATTG AAAAAGCAAC TGCAAGAATC TGGCTTTAAG
TACGGTGAAA CTACTCCTAT GTTGCTACCC CAGTGGATGC TCTTGAAAAC GAAGTATTTG
AACAGCGTTG GTCTATGTAG TTCACAGAAG TCCTGGAACT ACGAAATGGG AAGAAAGGAA
TTCATAGAAA ACCTGAAAGA TATCTTTGAG ATCTTAAAAG ATCTTAATAA GTCTAATAAA
GATAAATTCT TAGAAAAATA TAAAGAATTA TTATGTTATT TATTTGATAA AGAGATCTGG
GTTCCTGTGA AGCCTCTTGA TCCGGAGTTT ATTATCAACC TCAGAGGTAT GACTTACAAG
GAGGCCCTTA ATGGAAGTGA CTATCCTGAG GTGGCTAGGA GGCTGATTTC TATGCTTGTC
TGGGCTGGAA ATGTCTATAA TAGAAGGGTT TCCTATGGAT TGATAAATAC GGTACTTAAC
ATCCTCAAGG CGACTACGAT GTTTGTGCCA TCTGCGCATT GGGCGGCTTT GGTTCCGGAC
ATCAGTCTCT ATAGCCACAG CAGACTGACG GCGGCGTTTA GTACAGTGTG TGAAATGAGA
GACTCGTTTA TTATGCTCAC TATAGACACG ATGGGGGTCC AGAGGTTTGT GGCCTCGCCG
AGAGAGCCGG CCGCCGCATC CCGCATCTTG AGGGGGCGTA GTCTCATCGT TATTCTGGCG
CTCGACGCCT TAACGAAGTA TGCCCTTCAT CTTTTCGAGG TACCAAAGAC AAATGTGCTT
GTGGAAAAGG GCGGTGCTGT TGATCTGATT CTGCCGGTAA GGAGGATGGA GGACGACAAC
TCAAAAAAGA AAAGGAGGAG AAAAGACAGC ATTTCAAAGG AGGACAGGGA GAGGTTGAAT
ACCCTTGAGG AGGTGGCAGC GAAGCTTTCT GACTTCTTGG GTAGCGACAT TCGATTCATT
GTCGCATATA CCAAACCCCG TGGCCTTAAT CATATAACAT ACCTTAAGGC ATGGCTTAGT
TTTTACGGGG CCATGGAGGG GGGAAAAACT CGCGGGAGGA GAACCTATGG CATAATGAGT
GTTTTGACGG AGCTTGAGAG GGAGCTTGCC AAAAAGAAGG CAGTTCTGAG GAGGTCTGAG
AGTTGGTGGA AGAAGGTAGG TGGCTATGAT TCGTTGACTA AGGAGGCGAT TCTTGACGGC
GATAAATTCT CCGTCAAGGT GGGCGAGGAG GATAGTGATT ATTTCGACGG CGTTGCGGGC
CCCGGCAAGC TCCAGGTTGG CGATGTGTTA GCCGGTGTTA CCCATATGAG TCTAGCTGCC
GGCTCTGCTG GTCGTGGTCT CGCTGGAGTT ATCGGGATTC ACCTGTTCGG AAGCGCCTCC
GAGGATGACG TCGAAAGTTT TGTAAAGAAA TTGAGCAGTA AGCTGTTCGG CAACTGTCGT
CACACGTGTC TTCACGGCTA CGTAGATGAG TTTGAGGTGG CCGTTGTCCC TCTGTTGCCG
GTTAGGTCGG TCTATTTATT GCTGTCCGTG AGAAACAGCC CTGTCTACGA CGTAAGAGAT
CCAAAACAGT TGGCCGCCGC CTATAGGGGC ATCGCCAAAT TGCTTGATGA ACTTACGCAG
CTCCTTAGAG AGAATGGGAA CCCCGCTACC GATGGTAGGA GGGGTTGGGG GACGTATATT
GACATAAAGA TTGTGAATGC CCCGCACGCC TTTATTCTGA CCAAGGCAGA AGACGTTGAC
GTCAAGAACG CATTTGAGGA GCTTAGTGCA AAGATTAGGG ATCTGCTGGG CTCTGCAGAT
GTGGAGATTG GGTTCGACTT TTTTGCATAT TATCACCCCG CCGTCTACAA CGCGGAAAGG
GGGAGATACG AGTTGGTAAC GATCGACGAA TATGAGATAA TTGGCCTTGC CAAGATGGAT
GTTGATAGAT TTGGCGACGT AAGGCTTATG TATGCATTCT CTCCGTCTAG GTTGGTTACC
CTGTCGGACT ACGTCAACAT GGTATTGATG GGCAAGGGGT ACCTCACCGT TGTCGATGAA
GTGAAGAGGC GTAACGAAAA CGCGAAGAGA TTTAGGCTCC TCGACGTGAT TCCCCTGTAC
GCAGGGGGGG ACGACTTGAG CTTGTATGGC AAGTGGAGCC ATCTGCTCTA CTACCTTGCC
AAGCTACACC GCTCTCTTAG GGCCGCTCTC TACCCACTTA CGCTTTCCCT ATCTGCCGCT
ATAGACCGCG ACCACGTGCC TCTCCTATAT CTATACAGAA GGGCAGTAGA GGGCTTGGAG
CTACATGCGA AAAGGTATAG AGCAGCTGGC GCTATAGAAT CAGACGTCGT CGCGCTTGAG
CCTCCTGAGT GCGGAGGCGC ACAGGTTGAT CTCAAGAACC TATATAGCGC TTGGAACTTC
CAAATCTTTG AGAAGGTGTT GGATCCATTT ACGATGCCCT ACGTGAAACT TGAGGAGTGG
ACTAGGGAGC TCTATATTCT GTCCTCGCTG GCGGCTAGAT ATGAGACGCT TGAGATCAAG
CAACATAGGG CACCGGCGAC CAGGCTACGT GAGCGAAGTC TACAGCTCAA GGTTTATTAC
GCCTACGTGT GTGTAAGGAG AAAAGACGAG CTGGAAAAGC TGATAGATAC AATGAAGCGG
CTCGGCATCG GTGAAGACCC CATCTTGCTG TACCCCAGTT CTAGAGAGCT GGATAAGGCA
CTGAAACTGC TCAGTGTAGC CAAGCCGTAC CTAGACTTAG TTCTGCTTGC AATAAGGCGT
AAAGATACGG TTCAGCCACT GGAGACAGAA ACGCCTTAG
 
Protein sequence
MPDLKKVLGA LLHDVGKPIR RAGLADRDKL LENIKEIEKS QNIELRNLHD TASKIGKRSV 
AHTEFTQLIL NLLLPPEEAS RIKPEKDCTK KADRKAAMER VGSGDVMDEL KKQLQESGFK
YGETTPMLLP QWMLLKTKYL NSVGLCSSQK SWNYEMGRKE FIENLKDIFE ILKDLNKSNK
DKFLEKYKEL LCYLFDKEIW VPVKPLDPEF IINLRGMTYK EALNGSDYPE VARRLISMLV
WAGNVYNRRV SYGLINTVLN ILKATTMFVP SAHWAALVPD ISLYSHSRLT AAFSTVCEMR
DSFIMLTIDT MGVQRFVASP REPAAASRIL RGRSLIVILA LDALTKYALH LFEVPKTNVL
VEKGGAVDLI LPVRRMEDDN SKKKRRRKDS ISKEDRERLN TLEEVAAKLS DFLGSDIRFI
VAYTKPRGLN HITYLKAWLS FYGAMEGGKT RGRRTYGIMS VLTELERELA KKKAVLRRSE
SWWKKVGGYD SLTKEAILDG DKFSVKVGEE DSDYFDGVAG PGKLQVGDVL AGVTHMSLAA
GSAGRGLAGV IGIHLFGSAS EDDVESFVKK LSSKLFGNCR HTCLHGYVDE FEVAVVPLLP
VRSVYLLLSV RNSPVYDVRD PKQLAAAYRG IAKLLDELTQ LLRENGNPAT DGRRGWGTYI
DIKIVNAPHA FILTKAEDVD VKNAFEELSA KIRDLLGSAD VEIGFDFFAY YHPAVYNAER
GRYELVTIDE YEIIGLAKMD VDRFGDVRLM YAFSPSRLVT LSDYVNMVLM GKGYLTVVDE
VKRRNENAKR FRLLDVIPLY AGGDDLSLYG KWSHLLYYLA KLHRSLRAAL YPLTLSLSAA
IDRDHVPLLY LYRRAVEGLE LHAKRYRAAG AIESDVVALE PPECGGAQVD LKNLYSAWNF
QIFEKVLDPF TMPYVKLEEW TRELYILSSL AARYETLEIK QHRAPATRLR ERSLQLKVYY
AYVCVRRKDE LEKLIDTMKR LGIGEDPILL YPSSRELDKA LKLLSVAKPY LDLVLLAIRR
KDTVQPLETE TP