Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pisl_1541 |
Symbol | |
ID | 4618015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum islandicum DSM 4184 |
Kingdom | Archaea |
Replicon accession | NC_008701 |
Strand | - |
Start bp | 1405105 |
End bp | 1408203 |
Gene Length | 3099 bp |
Protein Length | 1032 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639784624 |
Product | HD superfamily hydrolase |
Protein accession | YP_931039 |
Protein GI | 119873032 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02578] CRISPR-associated protein, Csm1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.980767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.454227 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGACC TAAAAAAGGT ATTGGGGGCG CTCCTTCACG ACGTAGGTAA GCCAATAAGG AGAGCTGGCT TAGCAGACAG AGACAAGCTA TTGGAAAACA TAAAAGAAAT TGAAAAATCA CAAAATATTG AACTTAGAAA TTTACATGAT ACAGCAAGTA AAATCGGAAA GAGATCTGTT GCACATACAG AGTTCACACA ACTAATATTG AACTTGCTAC TTCCGCCAGA GGAGGCAAGT AGAATAAAAC CTGAAAAGGA TTGCACTAAG AAGGCCGATC GGAAAGCCGC GATGGAAAGA GTGGGGTCTG GGGATGTAAT GGATGAATTG AAAAAGCAAC TGCAAGAATC TGGCTTTAAG TACGGTGAAA CTACTCCTAT GTTGCTACCC CAGTGGATGC TCTTGAAAAC GAAGTATTTG AACAGCGTTG GTCTATGTAG TTCACAGAAG TCCTGGAACT ACGAAATGGG AAGAAAGGAA TTCATAGAAA ACCTGAAAGA TATCTTTGAG ATCTTAAAAG ATCTTAATAA GTCTAATAAA GATAAATTCT TAGAAAAATA TAAAGAATTA TTATGTTATT TATTTGATAA AGAGATCTGG GTTCCTGTGA AGCCTCTTGA TCCGGAGTTT ATTATCAACC TCAGAGGTAT GACTTACAAG GAGGCCCTTA ATGGAAGTGA CTATCCTGAG GTGGCTAGGA GGCTGATTTC TATGCTTGTC TGGGCTGGAA ATGTCTATAA TAGAAGGGTT TCCTATGGAT TGATAAATAC GGTACTTAAC ATCCTCAAGG CGACTACGAT GTTTGTGCCA TCTGCGCATT GGGCGGCTTT GGTTCCGGAC ATCAGTCTCT ATAGCCACAG CAGACTGACG GCGGCGTTTA GTACAGTGTG TGAAATGAGA GACTCGTTTA TTATGCTCAC TATAGACACG ATGGGGGTCC AGAGGTTTGT GGCCTCGCCG AGAGAGCCGG CCGCCGCATC CCGCATCTTG AGGGGGCGTA GTCTCATCGT TATTCTGGCG CTCGACGCCT TAACGAAGTA TGCCCTTCAT CTTTTCGAGG TACCAAAGAC AAATGTGCTT GTGGAAAAGG GCGGTGCTGT TGATCTGATT CTGCCGGTAA GGAGGATGGA GGACGACAAC TCAAAAAAGA AAAGGAGGAG AAAAGACAGC ATTTCAAAGG AGGACAGGGA GAGGTTGAAT ACCCTTGAGG AGGTGGCAGC GAAGCTTTCT GACTTCTTGG GTAGCGACAT TCGATTCATT GTCGCATATA CCAAACCCCG TGGCCTTAAT CATATAACAT ACCTTAAGGC ATGGCTTAGT TTTTACGGGG CCATGGAGGG GGGAAAAACT CGCGGGAGGA GAACCTATGG CATAATGAGT GTTTTGACGG AGCTTGAGAG GGAGCTTGCC AAAAAGAAGG CAGTTCTGAG GAGGTCTGAG AGTTGGTGGA AGAAGGTAGG TGGCTATGAT TCGTTGACTA AGGAGGCGAT TCTTGACGGC GATAAATTCT CCGTCAAGGT GGGCGAGGAG GATAGTGATT ATTTCGACGG CGTTGCGGGC CCCGGCAAGC TCCAGGTTGG CGATGTGTTA GCCGGTGTTA CCCATATGAG TCTAGCTGCC GGCTCTGCTG GTCGTGGTCT CGCTGGAGTT ATCGGGATTC ACCTGTTCGG AAGCGCCTCC GAGGATGACG TCGAAAGTTT TGTAAAGAAA TTGAGCAGTA AGCTGTTCGG CAACTGTCGT CACACGTGTC TTCACGGCTA CGTAGATGAG TTTGAGGTGG CCGTTGTCCC TCTGTTGCCG GTTAGGTCGG TCTATTTATT GCTGTCCGTG AGAAACAGCC CTGTCTACGA CGTAAGAGAT CCAAAACAGT TGGCCGCCGC CTATAGGGGC ATCGCCAAAT TGCTTGATGA ACTTACGCAG CTCCTTAGAG AGAATGGGAA CCCCGCTACC GATGGTAGGA GGGGTTGGGG GACGTATATT GACATAAAGA TTGTGAATGC CCCGCACGCC TTTATTCTGA CCAAGGCAGA AGACGTTGAC GTCAAGAACG CATTTGAGGA GCTTAGTGCA AAGATTAGGG ATCTGCTGGG CTCTGCAGAT GTGGAGATTG GGTTCGACTT TTTTGCATAT TATCACCCCG CCGTCTACAA CGCGGAAAGG GGGAGATACG AGTTGGTAAC GATCGACGAA TATGAGATAA TTGGCCTTGC CAAGATGGAT GTTGATAGAT TTGGCGACGT AAGGCTTATG TATGCATTCT CTCCGTCTAG GTTGGTTACC CTGTCGGACT ACGTCAACAT GGTATTGATG GGCAAGGGGT ACCTCACCGT TGTCGATGAA GTGAAGAGGC GTAACGAAAA CGCGAAGAGA TTTAGGCTCC TCGACGTGAT TCCCCTGTAC GCAGGGGGGG ACGACTTGAG CTTGTATGGC AAGTGGAGCC ATCTGCTCTA CTACCTTGCC AAGCTACACC GCTCTCTTAG GGCCGCTCTC TACCCACTTA CGCTTTCCCT ATCTGCCGCT ATAGACCGCG ACCACGTGCC TCTCCTATAT CTATACAGAA GGGCAGTAGA GGGCTTGGAG CTACATGCGA AAAGGTATAG AGCAGCTGGC GCTATAGAAT CAGACGTCGT CGCGCTTGAG CCTCCTGAGT GCGGAGGCGC ACAGGTTGAT CTCAAGAACC TATATAGCGC TTGGAACTTC CAAATCTTTG AGAAGGTGTT GGATCCATTT ACGATGCCCT ACGTGAAACT TGAGGAGTGG ACTAGGGAGC TCTATATTCT GTCCTCGCTG GCGGCTAGAT ATGAGACGCT TGAGATCAAG CAACATAGGG CACCGGCGAC CAGGCTACGT GAGCGAAGTC TACAGCTCAA GGTTTATTAC GCCTACGTGT GTGTAAGGAG AAAAGACGAG CTGGAAAAGC TGATAGATAC AATGAAGCGG CTCGGCATCG GTGAAGACCC CATCTTGCTG TACCCCAGTT CTAGAGAGCT GGATAAGGCA CTGAAACTGC TCAGTGTAGC CAAGCCGTAC CTAGACTTAG TTCTGCTTGC AATAAGGCGT AAAGATACGG TTCAGCCACT GGAGACAGAA ACGCCTTAG
|
Protein sequence | MPDLKKVLGA LLHDVGKPIR RAGLADRDKL LENIKEIEKS QNIELRNLHD TASKIGKRSV AHTEFTQLIL NLLLPPEEAS RIKPEKDCTK KADRKAAMER VGSGDVMDEL KKQLQESGFK YGETTPMLLP QWMLLKTKYL NSVGLCSSQK SWNYEMGRKE FIENLKDIFE ILKDLNKSNK DKFLEKYKEL LCYLFDKEIW VPVKPLDPEF IINLRGMTYK EALNGSDYPE VARRLISMLV WAGNVYNRRV SYGLINTVLN ILKATTMFVP SAHWAALVPD ISLYSHSRLT AAFSTVCEMR DSFIMLTIDT MGVQRFVASP REPAAASRIL RGRSLIVILA LDALTKYALH LFEVPKTNVL VEKGGAVDLI LPVRRMEDDN SKKKRRRKDS ISKEDRERLN TLEEVAAKLS DFLGSDIRFI VAYTKPRGLN HITYLKAWLS FYGAMEGGKT RGRRTYGIMS VLTELERELA KKKAVLRRSE SWWKKVGGYD SLTKEAILDG DKFSVKVGEE DSDYFDGVAG PGKLQVGDVL AGVTHMSLAA GSAGRGLAGV IGIHLFGSAS EDDVESFVKK LSSKLFGNCR HTCLHGYVDE FEVAVVPLLP VRSVYLLLSV RNSPVYDVRD PKQLAAAYRG IAKLLDELTQ LLRENGNPAT DGRRGWGTYI DIKIVNAPHA FILTKAEDVD VKNAFEELSA KIRDLLGSAD VEIGFDFFAY YHPAVYNAER GRYELVTIDE YEIIGLAKMD VDRFGDVRLM YAFSPSRLVT LSDYVNMVLM GKGYLTVVDE VKRRNENAKR FRLLDVIPLY AGGDDLSLYG KWSHLLYYLA KLHRSLRAAL YPLTLSLSAA IDRDHVPLLY LYRRAVEGLE LHAKRYRAAG AIESDVVALE PPECGGAQVD LKNLYSAWNF QIFEKVLDPF TMPYVKLEEW TRELYILSSL AARYETLEIK QHRAPATRLR ERSLQLKVYY AYVCVRRKDE LEKLIDTMKR LGIGEDPILL YPSSRELDKA LKLLSVAKPY LDLVLLAIRR KDTVQPLETE TP
|
| |