Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tneu_0563 |
Symbol | |
ID | 6165902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | - |
Start bp | 514713 |
End bp | 517409 |
Gene Length | 2697 bp |
Protein Length | 898 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641667716 |
Product | CRISPR-associated RAMP Crm2 family protein |
Protein accession | YP_001793951 |
Protein GI | 171185032 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02577] CRISPR-associated protein, Crm2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.406769 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00098252 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGACTTTT CAAAAAAGGC GGCTCTCCTG CTGTCGAGCC CCCCGCACTG GGAGGAGGAC GGCCAGGTAA GAGAGCTCGT CGAGAGGGTG CGCAGGGCCC TGGGGGCCGA CGCCGGCGGG GACCTCGTCG AGAGGATTAG GGAGTACGTG TGGGGCGGCA TGCCGAGAGG CGGCGCTCCA TACGGCTACA TACACAACGT CTTCTCCCCC ACGCTGAAGA GGGAGCTGAG GAAGCCGCCG GCGGAGGAGG TCAGGGCGTA CTGGAGGGAT CTCGCCGAGG TTGTCGAGGG CGCCGGCGGT AAATACCACG TCTTCTACGC CGCCTACGAG GCGCTTTGGA TCAAGAGGGG GCTGTCGGCA AGGGTGGCCA ACCCCCTGGC GCCCTTCTAC GACGCCTTCG ACGAGGCCTA CGCCATGGCC ACGCTCGCCA ACTGGTTTTC CGACGGCGGA GAGCCCTCGG GGTACTTCGT AAACGCCGAC GTGCCGGGGG TCCAGGACTT CGTGGGGGCC GGGAGGAAGG CCGGCGACTT CTGGGCCGGC AGCTGGGCCC TCTCCATTGC CGTCTGGCTG ACCGCCTGGC CTTTCGTCGA GAAGTACGGC CCCGACGTAC TGCTGAGGCC AACCGCGAGG CTCAACCCGT ACTACTTCTA CTTCATCCGC GGGAGCTCCC AGAAGCTCGA CAAGCATCTC AGAGACGTCA TGCGCGAGGT CGGCTTCACG CCCCCAGAGC CGGCGTGGAT AATGCAGCCC CTCATCGGCG AGAAGATCCA GCTAGTGCTC CCCCGCAGGT GGAGTAGCGA GGAGGAGGTG GTGAGGGAGG TCGTGGAGGG GTTTAAAAAG GCGCTGGACT GCCTCACCGG CCTCGCGAGG GGGCACGTCT GCGACCTGCT GAGGGGCCAC ATAGACGTAA TTCCGAGCGG CGGCTGCTTT GACGCCCTGT ATAGGCTTGG GGACTACGCA GAGGTGAGGC TCCCGCTTAG GGTCACAGTC ATTGACATAG GGGAGTACTA CCGGGAGCTT AGGGATAAGT ACTGCGCCGG CGCCGACCAC TGCCCCGTTC TAAAAATCGC CTTTTTCGAC GAGCTACTAC GGAACTCGCA CGCAGCCGTA GGCTTTCAGA TGTCTCTAGA GAGGCAGCGG AGGAGGGTGC CCACGGCCGG GCCCTACTTC CACCCCGCCG CCTTCGACGA GGTGAAGCCG GCCTTCAAGA CGCCGGGGCC CGCCGCCTCT ACACACGTGT TTTTCAACGA GAAGACGTGG CGCGTCTGTA CAGTCTGCGG CTCCGAGCCG GCGGTCGTTG GGCTTAGGAA GGTGAGAACC CCCACGGGGA AGGAGGACTA CAACCCAGAC GATCTAGAGG CCTTTATAAA CGCGCTGGGG GCCAACTGCG AAGACTTCGA GAGGCATCTA AGGAGGGTCA TACGCCCCGG GGAGTTCCTG GGGCCCAGGT GCCTCGCCAA AAGGCTGATA TACCTCAGGG CGAAGCCCTC CAAGCAACCA GGCGAGGAGC CGTTGCCTGA GGAGAAGCTC CAGAGGTTTG AAAGCACGGA GGACGTGGCC GTGGCTGTGG TGGCCAAGGT GGGAGAGGCG TTGGAGAGGG AGGCCCAGAA GATTAAAGAC GGGAGGAGTG GGGCGTGTCT CAAGGTCGCG GACTATCTAA CGAAGGGCGG GGGGCGCGAC CTGGAGATGG TGTGGGGAAC CGCCGAGGAC ATGAAGAGGG AGTGGGAGGA GTGTATCAAG GCCCTTGAAA AGGCGGAGGT GGACCTCGGC GGGTTGGCGG CGGGGGCCGG ACTCGCGGGG TGGCCGCACA GCGCCGCGCT GGGGCCCAGG CTCTTCTACG CAGTCGTCAG GGGAGACGGC GACAGCATGG GCGAGCTCCT CGACGGGAGG CTGCCGGCTA GGTGGTATGA AGAATACGAG AGGGCGCTCG GCGGGGGGCT GGCCGGCGTG GAGGAGTACT TCAAGGCCGT GAAGAGCTAC CTTAAGCGCC ACGTGGGGGA GGATGCGGAG CAGACAGTGC CCATAACTCC CCTCTACCGC GTCGCCATAA ACAGATCGCT GGTTGTGACG TCGCTTAGAG ACTGGGCGGC CGTGGAGGGG GCCTCCGGAA TGTTGATATA CGCAGGAGGC GACGACGTGG TGGCCCTCGC GCCTGTGGAA AAGGCGCTGG ACGTGGTGGC GCAGACCAGG CAGAACTTCT GGGGGGGCGG CTTCCACAGC GTCGGCGGCT ACCACATACC ACAGCTGGCG GCCTACGGGA GGAGCTACGC CGTGAGGTTC GTACATGTGA TGGACATAAT GTCCATCGAG CTGAGGAAAT CCCACGAGGA CCTAGAACGC GCAAAAGACG CCGCGTGGGA GAGGCACAGG AAGGACTCCG TAGCCGTGGC CTCCTCGAGG ACCCAGCACG CCGCCGTCCT CCCCCTTAGA GACGTCTCCA CAGTGGACAG ACTGAAGCTC GCGTGGCTCT ACATGCTCAA AGACGCCATC AGCAAAAACG CCCCCCACGA CGCCGAGAGG AGGCTTAACG GGGACCTCGA CGCAGACGCC ACCTACAGAG TGGCCGAGCA CGTCCTCAGG AGGAACGCCA AAGACGGCGG AGCCGCCGCG CAGATAGCCT CATGGCTCCG CGGGGCGTCC CCGAGGTATA TAAGGGAGTT CTTCGGCGCA CTCGCCGTGT TGAAGGGGGT GCTATGA
|
Protein sequence | MDFSKKAALL LSSPPHWEED GQVRELVERV RRALGADAGG DLVERIREYV WGGMPRGGAP YGYIHNVFSP TLKRELRKPP AEEVRAYWRD LAEVVEGAGG KYHVFYAAYE ALWIKRGLSA RVANPLAPFY DAFDEAYAMA TLANWFSDGG EPSGYFVNAD VPGVQDFVGA GRKAGDFWAG SWALSIAVWL TAWPFVEKYG PDVLLRPTAR LNPYYFYFIR GSSQKLDKHL RDVMREVGFT PPEPAWIMQP LIGEKIQLVL PRRWSSEEEV VREVVEGFKK ALDCLTGLAR GHVCDLLRGH IDVIPSGGCF DALYRLGDYA EVRLPLRVTV IDIGEYYREL RDKYCAGADH CPVLKIAFFD ELLRNSHAAV GFQMSLERQR RRVPTAGPYF HPAAFDEVKP AFKTPGPAAS THVFFNEKTW RVCTVCGSEP AVVGLRKVRT PTGKEDYNPD DLEAFINALG ANCEDFERHL RRVIRPGEFL GPRCLAKRLI YLRAKPSKQP GEEPLPEEKL QRFESTEDVA VAVVAKVGEA LEREAQKIKD GRSGACLKVA DYLTKGGGRD LEMVWGTAED MKREWEECIK ALEKAEVDLG GLAAGAGLAG WPHSAALGPR LFYAVVRGDG DSMGELLDGR LPARWYEEYE RALGGGLAGV EEYFKAVKSY LKRHVGEDAE QTVPITPLYR VAINRSLVVT SLRDWAAVEG ASGMLIYAGG DDVVALAPVE KALDVVAQTR QNFWGGGFHS VGGYHIPQLA AYGRSYAVRF VHVMDIMSIE LRKSHEDLER AKDAAWERHR KDSVAVASSR TQHAAVLPLR DVSTVDRLKL AWLYMLKDAI SKNAPHDAER RLNGDLDADA TYRVAEHVLR RNAKDGGAAA QIASWLRGAS PRYIREFFGA LAVLKGVL
|
| |