Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1118 |
Symbol | |
ID | 5055201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1007542 |
End bp | 1010178 |
Gene Length | 2637 bp |
Protein Length | 878 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640468674 |
Product | CRISPR-associated RAMP Crm2 family protein |
Protein accession | YP_001153348 |
Protein GI | 145591346 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02577] CRISPR-associated protein, Crm2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTGGT TTGACAAGGC GTGGGCCCTG CTCCACGACC CGCCCTACAA GGCCCTCTGG CCACTGGGCT ACAAGCCGCT GGGAGGGAAG ACCCACGAGG AGGAGGCCAA GCGCTTGATG GCGGCTCTCC TGGGCGGCAC CAAGCTCGGC GGCGGGGCGC CGGACGAGAG GACGTCTAGG ATAGTCGCCG CCGCTGACAG GCTTGCCTCT TCCTTCGACC GCTGGGCCCT CTCCACGGAG GGCGAGGCGA AGTACTGGGT CAAGCCGGCG GAGCTGGTCA ACCCCTTCAA CCCGGCGTAC GCGGCAAGAG TAGAGCCGCC GCCCCCGGAG CAGTTCGGGG GACGGATAGA AAGCTTCGTA AAAAACGTAA ACCGAGTAGT TAGGGAGGCC GGCGATGAGA AAGAGGCCTA CTTCGCCCTG TACGCGGTGT ACGAGCTTGC GTGGATCGAG GCCGGCCTGC CGGCCCTGCC AGCAGACACG AGGGTCCCCA CCCACACCAT ATTCGACCAC CTATACGCCA CCGCATCTGT GATGAACTGG GTCGGCGACG GCGGCGAGCC GAGGGGGGAC GCCTGCCTTC TTGAAATCGA CATCCCAGGC ATCCAGAAGG TGATCTCCTC AGCGAGGAAG GCCGGGGACT ACCGCGCCGG CAGCATGTTG GTCTCCCTGG CTATTTGGGG CACGGCGTGG AGGTACATGG ACAAACACGG CCCCGACGTG TTGCTCTCCC CATCCCCCCG CTTCAACCCC TTCCTGTACC TCCAGCTGAG GCGCCTCTAC GGCTGGGGGG AGTCGGCGCT TCGGCTGTAC AGAAAGGTGG CGGGCATGGC ACTCGGGGCG GACGTCGCGG CGCTGTTGGA CAAAACGCCC CTGGTCCCAG GCACGGCGTA TTTAGCCCTC CCCAGCTGCT CCGACGCGGA GAGGGCAGTG GAGCACTTCG AGGACGCGCT CGACGAGATA AGGGCCATGG TCCTGGGGGA GAGGGAGGCG AAGCTCCCCC TAGCGGGCTC AACGACTGGC GACGTCTTGA AAATAGCCAA GGCGGCCTTG GAGGTTGCGC CAAGGAGGTA CCTCCCAGTG AGGGTGCGCT ACGCCTCTAT CTCAGAGGCG TGGGGCGCGG CGGAGGAGGC CGCCCGGGAG GTCAGCAGGG AGGCGGGGTT CGAGGTGGAC CCCGCCAGGT TCATCTTCTG GGCCTTGATG AAAGTCTTGA AAGAAAAGCC GGCGGTGCCC CACCCAGTGG CCTGGTTCGA CAAGGGCGGC GCCCCCAGGT TTGTTAAGAG GTACGGAGGG CCGTGGATCT ACAGTAGCCT AGACCCAGAC CAGCCAGCGG TCCTGAAGCT CTCGGGCGTT GTAACCCCCC AAGGGGTTGA CTACGACGAG GAGGCCAAGG CCGCCTTGGC CCAGATCGGG GTGCAGAACC TCGCCGAGTT GGCCAAGGTC TTTAGGCCAA AGGAGGCCCT CGGCCCAGTA GACGTGCTCA AGCGGGCTCT GTACTACGGG GTGTCCAGGG ATAGGGTGGA GTCGGTGGAG GCGGTGGCCC TTAGGTGGCA CTACCGCCGT GGGTATTTCA AAAACTGCCC AGACCTGCAG CGGAGGGTGG AGGAGGTCTT GCAAGGCGCC GACGCCGAGG CGGTCTTCAC GTCGCCGGAA GCCGCCGACA AGACCCTGGC CCAGGCCCCC CAGCTCTGCG GAGGCGACCC GGCCATCCCC GCCCCCACCC TCATGTACGC CATCATCAGG GCAGACGGAG ACAACATCGG CAAGCTGATA TCCGGCTGCC TACCCCCCGC ACAGCGGCCC CCCGTCGAGG ATCGGCTTGT AGAAGGCGAC AAAGGCCAGT GGGAGGAGGA CCAGAGGCGG CTGGAAAAGC TAGAAAAGGC GGTCCAGGTC CTGGGGGAGA AGGCCAAGTG CCGGGGCGCC GGCGGCGGGG CTGAGGCGAG GTACGCGGTG CCGTCGCCTG CCTACTACGC CGCCCTCTCC GCCTCTATGA TGATAACGGC GCTGAAAGAC GCGTACATCG TGGCCAAGCA CGACGGCGAG GTGGTCTTCG CCGGCGGCGA CGACTTGCTG GCCTTCGCCC CCCTTGCCGC GGCCTTCCAC ATCGTGAAGG AGAGCAGAGA GGCCTACTGG GGAGAAGGCG GCTTCCACAA AATAGGCCCC TACTCCCTCC CAGCCCTCGC GGCCTATGGA AAAAGCTACT CGGTAAGAGC CGCCCACTCC ATTACGGACT TCATGGCCAT AGAGGTGGAG GAGGCGACCC GACTACTGGA AAAGGCCAAG GAGGCGGTCA GAGGCAAAGA CGCGCTGGCC ATCTCCACCT CCACCGGCCA CGCGGGGTTC GCCAAGGCGC GCCACGCGGC GCTTGTGGAG CAGATAGCGG AGGCCTACAG GACGGGCAAC CTCAGCAAAA ACCTCCCCTA CGACCTCGAG AGGTGGGCCG GCGACGGGCT GAGGTGCGGA GGCGGCGAGA CGTGCAGAGA GGCGGCCAGC ATAATCCTCA CCTACGTGGC CAGCCGCAAC TCCAAAAACG GCATCCCAAG CCGCTTGGAG GAGCTACTCG ACGCCGTGGC CGACGGGGCG GATGCGGCAT TGAAAAACGC CGCGGAGCTC CTAAAAGCGG CGAGGGAGTG GGCATGA
|
Protein sequence | MGWFDKAWAL LHDPPYKALW PLGYKPLGGK THEEEAKRLM AALLGGTKLG GGAPDERTSR IVAAADRLAS SFDRWALSTE GEAKYWVKPA ELVNPFNPAY AARVEPPPPE QFGGRIESFV KNVNRVVREA GDEKEAYFAL YAVYELAWIE AGLPALPADT RVPTHTIFDH LYATASVMNW VGDGGEPRGD ACLLEIDIPG IQKVISSARK AGDYRAGSML VSLAIWGTAW RYMDKHGPDV LLSPSPRFNP FLYLQLRRLY GWGESALRLY RKVAGMALGA DVAALLDKTP LVPGTAYLAL PSCSDAERAV EHFEDALDEI RAMVLGEREA KLPLAGSTTG DVLKIAKAAL EVAPRRYLPV RVRYASISEA WGAAEEAARE VSREAGFEVD PARFIFWALM KVLKEKPAVP HPVAWFDKGG APRFVKRYGG PWIYSSLDPD QPAVLKLSGV VTPQGVDYDE EAKAALAQIG VQNLAELAKV FRPKEALGPV DVLKRALYYG VSRDRVESVE AVALRWHYRR GYFKNCPDLQ RRVEEVLQGA DAEAVFTSPE AADKTLAQAP QLCGGDPAIP APTLMYAIIR ADGDNIGKLI SGCLPPAQRP PVEDRLVEGD KGQWEEDQRR LEKLEKAVQV LGEKAKCRGA GGGAEARYAV PSPAYYAALS ASMMITALKD AYIVAKHDGE VVFAGGDDLL AFAPLAAAFH IVKESREAYW GEGGFHKIGP YSLPALAAYG KSYSVRAAHS ITDFMAIEVE EATRLLEKAK EAVRGKDALA ISTSTGHAGF AKARHAALVE QIAEAYRTGN LSKNLPYDLE RWAGDGLRCG GGETCREAAS IILTYVASRN SKNGIPSRLE ELLDAVADGA DAALKNAAEL LKAAREWA
|
| |