Gene Information |
Locus tag | Mthe_0649 |
Symbol | |
ID | 4463143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 682795 |
End bp | 684711 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639699657 |
Product | CRISPR-associated RAMP Crm2 family protein |
Protein accession | YP_843079 |
Protein GI | 116753961 |
COG category | [R] General function prediction only |
COG ID | [COG1353] Predicted hydrolase of the HD superfamily (permuted catalytic motifs) |
TIGRFAM ID | [TIGR02577] CRISPR-associated protein, Crm2 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATGA GCGATAGCAG TTCCGAAATA AACTTCACAA TCGGTCCTGT TCAGGGGTTC ATCGCGCAGG CGCGCCGTAC GAGAGACCTC TGGAGCGGCT CGTTTCTCCT CTCGTACCTC TCCGGATGCG CCATGGCTGA GATACGGAGA TGTGATGGCA GAATAAAGGT TCCAGATGTA GAGGGAGATC AGCTGCTCCG CTGGATCGAG AGTGATGGTG AGGGTGCACC ACCCCGGATC GGCACACTTC CCAACAGGTT CGTGGCATCT GCGGAAGACC CTGCTGGCGC TGCCAGAAGC GCCGCGGAGG GTGTGAGGAG CAGGTGGAAG AAGATCGCAG ACTGTGTTTG GAATAAATAC GTGGAGGGCG TCGCAGATCA GGGAAAGAAC ACGCGTGAGA TCTGGGAGAG GCAGGTGAAT AACTTCTGGG AGATCCACTG GACGATCGGC GCGATGGAGG GAATGGAGGC GCGCAAGAAC TGGAGGATCT GCACAGACTG GTCAGACGGT CGACCGACCA TTGAATGGGG AGATCACTGC ACGATCATGA GCGACTGGCA GGAGATCTCA GGCTATGTGA GATCCATTGA GAGGAAAAAG CAGGATGATT TCTGGAAAGA GATACGCCAG GAGACCAGCG GGATGGATCT CAGAAGCGAT GAACGGCTCT GCGCGATCGC CCTGATCAAG AGGATGTTCC CATCGGTGGC AGAGGATGCC ATAGGATGGG AGGTCTCCGC GGATCGCTGG CCCTCCACGC TCTACGTGGC AGCGATACCA TGGCTCAGTG CTGTAATTGA ATCAGCTGAG AGGGATTATG CGAACGACTA CGCGAAAAAG GCGTTTAGCT ATGCGGAGCA CATACAGAGA AAGGGTGTTG CAGAGCAGAT CTTCGGAAGG AAAGACCTGT TCCTGGAGAT CGATGCGAAC TTCTACCACA CCACCTCCCT GAAGAACCCG AAGAGCACCC CACTCAAAAT GACCCCGGAG GATGGGGATG AACCGGAGGA TGTGAGGAGA AGGCGGGAAG AGCTTATTGA ATGCCTTGAG GAGCTTTACA ATAAAAAAGG GAAACCCTCT CCATTCTACG CCATGCTCCT CATGGACGGG GACAACATGG GGAGGCTGAT CAGGGAGAGT GGAGAGGCTG TGAGCAGAGC TCTTGCATCC TTCGCAAAAG ATGTTGAGCG TGTCGTGCAC GATAACCTTG GAGTCCTGGT GTACGCGGGT GGAGACGACG TCCTCGCGAT GCTGCCTGTT GAAAGAGCGA TGAGCTGCGC GTACAGGCTC TCTGTCAGTT TCGCGGAGTC GTTCGAAGCT CAGGGGATGG AGGCGACCAT ATCAGCGGGA TTGGTCTTCG CCAGCCATCG CGTGCCGCTC CGCTCTGTTA TGCGAGAGGC GCACTCCATC CTGGACGATA TAGCGAAGGA TGAGAACGGC AGGGGAAGCA TGGCTGTCAG CGTCCTCAAG GGCAGCGGCA GGTACTGCAG ATGGGTCAGC TCCTGGAAGG GTGCTGTATC TCCAGAGAAC GAGGTCGTTC TGGAGCGGCT GGCGAAGAAT CTCTCAGAGA AACCTGACGG CATAGAGGCG TCGAGCTCGT TCTTCTACAA GTCGAGAGAG CTGCTGCTCA TGCTCGCCGG CGAGCAGAGA TGGAGCCCGG GAATGTTCTT CGATCTCTCA CATCTCGGAG GTCTGGATAT AGAGACGCTC CTGCAGGCCG AATACCTCAA CGCGCTTGAG CACAGATCTG AGATAACAGA GGAGGTACGC GACAGCGCGG TGAGATACGT GAGAGATCTT 
CTTTCTGTGA GCCGGAGGCA CCGGGGCCCT GAGCATGAGC GGGCAACTGG CAAGGAGATC TGCGCGGATG GCATGCTGCT TGTGAAGTTC CTGTCACAGA AGGGGGTTCA GGAATGA
|
Protein sequence | MSMSDSSSEI NFTIGPVQGF IAQARRTRDL WSGSFLLSYL SGCAMAEIRR CDGRIKVPDV EGDQLLRWIE SDGEGAPPRI GTLPNRFVAS AEDPAGAARS AAEGVRSRWK KIADCVWNKY VEGVADQGKN TREIWERQVN NFWEIHWTIG AMEGMEARKN WRICTDWSDG RPTIEWGDHC TIMSDWQEIS GYVRSIERKK QDDFWKEIRQ ETSGMDLRSD ERLCAIALIK RMFPSVAEDA IGWEVSADRW PSTLYVAAIP WLSAVIESAE RDYANDYAKK AFSYAEHIQR KGVAEQIFGR KDLFLEIDAN FYHTTSLKNP KSTPLKMTPE DGDEPEDVRR RREELIECLE ELYNKKGKPS PFYAMLLMDG DNMGRLIRES GEAVSRALAS FAKDVERVVH DNLGVLVYAG GDDVLAMLPV ERAMSCAYRL SVSFAESFEA QGMEATISAG LVFASHRVPL RSVMREAHSI LDDIAKDENG RGSMAVSVLK GSGRYCRWVS SWKGAVSPEN EVVLERLAKN LSEKPDGIEA SSSFFYKSRE LLLMLAGEQR WSPGMFFDLS HLGGLDIETL LQAEYLNALE HRSEITEEVR DSAVRYVRDL LSVSRRHRGP EHERATGKEI CADGMLLVKF LSQKGVQE
|
| |
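The coordinate and length fields in the Gene Information table are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in Protein Length; both assumptions agree with the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Mthe_0649.
length = gene_length(682795, 684711)
print(length)                  # 1917
print(protein_length(length))  # 638
```

Both results match the Gene Length (1917 bp) and Protein Length (638 aa) fields. Because the gene lies on the minus strand, the listed gene sequence would correspond to the reverse complement of the replicon between these coordinates; the sketch does not check that.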