Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0648 |
Symbol | |
ID | 4463142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 681722 |
End bp | 682798 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639699656 |
Product | CRISPR-associated Cmr3 family protein |
Protein accession | YP_843078 |
Protein GI | 116753960 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01888] CRISPR-associated protein, Cmr3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.845582 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTATC TCAGAATCAC CCCGGTGGAT TCCTGGTTCT TCAGGGACGG AAGGCCGTTC CATTTGGGAG AGGCGACATC AGACGTTGGG GGGCTCTTTC CCCCAAGCGC GTTCACTGTG GTCGGAGCGA TAAGGGCGCA CCTCGCCCGG AGCATGGGCT GGCGTGAAGG GAAGTGGAGT GAAGAGATCT GCAGGGTGCT CGGGGATGGG TATGATCTGG CTGGTCTCAG GTTCAGGGGA CCGCTGCTCT GCAGGGAGGG AGATCGCGGT CCAGAGATGC TGTATCCAGC GCCGCTGAAC CTACTCGGGA AGAGGAACGA CAGCGGGTAC GAGATGAGGC TACTCCGTCC CGGGGAGGAG GTGGAGTGCG ATCTTGGAAG GGTCAGGCTT CCCACAGCGG AGAAAATCGA TGGCATGAAG CAGCTGTCAG GATACGTAAA TGGAGATCAG CTGAAACAGG TATTGAGGGG AGAGGTTCCG GAGGGACGGG TGATCCCGAG GGAGGAGTTA TGGGGGACGG AGTACGCTGT CGGTCTTGAG AGAGAAAGGG ATACAAGAAC AGCTAAGGAA GCGCATCTTT ACTCGATAAA CAGGATCAGG CTTGTGCGCG GAGTGCATCT GATCATGGGC GTTGAGGGGA TCGATGAGGG CCTCCTGGAA AAGCTGGACG GCGCGGTGAT GCCCATCGGC GGAGAGGGGC GGATGGGGTG TGTCGAGATG CTCAGCCCCG GAAAAGGCGT CTCTGAGATC AGGCAGGAGA TCGGGCCGAT CGACGGACGG GTCAGATTCA CGCTGGTGCA TATCACCCCG GCGTTTCTGG GGAGGTGGCC CCGTCCGGGA GAGAGCATCC CCGGGGTTCC TGGGGAGGTC GTATCAGCCT GCGTGGGGCG AGCCCTTCGC ATCGGTGGCT GGGATTCTGT TAACAGGAGA CCTGTGGAGC TCAAGCCGTT CATTCCACCG GGATCCGTCT GGTTCTGCGA GGCTGAGGCG GATGAGCTGA ACGATGTTAT GAGCGTGAGC AGGATAGGAG AGTACACAGG ATTTGGTTTT GGAGAGATTG CTCTAGGCAT TTGGTGA
|
Protein sequence | MIYLRITPVD SWFFRDGRPF HLGEATSDVG GLFPPSAFTV VGAIRAHLAR SMGWREGKWS EEICRVLGDG YDLAGLRFRG PLLCREGDRG PEMLYPAPLN LLGKRNDSGY EMRLLRPGEE VECDLGRVRL PTAEKIDGMK QLSGYVNGDQ LKQVLRGEVP EGRVIPREEL WGTEYAVGLE RERDTRTAKE AHLYSINRIR LVRGVHLIMG VEGIDEGLLE KLDGAVMPIG GEGRMGCVEM LSPGKGVSEI RQEIGPIDGR VRFTLVHITP AFLGRWPRPG ESIPGVPGEV VSACVGRALR IGGWDSVNRR PVELKPFIPP GSVWFCEAEA DELNDVMSVS RIGEYTGFGF GEIALGIW
|
| |