Gene Information |
Locus tag | Mthe_0651 |
Symbol | |
ID | 4463145 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | - |
Start bp | 685957 |
End bp | 687708 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639699659 |
Product | CRISPR-associated TM1812 family protein |
Protein accession | YP_843081 |
Protein GI | 116753963 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02221] CRISPR-associated protein, TM1812 family; [TIGR02549] CRISPR-associated DxTHG motif protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0409281 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTTA GTTTACTGTG CATTAACTGC CGGCTGAAGA TTTGTATGGA TCGATACATT ATGGATAATC GCGTTCTTTT GACTGCCCTG GGGCTCAACC CAAAGGAGAC GACGTACACA CTGGGGAACA GAACCTGCAA GTCCCTCCTG TCCCCTGTGG CTCTGTACAA TCTGCTTCCG GAAGATGAGC GGTTCGATAG AGTGCTGGCC CTCTGCACGG AGGAGGTTCT CGAAAAGACG TTTCCCCTGC TCAGGGAGGA GCTGCCTGTC ATGTGTGAGC CCACGCGTGT CCACAGCGGC GTCAGCGCTG GCGATCTTCA GGCGACCCTG CATAACATGC TGGAGAAGAT ACCGGAGAAG GCAGAGCTCA TCCTGGATGT GACGCACGGC TTCAGGCACT ACCCGTTTCT GTTCTTCACC GCATGCCTGT ATCTGAAGGC TCTGAAGGAT GTTAAGATCA AAAAGATCTG GTACGGAAGG TTGGATGCAA AGAACCCTGA TGGTGCAACA CCATTCATCG ATCTGAGCAT CCTCCTTGAG ATGATCGACT GGTTCCGAGT CGTTCAGTCC TTCAGGGATA TGAGCAACCC GAAGGCCCTT GCGGATAAGT TGCTTGCATA CAAGAGAGAT GTGATAAAGC CGGATACGAA CATACCTCCC TCCACTAAAA AGAAATACAG CGCGGTCTCC AGCGCTGCAA AGGAGTTCGC AGAGCTGTTC GGCATAGGCC TTCCAGTGGA GATCGGCATG GCTTCAGCAG ATCTGCAGAG GCTGATAGAA GAGCTTCGTG TGGAGGGAGA TCTTTCAGCA ATAAAGATCC TCCTGGCAGA GGATCTGCTG CTGGAGATAG AAAATGCAGC AGAGCGCTTC AGGCTGCCAG AGGGCGTTGA AAAGAAAGAT CTCATTCTCG ATATCGGGGA GATCCACAGG CAGGAGAAGC TAATAGATGC ATACTTCGAG GCAGGTTACC ACAACAACGC TATCGGGCTG ATCAGAGAGC TTATCATAAG CAGGCTAATG CTCGAAGATG AGGATGGCAG GAATAGCTGG CTCGACAAAT CACAGCGAGA GAAGATCGAA CGATGTATTG GAGCCCTCAT GGATTACAGC AAAAAGAACA GCGCAGATCT TACAAAAGAC AGGAAAGAGC TTGCTGGTAT ATGGAGCTCC GTTATCGACG CCAGAAATAA GTTGCACCAT TATGGGATGA CGAAAGATAT TGTGAGTGAT AAACAAATCG GGATCCCGAA GTGGTGGGAC GATCTGAAGC AGCGTCTCGA CGACAATTCG TTTTGGGATT GCGGTTTTGG AGGTGGCTCT GGCAGGCTTC TGATCTCGGC ACTTGGCCTT CACCCAGGCT TTCTCTATAC CGCTCTGAGG AATGTGAATC CAGAGTTATG CCTGGTGATT GCATCACACG AAACTGTGGG CAGATGGGAT GAGCTCGCCA GAATGGCCAG TTACAGTGGC GATTTCGATG TTAAGGAAAT GGATGCACAT TCATTTGACG ATGGCAAAAC CATCGTTGTC AACTCAGGGG AGCTTCTCGT CAGATTCGAT GAGGTTGTGT GCAACCTGAC GGGTGGCACA GCTGCCATGC AGTATGCGGT CATCCAGCTG GCAAGGAGGG CAGAGCGGTT GGGAAGAAAT GTCAGATGGA TCGCGGTCAT CGATAATCGT TCAATTGAGG AGCAGAATCT GAACCCTTAC GTCGAGGGGG AGGTGGTCTT TTTGGAGGAT GCAGGGATAT GA
|
Protein sequence | MGFSLLCINC RLKICMDRYI MDNRVLLTAL GLNPKETTYT LGNRTCKSLL SPVALYNLLP EDERFDRVLA LCTEEVLEKT FPLLREELPV MCEPTRVHSG VSAGDLQATL HNMLEKIPEK AELILDVTHG FRHYPFLFFT ACLYLKALKD VKIKKIWYGR LDAKNPDGAT PFIDLSILLE MIDWFRVVQS FRDMSNPKAL ADKLLAYKRD VIKPDTNIPP STKKKYSAVS SAAKEFAELF GIGLPVEIGM ASADLQRLIE ELRVEGDLSA IKILLAEDLL LEIENAAERF RLPEGVEKKD LILDIGEIHR QEKLIDAYFE AGYHNNAIGL IRELIISRLM LEDEDGRNSW LDKSQREKIE RCIGALMDYS KKNSADLTKD RKELAGIWSS VIDARNKLHH YGMTKDIVSD KQIGIPKWWD DLKQRLDDNS FWDCGFGGGS GRLLISALGL HPGFLYTALR NVNPELCLVI ASHETVGRWD ELARMASYSG DFDVKEMDAH SFDDGKTIVV NSGELLVRFD EVVCNLTGGT AAMQYAVIQL ARRAERLGRN VRWIAVIDNR SIEEQNLNPY VEGEVVFLED AGI
|
| |
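The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record. The function names (`gene_length`, `expected_protein_length`, `clean_sequence`, `gc_percent`) are hypothetical helpers introduced here for illustration. It assumes that start and end coordinates are 1-based and inclusive, that the CDS ends with a single stop codon, and that sequences are printed in space-separated blocks of ten characters as shown above.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed).

    The strand does not affect the length, only the reading direction.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS that includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_percent(grouped: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Values taken from the Start bp / End bp fields of this record.
    length = gene_length(685957, 687708)
    print(length)                           # 1752
    print(expected_protein_length(length))  # 583
```

With the values listed above, the computed gene length (1752 bp) and protein length (583 aa) agree with the Gene Length and Protein Length fields. Passing the full Gene sequence string to `gc_percent` would allow the same kind of check against the GC content field.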