Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mthe_0638 |
Symbol | |
ID | 4462278 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 664105 |
End bp | 665961 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639699646 |
Product | CRISPR-associated Csh1 family protein |
Protein accession | YP_843068 |
Protein GI | 116753950 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02556] CRISPR-associated protein, TM1802 family [TIGR02591] CRISPR-associated protein, Csh1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0097245 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGATCC GTGTGATCGA GGCAGTAGCA AGAATTGGTA AGCAAAGAAT TGGTGAGCAT GCTCAGAGCG GTTGTGAGGA TATCCTGTAT TACATTGTGG AGAATCCAAA TCTCAATGGC GGTTACAACC ATGCTCTTGT TGTCACACTC GAGGAGAAGG ACGGAGATTT CTCCTACAGA GGGGTGGAGC TGGAGGAGCT CAAGGATTAC AGAAGATACC TGTACAAGGG GAAGAAGGGG AATGCCACCG ATGCCACCCC GACATGCAAG ATCGCAAAGG ATATCGAGAA AACATTCGAG AAAAAATTTC TGAGATGGTT TGATGGCATT GATTCTTCAG ATCTAAGCGG GGAAGAAAGA GCCACGCTGA AAAATATAAA GAATGTGTTA TTCTCAAACA AAGATAAAAT CTTTAAGGAG CTACAGGAGA AGCGATCTCA TCTCAAACAG AGAGAGAACT GCATAATAAC CCTTGGATTT GTGAAGGATG GTGATCTTAA ATACCTGGCG GATTATCCGG CGTTCAGAAA CGTTCTCCTC AAAAATAGCT GCGATCGGTT CTTCCGTAAA TACGGCGGCG AGTCCAGAGG AACGGATGCG CTCTGCTCTG TCTGCAAGGA GCAAAAGGAT GAGGTTTACG CATACGCCAT CCCCTGGCCG TTCCACACAT TCGACAAGCC CGGATTCATA GCAGGCGGGT TCAGGCAGTC TGATGCATGG AAGAACACGC CCGTCTGCTT AAACTGTGCG ATAAATCTCG ATGCGGGAAA GAAGTACATT GAGGAGAGCC TTGACTTCAG CTTCTACGGC TTCAGGTACC TTCTCATACC CAAGCTCATA ATCGGCGATG ACTACCAGGA GATCCTGGAT ATTCTGAGTG GTATGAAGAA AGAACTGAAG ATGAGCCGGA AGACCAGGAA CCGCATAACC GACGACGAGG ATGAGATCCT GGATATGGTG AAGGATCAGA AGGACTTCTT CAGCAACAGC CTGATGTTTT ATAAAAAGGA TAACTCCGCT TACAGGATAC TCCTTCACAT CGATGGCATT CTCCCATCCA GGTTGAGAAG GCTGTTCGAG GCCAAGGAGA GGGTTGAAGG TGCCTTCGGG ATATACAATG AGATGGTGCT ATCCGAAGAC AAAGGCAGGC TTATATTTGA TTTTGGCGTT CTGAGGCGGT TCTTCCCCAG GGAGTCAGGA AACGTTACAC ACGACAAGAT GTTTCTGGAA ATTGTCAACA GGATATTCGT GGGGAGGTAT GTGGATCGCC ATCTTCTGAT CTCATTCATG ATGAAGAGAA TCAGAGATGA TTTCGTGCAT GGGCGATCCA CGCTGATGAA CACGCTCAAC GGATTCATGC TTCTTCATTA TCTGAAGGAG CTGAATCTTC TGAAGGATCT TGAGGTAGAC AGAATGTCAG GAATTGTTCT CAAAAGAGAG GAGCTCGAGG CACTTCCTCT GGATAAGAGG GTTGAGAGGT TCTTCGAGGC GAACAGGGAG TTCTTTGACA GCGATGCGAA GAAGGCAACG TTTCTGGAAG GCGTGCTTGT GCAGAAGCTG CTGAACATCC AGTGGATGGA GAAGAATGCA AAGCCGTTCT ACACAAAGCT GCATGGGCTC AAGATGAACG AGGCCCTGAT CAAGAGGCTT CTTCCGGAGA TACAGAACAA GCTTGAGGAG TATGAGAAGA ACTACTACAG GGAGCTCGAG GCGATCATAG CGGAGCATTT CGTTCTTTCG GGCCGCGGGT GGAAGGAGAC AGATGATGAG CTGAGCTTCT ACTTCGTGCT CGGTATGAAC CTGCACGAGC TCTTCAGGGT GGATAAAGAG AAGGAAAAGA CGGAGGAAAC AGCATGA
|
Protein sequence | MVIRVIEAVA RIGKQRIGEH AQSGCEDILY YIVENPNLNG GYNHALVVTL EEKDGDFSYR GVELEELKDY RRYLYKGKKG NATDATPTCK IAKDIEKTFE KKFLRWFDGI DSSDLSGEER ATLKNIKNVL FSNKDKIFKE LQEKRSHLKQ RENCIITLGF VKDGDLKYLA DYPAFRNVLL KNSCDRFFRK YGGESRGTDA LCSVCKEQKD EVYAYAIPWP FHTFDKPGFI AGGFRQSDAW KNTPVCLNCA INLDAGKKYI EESLDFSFYG FRYLLIPKLI IGDDYQEILD ILSGMKKELK MSRKTRNRIT DDEDEILDMV KDQKDFFSNS LMFYKKDNSA YRILLHIDGI LPSRLRRLFE AKERVEGAFG IYNEMVLSED KGRLIFDFGV LRRFFPRESG NVTHDKMFLE IVNRIFVGRY VDRHLLISFM MKRIRDDFVH GRSTLMNTLN GFMLLHYLKE LNLLKDLEVD RMSGIVLKRE ELEALPLDKR VERFFEANRE FFDSDAKKAT FLEGVLVQKL LNIQWMEKNA KPFYTKLHGL KMNEALIKRL LPEIQNKLEE YEKNYYRELE AIIAEHFVLS GRGWKETDDE LSFYFVLGMN LHELFRVDKE KEKTEETA
|
| |