Gene Information |
Locus tag | Mthe_0639 |
Symbol | |
ID | 4462279 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosaeta thermophila PT |
Kingdom | Archaea |
Replicon accession | NC_008553 |
Strand | + |
Start bp | 665958 |
End bp | 666923 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 639699647 |
Product | CRISPR-associated Csh2 family protein |
Protein accession | YP_843069 |
Protein GI | 116753951 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3649] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR01595] CRISPR-associated protein, CT1132 family; [TIGR02590] CRISPR-associated protein, Csh2 family |
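
The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final stop codon encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(665958, 666923)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (966 bp) and Protein Length (321 aa) fields.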
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00285412 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACAG TCTCGAACAG ATCTGAGCTG CTGTTCATCT ACGATATAAG AGATGGGAAT CCGAATGGAG ATCCGATGGA CGAGAACAAA CCGCGCATGG ATGAGGAGAC AGGGGTGAAC CTGGTAACCG ATGTGCGTCT CAAGAGGACG ATAAGAGATT ACCTCCACAA TTTCAAAGGG CTCGAGATAT TCGTGAGGGA GATAATCTAT GATGAGGAGA ACGGGTACAT CCAGGACGCC AAGAGGAGGG CCAAGGATTT CGGGGAAGAT CAGGAGAGGA TTCTGAGTGA GTGCATCGAC GTGAGGCTGT TCGGCGGTGT GATTCCGCTA GAGAAACGGA GACAGAACAA GCAAAAAGAT GAGGGTGAGG GATCATCCAA AGGGGATTCG ATTACCTATA CAGGACCTGT TCAGTTCAAG ATGGGGCGCT CGCTCCACAG GGTGGCCCTG AAGCACATAA AGGGCACAGG AGCATTCGCC TCGAAGGAGG GCATGACGCA GGCGACGTTC CGCGAGGAGT ATGTTCTCCC CTACTCACTG ATACTCTTCT ACGGCATAAT AAACGAGAAT GCTGCGAAGC ACACTGCTCT CACAGAGGAG GATGTGAGGC TGCTCCTCGA GGGCATGTGG AACGGGACGA AGAGCTTGAT ATCGAGAACG AAGGCAGGAC AGGTGCCCAG GCTGCTGCTC AAGGTCAACT ACAGCAAGGC GAACTACCAC ATCGGGGATC TGGACAGGAT GATAAAGCTG GTATCGGATG TGCAGCACGA GGCGATCAGA GGGCCCGAGG ATTTCTGTCT CGATGTCTCA GCGCTTGTGA GCAGGCTCAA AGCTGAGAAG GACTCGATCA GGGATCTTGA GCTCTGCTTC GACAGGCAGC TCAGGTTCGT CAGGGACGGC GCCGAGTTCT CCATGGAGAA GCTGAACGAA GCAACTGGCA TAGCGGTAAA TCCGATTGCG TTCTAG
Protein sequence | MSTVSNRSEL LFIYDIRDGN PNGDPMDENK PRMDEETGVN LVTDVRLKRT IRDYLHNFKG LEIFVREIIY DEENGYIQDA KRRAKDFGED QERILSECID VRLFGGVIPL EKRRQNKQKD EGEGSSKGDS ITYTGPVQFK MGRSLHRVAL KHIKGTGAFA SKEGMTQATF REEYVLPYSL ILFYGIINEN AAKHTALTEE DVRLLLEGMW NGTKSLISRT KAGQVPRLLL KVNYSKANYH IGDLDRMIKL VSDVQHEAIR GPEDFCLDVS ALVSRLKAEK DSIRDLELCF DRQLRFVRDG AEFSMEKLNE ATGIAVNPIA F
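
The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates a CDS with the standard codon assignments (which translation table 11 shares; table 11 differs only in its permitted start codons). All function names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
print(translate("ATGAGCACAG TCTCGAACAG ATCTGAGCTG"))  # MSTVSNRSEL
```

The printed residues match the first block of the Protein sequence field. Passing the full Gene sequence value to `translate` and `gc_content` would allow the same comparison against the complete protein and the GC content field.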