Gene Information |
Locus tag | Mbur_0582 |
Symbol | |
ID | 3996794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 591703 |
End bp | 592908 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 637958387 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_565305 |
Protein GI | 91772613 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
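The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


# Values taken from the Mbur_0582 record.
length = gene_length_bp(591703, 592908)
print(length, "bp")
print(protein_length_aa(length), "aa")
```

With the values in this record, the result agrees with the listed Gene Length (1206 bp) and Protein Length (401 aa).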
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0780613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGT TACTTCTAAA CGGTCATGGA ATTAACATGC ATGTTGATGG TGCTAAACTC CATATCAAAG ATGGAAGATT CTCAACAACT GAAGATCCTC AGGAGTACGT ATTCTCTCCA AAGAGGATGG ATATTGATAG TATTGTTGTA TATGGAAGAA GTGGATCTCT AAGCCTTGAA GCTATTAGAT GGTTGATTAA ACACAATGTA CAGATTACTA TGTTAGATTG GAATGGAAAG CTTCTAACTA CAATGCTTCC TTCTGAAAGT ACCAATGTAA AAACCAAGTT TGCTCAATAC CATGCTTATG AAGATTCAGA TGTAAGACTG AAGCTTGCAA AGAAATTCAT TGAGGCTAAG TTCTCTAAAT CTGAAGCTGT TCTTGATTAT TTGAAACAAA GAGATCCTGA AATTGAGTAT GATTTCTCTG ATGATAAGGC TAAACTTGAG AAAGCTAATT CTATTCGTGA TATCTTGGGT GTTGAAGGTG GAGTTGCTTG GAAGTATTGG AATGAGTATG CTAAAGCCAT TCCTGAAGAA TATGATTTCA GAGCAAGAAC TGATAATAAT GCCAGAGCTT CTAACTCAGG AGATAAAATC AATGTAATGT TCAATTATGG TTATGCATTA CTTGAATCTG AATGTATGAG AGCTATAAAT TCAGTTGGTC TTGATGCTCA TGTTGGGTTT TTGCATGAGA TGAATCCAAG TAAAAATAGT TTAGCCTACG ATCTCCAAGA ACCGTTCAGG TTTATTGTTG ATCTTGCTGT AATGAACTTG ATTGAAAAGG GAGTTATGAA TAATAAGGAC TTTATTAGGA CTGAGAGCTT TTCACTTAGG CTTAGACCAA CTGGAGCAAG GAAGGTTACT GAAGAGTTTA ATGCTATGAT GAATGGAAAG GTTGAGTACA GGAAGAAGAA CAGTTCTTGG AGTTCTGTCT TATTAGTTAA AGCAAGGGAG TTAAGCCATC AGCTTGTTGG GAAGAGAAAA ATAGTTGAGT TTAGCAAACC CGTTTATGTT GACAAGAGAG TTGATACTGA TTCTTTAAGG CAGAAGATAA TTGATATGTC TTATACTGAC TGGAAGAAGA TGGGATTCTC AAAGGGTACT TTACATTATA TGAAGCAGAA TGCCAAGAGT GATAAACCGT TTACTCTCAA TGCTCATGTA AGGGAAAGGT TGGAGAACTG GACGAGATGT TATTGA
|
Protein sequence | MKLLLLNGHG INMHVDGAKL HIKDGRFSTT EDPQEYVFSP KRMDIDSIVV YGRSGSLSLE AIRWLIKHNV QITMLDWNGK LLTTMLPSES TNVKTKFAQY HAYEDSDVRL KLAKKFIEAK FSKSEAVLDY LKQRDPEIEY DFSDDKAKLE KANSIRDILG VEGGVAWKYW NEYAKAIPEE YDFRARTDNN ARASNSGDKI NVMFNYGYAL LESECMRAIN SVGLDAHVGF LHEMNPSKNS LAYDLQEPFR FIVDLAVMNL IEKGVMNNKD FIRTESFSLR LRPTGARKVT EEFNAMMNGK VEYRKKNSSW SSVLLVKARE LSHQLVGKRK IVEFSKPVYV DKRVDTDSLR QKIIDMSYTD WKKMGFSKGT LHYMKQNAKS DKPFTLNAHV RERLENWTRC Y
|
| |
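The sequences above are printed in blocks of 10 characters, so the spaces must be removed before any analysis. The sketch below shows one way to do that and to translate the gene sequence, assuming plain Python with no external libraries. The helper names `clean_blocks`, `translate` and `gc_percent` are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs only in the start codons it permits, which this sketch does not model.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG order (shared by translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_blocks(blocked: str) -> str:
    """Remove the spaces and line breaks that separate the 10-character blocks."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
prefix = clean_blocks("ATGAAACTGT TACTTCTAAA CGGTCATGGA")
print(translate(prefix))  # MKLLLLNGHG, the first block of the protein sequence
```

Applied to the full gene sequence, the same functions should reproduce the listed protein sequence and allow the reported GC content to be checked.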