Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_0560 |
Symbol | |
ID | 3996991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | - |
Start bp | 566998 |
End bp | 568206 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637958367 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_565285 |
Protein GI | 91772593 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.00182957 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTGT TATTGCTAAA TGGTCATGGA ATCAATATGC GTGTTGATAG TGCAAAACTC CATATCAAAG ATGGCAGATT CTCAACAACT GAAGATCCTG AAGAGTATGT GTTCTCTCCT AAGAGGATGG ATATTGATAG CATTGTTGTG TATGGGAGAA GTGGATCTCT AAGCCTCGAA GCTATCAGAT GGTTGATTAA ACATAATGTA CAGATTACTA TTTTAGATTG GAACGGTAAG CTTCTAACAA CAATGCTTCC TTCTGAAAGT ACCAATGTAA AGACAAAGTT TGCTCAATAC CATGCTTATG AAGATCAGGA TACAAGAGTA AAGCTTGCTA AGAAGTTTAT TGAAGCCAAG TTCTCCAAAT CTGAGGCTGT TCTTGATTAT CTTAAACTAA GGTATCCTGA GATTAATTAT GATATCTCCG TTGATAAGGG TAAACTTGAG AATGCTAAAT CTGTAAGGGA GATCTTAGGT GTTGAAGGTG GAGTTGCTTG GAAGTACTGG AATGAGTATG CTAAAGCTAT TCCTGAAGAA TATGATTTCA GGGCAAGGAC TGATAATAAT GCCAGAGCAT CTAACTCAGG CGATAAAGTT AATGTCATGC TGAATTATGG GTATGCTTTG CTTGAATCTG AATGTTTGAG AGCCATTAAT TCAGTTGGTC TTGATGCTCA TGTAGGATTC CTTCATGAGA TGAATCCAAG TAAGAATAGT TTAGCCTATG ATCTCCAAGA ACCGTTCAGG TTTATTGTTG ATCTTACTGT AATGAACCTG ATTGAAAAGG GAATTATGGA TAGTAAGGAC TTTATCAGGA CTGAGAGTTT TTCATTGAGA TTGAGACCTA CTGGAGCAAG GAAGGTTACT GAAGAGTTTA ATTCTGTGAT GAATGGTAAA GTCGAGTATA GGAAGAAGAA TAGCTCTTGG GGATCTGTAC TATTGCTCAA GGCAAGAGAG CTGAGTCATC ATCTTGTTGG GAAGAGGAAA ACGATTGAGT TTGTTAAGCC TGTTTATATC AATGAAAGGG ATGATACTGA TCTGTTGAGG AAGACGATTA TTGATATGCC TTATACCGAG TGGAAGAAAA TGGGGTTCTC AAAGGGTACA TTGAACTATA TGAAACAGAA TGCCAATAGT GAGAAGCCGT TTACATTAAA TGTCCATGTA AGAGAGAGGT TAGAGAATTG GGGGAGTATT GTTGAATGA
|
Protein sequence | MKLLLLNGHG INMRVDSAKL HIKDGRFSTT EDPEEYVFSP KRMDIDSIVV YGRSGSLSLE AIRWLIKHNV QITILDWNGK LLTTMLPSES TNVKTKFAQY HAYEDQDTRV KLAKKFIEAK FSKSEAVLDY LKLRYPEINY DISVDKGKLE NAKSVREILG VEGGVAWKYW NEYAKAIPEE YDFRARTDNN ARASNSGDKV NVMLNYGYAL LESECLRAIN SVGLDAHVGF LHEMNPSKNS LAYDLQEPFR FIVDLTVMNL IEKGIMDSKD FIRTESFSLR LRPTGARKVT EEFNSVMNGK VEYRKKNSSW GSVLLLKARE LSHHLVGKRK TIEFVKPVYI NERDDTDLLR KTIIDMPYTE WKKMGFSKGT LNYMKQNANS EKPFTLNVHV RERLENWGSI VE
|
| |