Gene Information |
Locus tag | Mbur_1127 |
Symbol | |
ID | 3997734 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | + |
Start bp | 1211168 |
End bp | 1212169 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637958895 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_565803 |
Protein GI | 91773111 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1; [TIGR03641] CRISPR-associated endonuclease Cas1, HMARI/TNEAP subtype |
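The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the database schema.

```python
# Values copied from the record above.
start_bp = 1211168
end_bp = 1212169

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed above.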
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.168006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGCAG ATTATTACAT ATTGCAGGAT GGGATTTTAA AGAGAAAGGA AAATACGGTC TATTTTGTGA ATAAGGATGA GAAGAGAGTT TTACCTATCA ATAAAATATA TTCAATTTAT GCATACGGTA AGTTATCCTT TTCATCAGGT GTAGTTTCAT ACCTTTCAAA AAATGGTATT CCGATACATT TTTTTAATTA TTATGGTTTT TATGAAGGAA GCATGTATCC ACGAGAGACG CTCATTAGTG GCGATCTTGT AATTCATCAA GCATCTCATT ATCTTGATAG TGAAAAAAGG ATGCTTTTGG CAGGAAAGTT CATTGAAGGT GCATGTGGTA ACATTCTTAA GAATTTAAAG TATTATTCAA GAACAAAGGA GGATTGTCAA GATGCGATGA ATTCCTATGT TAGTTCGATA GAATCGGAAT TAAGTCGCTT ACCAAATGCG GATTCCATTC CTAAAATGAT GAATGTTGAA GGGCGTATGA GATATATCTA TTACAATGCA TTGGATGAGA TATTTCCGGA AGATTATAGA ATTGTGACAA GAACAAGACG TCCTCCTGGA AATAAAATGA ATACTCTTAT TAGTTTTGGA AATTCGTTGA TGTATACGAC TGTTCTTTCT GAGATTTATA ATACTCAATT AAATCCAACT ATTTCTTATC TTCATGAACC TTTTGAACGA CGTTTTTCTC TTGCTCTTGA TGTGAGCGAA ATATTTAAGC CCATAATAAT TGATCGAATT ATTCTTAAAC TTGTTAATAA GAATATGTTA GATGATAATT GTTTTATGGG TGAGATCGGT GATATGCTGT TGAGTGAAAA AGGCAAGAAG ATATTCTTAC AAGAATATAA TTCTAAATTA AGTACTACTA TTAAACACAA AGGGTTAAAG CGAAATGTTT CTTATAAAAG GCTAATTAGG CTGGAACTTT ATAAATTATC AAAGCATGTG ATTGAAGACG AAGAATATGT TCCTCTTGTG ATGTGGTGGT AA
Protein sequence | MRADYYILQD GILKRKENTV YFVNKDEKRV LPINKIYSIY AYGKLSFSSG VVSYLSKNGI PIHFFNYYGF YEGSMYPRET LISGDLVIHQ ASHYLDSEKR MLLAGKFIEG ACGNILKNLK YYSRTKEDCQ DAMNSYVSSI ESELSRLPNA DSIPKMMNVE GRMRYIYYNA LDEIFPEDYR IVTRTRRPPG NKMNTLISFG NSLMYTTVLS EIYNTQLNPT ISYLHEPFER RFSLALDVSE IFKPIIIDRI ILKLVNKNML DDNCFMGEIG DMLLSEKGKK IFLQEYNSKL STTIKHKGLK RNVSYKRLIR LELYKLSKHV IEDEEYVPLV MWW
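The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon assignments used are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
gene_start = "ATGCGAGCAG ATTATTACAT ATTGCAGGAT"
print(translate(gene_start))
print(round(gc_fraction(gene_start), 3))
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is one way to confirm that the two entries are consistent with each other.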