Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Emin_0244 |
Symbol | |
ID | 6262916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Elusimicrobium minutum Pei191 |
Kingdom | Bacteria |
Replicon accession | NC_010644 |
Strand | + |
Start bp | 264709 |
End bp | 265605 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642610708 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001875143 |
Protein GI | 187250661 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGCGAG TTTTGGATAT CCCCGGGGAC GGGTACCATT TATGCGTAAA AAACAATAAC TTCTCCGCAG TAAAAGACAG AGAGGAAAAA CTGCATTGTT TATTTGACGA TATAAACAGC ATTATACTTT ACGGTAATAA TATTACCATT TCCAATACTT GCATACAAAA ATGTTTAGAG CATAAAGTAC CGGTCATCTT CTGCGATAAA ACCTATAACC CCGCCGGAAT GCTGCTTTCT TCTTTTACCA CAAATATTTA CGGACGCAGA CTCCAGTTAC AAATAAATGC CTCAAAACCA CAAATAAAAC AAGCCTGGCA ACAAATAATC ACAAGTAAGT TAAACAACCA AGCTGAGGTG TTAAAAAGAT TTGACACGCT TAAGGCGGCG GAAACCATTT TTAATATGGC CCGCGAGGTG CGCTCTGGCG ATGCTACTTT TAAAGAAGGT GTCGGCGCAA AGGTATATTT TGAAAATTTA TTTAATGATT TTCATAGAAA TACCGACGAT AAGGATATTA TAAATTCAGC GTTAAATTAT GGCTATGCGA TTGTTAGAAG TTCTATTGCG CGGGCGGTTG TTTCCGCCGG ATTAAATCCC GCCATCGGTA TTTTCCACAG TAAGAACCAT AATCCGTTTT GTTTAATAGA TGATTTGATA GAACCACTGC GTCCTCTTAT AGATTTTATG GTAAAAAATA AATTGGATGT TTTGACGCAA GAGGAAAGTC TGTCGCCTTC GGCTAAAAAA TATATGGCAA GCGTGATAGA AAGTAACTTG TATTTTGAGG ATGGTGCCTT TAATCTTACG GCCGGGATAC AAAAATATAT CCAGTCGTAT ATCGCGTTTT TGGAAGAACG GGAAAACAGG ATAATTTTCC CGGCAATTTT AAAATGA
|
Protein sequence | MWRVLDIPGD GYHLCVKNNN FSAVKDREEK LHCLFDDINS IILYGNNITI SNTCIQKCLE HKVPVIFCDK TYNPAGMLLS SFTTNIYGRR LQLQINASKP QIKQAWQQII TSKLNNQAEV LKRFDTLKAA ETIFNMAREV RSGDATFKEG VGAKVYFENL FNDFHRNTDD KDIINSALNY GYAIVRSSIA RAVVSAGLNP AIGIFHSKNH NPFCLIDDLI EPLRPLIDFM VKNKLDVLTQ EESLSPSAKK YMASVIESNL YFEDGAFNLT AGIQKYIQSY IAFLEERENR IIFPAILK
|
| |