Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmc1_3524 |
Symbol | |
ID | 4481122 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Magnetococcus sp. MC-1 |
Kingdom | Bacteria |
Replicon accession | NC_008576 |
Strand | + |
Start bp | 4405889 |
End bp | 4406914 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639724273 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_867415 |
Protein GI | 117926798 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAACCC ATTTAAACAC CCTTTTTATC AACACCCAGG GCAGCTATCT GGCCAAAAAG GGGGAGTGTA TTGATGTTCG TCAGGAGCAG CGCTCCATGG CTATGGTTCC CATCCATACG CTAGAGGGTG TGGCTTGCTT TGGGCAGGTC TCTTACTCTC CTTATTTGAT GGCCCACTGC GCCGAGCATG GTGTTAGCCT CAGCCATTTT GATGAAAGGG GTCGATTTTT GGCCGCCATG CGTGGACCCA CCAGCGGCAA TGTGCTGTTA CGCAGGCAGC AGTATCGTTG GGCCGATGAT CCCCAGCAGG CGGCCCAAAT GACCCGTTTT ATCCTGCACG CCAAACTACG CAATGGCCGC ACGGTCATGG CGCGCGCCCT GCGTGAACGC GGGCAGACAA ACACGGCCTT GGAAACGACG GTCCACGAAT TGGGGGTTTT GGTGGCGTTA CTGGATAAAA CCGATTCGGT AGAGAGTCTG CGTGGGTTAG AAGGGCGGGC GGGTGCGCTT TATTGGGGAT CGTTTCAGCA TCTTATTTAT CAGGATTCGC CGGAGTTTAA GTTCTTTGGC CGCAGTCGTC GTCCGCCGTT GGATCGGGTT AATGCCCTGC TCTCTTTCCT CTATGCCATG TTGATGCATG ATGTACGCTC GGCGTTGGAG GGGGTTGGGT TGGATCCTTA TGTGGGATTT TTGCATCAAG ATCGCCCTGG TCGGCCAGGC TTGGCGCTGG ATATGATGGA GGAGTTTCGC CCCTATCTGG CCGACCGGCT GGCTTTGACG TTGATTAATC GTGGTCAGCT CGGTGCGAAG GATTTTGAGG TGCAGGCTTC GCAGGCGACC TATCTTACCG AGGCGGGGCG AAAAAAGGTC ATCGTGGCGT ATCAAAAGCG TAAGGATGAG CAAATAACGC ACCCCTTTTT ACAGGAGCAC TGCGCCATAG GCATGGTGTG GCATTTACAG GCTCTGCTGT TGGCGCGATA TATCCGTGGG GATTTGGATG GTTATCCTGC TTTTTTGTGG CGTTAA
|
Protein sequence | MRTHLNTLFI NTQGSYLAKK GECIDVRQEQ RSMAMVPIHT LEGVACFGQV SYSPYLMAHC AEHGVSLSHF DERGRFLAAM RGPTSGNVLL RRQQYRWADD PQQAAQMTRF ILHAKLRNGR TVMARALRER GQTNTALETT VHELGVLVAL LDKTDSVESL RGLEGRAGAL YWGSFQHLIY QDSPEFKFFG RSRRPPLDRV NALLSFLYAM LMHDVRSALE GVGLDPYVGF LHQDRPGRPG LALDMMEEFR PYLADRLALT LINRGQLGAK DFEVQASQAT YLTEAGRKKV IVAYQKRKDE QITHPFLQEH CAIGMVWHLQ ALLLARYIRG DLDGYPAFLW R
|
| |