Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2009 |
Symbol | |
ID | 6275770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2439325 |
End bp | 2440260 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642614068 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_001878600 |
Protein GI | 187736488 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.515668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.104859 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATACC ATATTCTAAG CATAGATGCC TATACATGCC ATCTGAGCTG CGACAAGGGC CAACTCAGAT GCGCGGATGG AGAGAATTCT CCCCGAACGA TTCCGCTGGA GGATGTGGGG GCTGTTGTGC TCAGTTCTTT TAAGGCGACG CTCACGAGCA ATTTGCTGAT AGAACTGGCC AGGAAGAGAA TAGGATTTGT GCTGTGTGAA AGCTACAGGC CTGCCGTGCT GCTCCTGCCA GCGGATCGGT CTACGGATAC CGGTCTGCTA AGACATCTGG CGGATATGCC GGCCCGTTTG CGGAACCGCC TTTGGCAAAA GACTTTGGAT GCCAAGTGTG GGAATCAGAC GGCTCTGGCC CAAGCATGGA ATCCGCATCA TCCCGCCATT GCGGAGCTGA AGAGAATGGC CGTGACGGAA AAGACGGCGA GGGAAGCAGA GTGCGCCCGC CTGTTCTGGA GCGTATTTGC GGATACATGG GCAAACTCCG ATTTTCGCAG GGGACGTCAT GAGGAGGGGT TTAATAACCT CTTCAACTAT GCGTACGCTA TTCTGTTGTC TTGCATATTG CAATATCTCT TTGCTCTGGG GCTGGATCCC TGCTTCGGCA TTTTTCATCA ATCCCGGGAA CATGCGGCGC CTTTGGCTTA TGATCTGATG GAACCCTTCA GGCCTGCCTT TGACGCCAAT GTGGCCCGTT GGATTCATTT GTGCCTGCGG GAAGGAAAAA CAGAAGAGAG AGCAGGAGAA ATCACCCGTG AGTTCAGGCA ACATATTACA GCCACCTTGC AGGCTTCTGT CATGTACCGG GATAAACAGC TGCCGTTGAA AGCGGCGGTA GAGGCCGTTT GCCGCAGTTT CCGCAAAGCA GTTCTTGCCG GACAATCCGA ACCGTATGAA CCATGGCTTA TGACAACTAT AAAATGGGCT GGCTAG
|
Protein sequence | MSYHILSIDA YTCHLSCDKG QLRCADGENS PRTIPLEDVG AVVLSSFKAT LTSNLLIELA RKRIGFVLCE SYRPAVLLLP ADRSTDTGLL RHLADMPARL RNRLWQKTLD AKCGNQTALA QAWNPHHPAI AELKRMAVTE KTAREAECAR LFWSVFADTW ANSDFRRGRH EEGFNNLFNY AYAILLSCIL QYLFALGLDP CFGIFHQSRE HAAPLAYDLM EPFRPAFDAN VARWIHLCLR EGKTEERAGE ITREFRQHIT ATLQASVMYR DKQLPLKAAV EAVCRSFRKA VLAGQSEPYE PWLMTTIKWA G
|
| |