Gene Amuc_2009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2009 
Symbol 
ID6275770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2439325 
End bp2440260 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content53% 
IMG OID642614068 
ProductCRISPR-associated protein Cas1 
Protein accessionYP_001878600 
Protein GI187736488 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03639] CRISPR-associated endonuclease Cas1, NMENI subtype 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.515668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.104859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATACC ATATTCTAAG CATAGATGCC TATACATGCC ATCTGAGCTG CGACAAGGGC 
CAACTCAGAT GCGCGGATGG AGAGAATTCT CCCCGAACGA TTCCGCTGGA GGATGTGGGG
GCTGTTGTGC TCAGTTCTTT TAAGGCGACG CTCACGAGCA ATTTGCTGAT AGAACTGGCC
AGGAAGAGAA TAGGATTTGT GCTGTGTGAA AGCTACAGGC CTGCCGTGCT GCTCCTGCCA
GCGGATCGGT CTACGGATAC CGGTCTGCTA AGACATCTGG CGGATATGCC GGCCCGTTTG
CGGAACCGCC TTTGGCAAAA GACTTTGGAT GCCAAGTGTG GGAATCAGAC GGCTCTGGCC
CAAGCATGGA ATCCGCATCA TCCCGCCATT GCGGAGCTGA AGAGAATGGC CGTGACGGAA
AAGACGGCGA GGGAAGCAGA GTGCGCCCGC CTGTTCTGGA GCGTATTTGC GGATACATGG
GCAAACTCCG ATTTTCGCAG GGGACGTCAT GAGGAGGGGT TTAATAACCT CTTCAACTAT
GCGTACGCTA TTCTGTTGTC TTGCATATTG CAATATCTCT TTGCTCTGGG GCTGGATCCC
TGCTTCGGCA TTTTTCATCA ATCCCGGGAA CATGCGGCGC CTTTGGCTTA TGATCTGATG
GAACCCTTCA GGCCTGCCTT TGACGCCAAT GTGGCCCGTT GGATTCATTT GTGCCTGCGG
GAAGGAAAAA CAGAAGAGAG AGCAGGAGAA ATCACCCGTG AGTTCAGGCA ACATATTACA
GCCACCTTGC AGGCTTCTGT CATGTACCGG GATAAACAGC TGCCGTTGAA AGCGGCGGTA
GAGGCCGTTT GCCGCAGTTT CCGCAAAGCA GTTCTTGCCG GACAATCCGA ACCGTATGAA
CCATGGCTTA TGACAACTAT AAAATGGGCT GGCTAG
 
Protein sequence
MSYHILSIDA YTCHLSCDKG QLRCADGENS PRTIPLEDVG AVVLSSFKAT LTSNLLIELA 
RKRIGFVLCE SYRPAVLLLP ADRSTDTGLL RHLADMPARL RNRLWQKTLD AKCGNQTALA
QAWNPHHPAI AELKRMAVTE KTAREAECAR LFWSVFADTW ANSDFRRGRH EEGFNNLFNY
AYAILLSCIL QYLFALGLDP CFGIFHQSRE HAAPLAYDLM EPFRPAFDAN VARWIHLCLR
EGKTEERAGE ITREFRQHIT ATLQASVMYR DKQLPLKAAV EAVCRSFRKA VLAGQSEPYE
PWLMTTIKWA G