Gene Amuc_0346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0346 
Symbol 
ID6274971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp407251 
End bp409722 
Gene Length2472 bp 
Protein Length823 aa 
Translation table11 
GC content60% 
IMG OID642612397 
ProductDNA mismatch repair protein MutS 
Protein accessionYP_001876966 
Protein GI187734854 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID[TIGR01070] DNA mismatch repair protein MutS 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0580477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.279118 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGATC AGTACCTTCG CATGAAGAAG GGCTTGCCGG AAGACGTGCT GCTGTTTTTC 
CGGCTGGGGG ATTTTTACGA AATGTTTTTT GAGGACGCCA AGGAGGCCTC CGCCATTTTG
GGCCTGACGC TGACCAAGCG CCACGGTATT CCCATGTGCG GGGTGCCCCA CCACAGCGCG
GAAGGGTATA TCGGACGGCT GGTGAAGGGA GGAAAGCGGG TGGCTATTGC GGAGCAGACC
ACCATTCCGC AGCCGGGCAA GCTGGTGGAG CGGGAACTGA CCCGCGTGAT TTCCGCCGGA
ACCCTGGCGG ATATGAATTT GCTGGATTCC TCCCGCCATA ATTACATTGT GGCGTTGTAC
AAGGACAAGA AGCGCTTCGG CCTGGCATGC GTGGACCATA CCACGGGGGA ATTTTCCGTG
GCCCAGTTCG AACACATGGA TTTGCTTCTG GACGAGCTGT CCCGCATCAA TCCCTCCGAA
CTCCTGGTCA GCGACGAGCA GACGGACTGC TTCCCCGGAA CCCACCCCAC GCTTTATTAC
GACGGATATA CTTTTCTGCC TTCCACGGCC ATTCCCAATT TGCTGAACCA TTTCCGGGTG
CATTCCCTGG ACGGTTTCGG CTGCGGGGAG ATGACCGCCG CTCTGGGGGC GTCCGGCGCG
GTGCTCCATT ATCTGGGCTA CCAGCTCCGC CGCCCCACGG ACCACCTGCG CCGCATTTCC
GTGCGCGCCA CGGAGAACGC CGTGCTGATT GACCAGGCCA GCCAGAGGAA TCTGGACCTG
GTGGATTCCC GCGGCGGCGT GAAGCTTTCC CTGCTGGGGA CCCTGGACAG GACGAGCACC
CCCATGGGCG CGCGCAAACT CCGGGACTGG CTGCTCCACC CCCTGTGTGA TCTGGAAAAG
CTCCTGGCGA GGCAGGAGGT GATCGCCGTT CTGCTTCAGG AACCCTACCT GATGAGCAAG
CTGAGGGAGA GCCTGAAGAA TGTGCGGGAC ATGGAGCGGC TGACGGGGCG CATTTCCCAG
GGCGCCGGGA ATGCCCGCGA CCTCCAGGCG CTGGCTTCCT CCCTGGCGCG CATTCCCGCG
CTCAGGGATG ATCTGGAATC CCTGCCCGGC GGCGGGGACA TGCTGGAGAG CATCCGTTCC
CGCATGGGCT GCTTTGATGA GCTGGTGGAT TTGCTCCAGC GCGCTCTGGT GGATGAACCT
CCCGTGACCA TCAAGGAGGG AGGCATCATC AGGGAAGGGT ACCATGCCGG TCTGGATGAA
TTGCGTCTGG CTTCCCGCGA CGGGAAGGAG TGGCTGGCAC GGCTGCAGGA GAAGGAGCGC
AAGCGCACGG GAATTGATTC CCTGAAAATC CGCTTCAATA ACGTCTTCGG CTATTACATT
GAAGTAACGA AGAGCCATTA CGATAAAGTG CCCCCGGATT ACCAGCGCAA GCAGACGCTG
GTGAATGCGG AGCGCTTTAT TACCCCGGAA CTCAAGCAGA TGGAGAATAC CATCTTGGGG
GCGGACGAGC GTTCCCGCCA GGTGGAGTAT GAGCAGTTCC TCCTGTTGCG CGAGGAAGTG
GGGCGCCACA TTGACGATAT CCAGATTACG GCGGATGCCA TGGCGGACCT GGACGTGCTG
CTGGGGCTGG CGGAGGGGGC CCAGCAGTAC CGGTATTGCC GCCCCGTTCT GGACAATTCC
ATGACCCTGC GCATTGTCAA TGGCCGTCAT CCCGTTATTG AGCAGAATGT TTCCGGCGAT
GTGTTCGTTC CCAATGACGC TTTTCTGGAA CCGGAGGAAA ACCGCCTTAT TCTGCTGACC
GGGCCCAATA TGGCGGGCAA GAGCACCTAT ATCCGCCAGG TGGCCCTGAT TACGCTGATG
GCCCAGATTG GAGCGTATGT GCCGGCGGAG TCCGCCCATA TCGGTTTGGT GGACCGCATT
TTCTGCCGCG TGGGAGCCAG CGACGACCTG GCGCGCGGCC AGTCCACCTT TATGGTGGAG
ATGAGCGAAA CCTCCCTGAT TCTGAATAAT GCCACGGAGC GCTCCCTGAT TATTCTGGAT
GAAATCGGGC GAGGCACGGC CACTTTTGAC GGGCTTTCCA TTGCCTGGGC CGTGGCGGAG
TACCTGCATG ACGAGTTGAA GTCCCGCACC CTGTTCGCCA CGCATTATCA TGAGCTGACG
GATTTGGCCA ATTCCAGGCA GGGCGTGCAG AATTACAATG TGGCCGTGCG CGAGTGGAAG
GAGGAAATCG TGTTTCTGCG CAAGATCGTG CCGGGGGCGG CGGATAAGTC CTACGGCATC
CAGGTGGCCC GACTGGCCGG CATGCCTGCC GTCATTGTGG ACCGCGCCAA GGCCATCCTG
TCCCATCTGG AGATGAATTC CACGCGCCCC CGGAGGAAGG AGCGCTCCCG GCTGGCGGAA
CCGAGAGCCA AGAATACGGA TATGGAAGAC GATATGCCTG CCGGGGAATA TGCTCAGCTG
GAGTTGTTCT GA
 
Protein sequence
MMDQYLRMKK GLPEDVLLFF RLGDFYEMFF EDAKEASAIL GLTLTKRHGI PMCGVPHHSA 
EGYIGRLVKG GKRVAIAEQT TIPQPGKLVE RELTRVISAG TLADMNLLDS SRHNYIVALY
KDKKRFGLAC VDHTTGEFSV AQFEHMDLLL DELSRINPSE LLVSDEQTDC FPGTHPTLYY
DGYTFLPSTA IPNLLNHFRV HSLDGFGCGE MTAALGASGA VLHYLGYQLR RPTDHLRRIS
VRATENAVLI DQASQRNLDL VDSRGGVKLS LLGTLDRTST PMGARKLRDW LLHPLCDLEK
LLARQEVIAV LLQEPYLMSK LRESLKNVRD MERLTGRISQ GAGNARDLQA LASSLARIPA
LRDDLESLPG GGDMLESIRS RMGCFDELVD LLQRALVDEP PVTIKEGGII REGYHAGLDE
LRLASRDGKE WLARLQEKER KRTGIDSLKI RFNNVFGYYI EVTKSHYDKV PPDYQRKQTL
VNAERFITPE LKQMENTILG ADERSRQVEY EQFLLLREEV GRHIDDIQIT ADAMADLDVL
LGLAEGAQQY RYCRPVLDNS MTLRIVNGRH PVIEQNVSGD VFVPNDAFLE PEENRLILLT
GPNMAGKSTY IRQVALITLM AQIGAYVPAE SAHIGLVDRI FCRVGASDDL ARGQSTFMVE
MSETSLILNN ATERSLIILD EIGRGTATFD GLSIAWAVAE YLHDELKSRT LFATHYHELT
DLANSRQGVQ NYNVAVREWK EEIVFLRKIV PGAADKSYGI QVARLAGMPA VIVDRAKAIL
SHLEMNSTRP RRKERSRLAE PRAKNTDMED DMPAGEYAQL ELF