Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0346 |
Symbol | |
ID | 6274971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 407251 |
End bp | 409722 |
Gene Length | 2472 bp |
Protein Length | 823 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642612397 |
Product | DNA mismatch repair protein MutS |
Protein accession | YP_001876966 |
Protein GI | 187734854 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | [TIGR01070] DNA mismatch repair protein MutS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0580477 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.279118 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGGATC AGTACCTTCG CATGAAGAAG GGCTTGCCGG AAGACGTGCT GCTGTTTTTC CGGCTGGGGG ATTTTTACGA AATGTTTTTT GAGGACGCCA AGGAGGCCTC CGCCATTTTG GGCCTGACGC TGACCAAGCG CCACGGTATT CCCATGTGCG GGGTGCCCCA CCACAGCGCG GAAGGGTATA TCGGACGGCT GGTGAAGGGA GGAAAGCGGG TGGCTATTGC GGAGCAGACC ACCATTCCGC AGCCGGGCAA GCTGGTGGAG CGGGAACTGA CCCGCGTGAT TTCCGCCGGA ACCCTGGCGG ATATGAATTT GCTGGATTCC TCCCGCCATA ATTACATTGT GGCGTTGTAC AAGGACAAGA AGCGCTTCGG CCTGGCATGC GTGGACCATA CCACGGGGGA ATTTTCCGTG GCCCAGTTCG AACACATGGA TTTGCTTCTG GACGAGCTGT CCCGCATCAA TCCCTCCGAA CTCCTGGTCA GCGACGAGCA GACGGACTGC TTCCCCGGAA CCCACCCCAC GCTTTATTAC GACGGATATA CTTTTCTGCC TTCCACGGCC ATTCCCAATT TGCTGAACCA TTTCCGGGTG CATTCCCTGG ACGGTTTCGG CTGCGGGGAG ATGACCGCCG CTCTGGGGGC GTCCGGCGCG GTGCTCCATT ATCTGGGCTA CCAGCTCCGC CGCCCCACGG ACCACCTGCG CCGCATTTCC GTGCGCGCCA CGGAGAACGC CGTGCTGATT GACCAGGCCA GCCAGAGGAA TCTGGACCTG GTGGATTCCC GCGGCGGCGT GAAGCTTTCC CTGCTGGGGA CCCTGGACAG GACGAGCACC CCCATGGGCG CGCGCAAACT CCGGGACTGG CTGCTCCACC CCCTGTGTGA TCTGGAAAAG CTCCTGGCGA GGCAGGAGGT GATCGCCGTT CTGCTTCAGG AACCCTACCT GATGAGCAAG CTGAGGGAGA GCCTGAAGAA TGTGCGGGAC ATGGAGCGGC TGACGGGGCG CATTTCCCAG GGCGCCGGGA ATGCCCGCGA CCTCCAGGCG CTGGCTTCCT CCCTGGCGCG CATTCCCGCG CTCAGGGATG ATCTGGAATC CCTGCCCGGC GGCGGGGACA TGCTGGAGAG CATCCGTTCC CGCATGGGCT GCTTTGATGA GCTGGTGGAT TTGCTCCAGC GCGCTCTGGT GGATGAACCT CCCGTGACCA TCAAGGAGGG AGGCATCATC AGGGAAGGGT ACCATGCCGG TCTGGATGAA TTGCGTCTGG CTTCCCGCGA CGGGAAGGAG TGGCTGGCAC GGCTGCAGGA GAAGGAGCGC AAGCGCACGG GAATTGATTC CCTGAAAATC CGCTTCAATA ACGTCTTCGG CTATTACATT GAAGTAACGA AGAGCCATTA CGATAAAGTG CCCCCGGATT ACCAGCGCAA GCAGACGCTG GTGAATGCGG AGCGCTTTAT TACCCCGGAA CTCAAGCAGA TGGAGAATAC CATCTTGGGG GCGGACGAGC GTTCCCGCCA GGTGGAGTAT GAGCAGTTCC TCCTGTTGCG CGAGGAAGTG GGGCGCCACA TTGACGATAT CCAGATTACG GCGGATGCCA TGGCGGACCT GGACGTGCTG CTGGGGCTGG CGGAGGGGGC CCAGCAGTAC CGGTATTGCC GCCCCGTTCT GGACAATTCC ATGACCCTGC GCATTGTCAA TGGCCGTCAT CCCGTTATTG AGCAGAATGT TTCCGGCGAT GTGTTCGTTC CCAATGACGC TTTTCTGGAA CCGGAGGAAA ACCGCCTTAT TCTGCTGACC GGGCCCAATA TGGCGGGCAA GAGCACCTAT ATCCGCCAGG TGGCCCTGAT TACGCTGATG GCCCAGATTG GAGCGTATGT GCCGGCGGAG TCCGCCCATA TCGGTTTGGT GGACCGCATT TTCTGCCGCG TGGGAGCCAG CGACGACCTG GCGCGCGGCC AGTCCACCTT TATGGTGGAG ATGAGCGAAA CCTCCCTGAT TCTGAATAAT GCCACGGAGC GCTCCCTGAT TATTCTGGAT GAAATCGGGC GAGGCACGGC CACTTTTGAC GGGCTTTCCA TTGCCTGGGC CGTGGCGGAG TACCTGCATG ACGAGTTGAA GTCCCGCACC CTGTTCGCCA CGCATTATCA TGAGCTGACG GATTTGGCCA ATTCCAGGCA GGGCGTGCAG AATTACAATG TGGCCGTGCG CGAGTGGAAG GAGGAAATCG TGTTTCTGCG CAAGATCGTG CCGGGGGCGG CGGATAAGTC CTACGGCATC CAGGTGGCCC GACTGGCCGG CATGCCTGCC GTCATTGTGG ACCGCGCCAA GGCCATCCTG TCCCATCTGG AGATGAATTC CACGCGCCCC CGGAGGAAGG AGCGCTCCCG GCTGGCGGAA CCGAGAGCCA AGAATACGGA TATGGAAGAC GATATGCCTG CCGGGGAATA TGCTCAGCTG GAGTTGTTCT GA
|
Protein sequence | MMDQYLRMKK GLPEDVLLFF RLGDFYEMFF EDAKEASAIL GLTLTKRHGI PMCGVPHHSA EGYIGRLVKG GKRVAIAEQT TIPQPGKLVE RELTRVISAG TLADMNLLDS SRHNYIVALY KDKKRFGLAC VDHTTGEFSV AQFEHMDLLL DELSRINPSE LLVSDEQTDC FPGTHPTLYY DGYTFLPSTA IPNLLNHFRV HSLDGFGCGE MTAALGASGA VLHYLGYQLR RPTDHLRRIS VRATENAVLI DQASQRNLDL VDSRGGVKLS LLGTLDRTST PMGARKLRDW LLHPLCDLEK LLARQEVIAV LLQEPYLMSK LRESLKNVRD MERLTGRISQ GAGNARDLQA LASSLARIPA LRDDLESLPG GGDMLESIRS RMGCFDELVD LLQRALVDEP PVTIKEGGII REGYHAGLDE LRLASRDGKE WLARLQEKER KRTGIDSLKI RFNNVFGYYI EVTKSHYDKV PPDYQRKQTL VNAERFITPE LKQMENTILG ADERSRQVEY EQFLLLREEV GRHIDDIQIT ADAMADLDVL LGLAEGAQQY RYCRPVLDNS MTLRIVNGRH PVIEQNVSGD VFVPNDAFLE PEENRLILLT GPNMAGKSTY IRQVALITLM AQIGAYVPAE SAHIGLVDRI FCRVGASDDL ARGQSTFMVE MSETSLILNN ATERSLIILD EIGRGTATFD GLSIAWAVAE YLHDELKSRT LFATHYHELT DLANSRQGVQ NYNVAVREWK EEIVFLRKIV PGAADKSYGI QVARLAGMPA VIVDRAKAIL SHLEMNSTRP RRKERSRLAE PRAKNTDMED DMPAGEYAQL ELF
|
| |