Gene Amuc_0086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0086 
Symbol 
ID6275009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp112700 
End bp114601 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content59% 
IMG OID642612131 
ProductDNA mismatch repair protein MutL 
Protein accessionYP_001876712 
Protein GI187734600 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.243153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCATCCA TCCACGTCAT GTCTCCCACC CTTGCCAGCC AAGTGGCGGC AGGAGAAGTA 
GTGGAGCGCC CCGCCTCCGT TGTGAAGGAA CTGGTGGAAA ACAGCCTGGA TGCAGGAGCC
AAATTCGTAC GGGTGGAAAT ACGCCGCGGA GGCGTGGGCA TGATCAAGGT GACGGACGAC
GGCAGCGGCA TGTCACGGGC AGACGCCGAG CTGTGCACCA AACGGCACGC CACCAGCAAA
CTTTCTTCCC TGGAGGAGCT TTTTGAAATC ACACACCTGG GATTCCGCGG AGAGGCTCTG
CCCAGCATTG CCAGCGTTTC CCGCTTCAAA CTCTGTACCA GGCAGCAGCA GGATCTGGAA
GGCTGGGAAA TACGCATTGA CGGAGGCCTG GAACACGAAC CGAGAAGCTC AGGCGTCTCT
CCCGGCACCG CCATTGAGGT GGCCGACTTG TTTTACAATA CTCCAGCGCG CCGCAAGTTC
CTCAAATCCG CAGAAACGGA AGCTTCCCAT GTGGAGCACC AGATACGCCT GCATGCCCTG
GCTTATCCCC AAGTACGGTT TGCCTATAAA CGGGACGACC AGCTTGTTTT CGACCTTCCC
GCTACAGCGG ATCTGCGCGT CCGCATTTCC GCACTGACGG ATGCTGCCAC AGCGGCGGCC
CTGATTCCGA TAGAAACGAC CATCGGCCCC GGTATTTCCA TCACGGGATT CCTGCTTCCT
CTTTCCGAAG CCAGACGCAC CAGAAAAGGG CAGTACGTCT TTATGAACAC GCGCCCCGTG
GAGGACCAGC TCATTAACAG GGCCATTCGG GACGGCTACG GAGGCTTCCC CACCGGACTG
CATCCGGCGC TTTTTTTGTA TATGGAGGTG GAACCCGCTC TGGTGGACGT CAATGTGCAC
CCCGCCAAGA AGGAAGTGAG ATTCCGCCGT TCCGCAGACG TGGTAAACAC CATTGTGGAA
GCCATAGCCA ACACTCTTCA AAAGCATGCC CGGCAGGAAA TCCACGCTGC CGCCGCGCCG
GAGCCGGAGA GAACTCTTCC TCCTGCCCAT TCTACCACCG CGCCATATGG GGAAATACCC
GCCCGTTCCA CCAACCCCGG TTCTGCTTTC CCGGCGGCCG CCAGACCGGC CCCTGCTTCT
TCCGCCGCCC AACCGCCGCT TTCCTCCTCT GCCAAACAAT CCCATGGGGC TGTCCCTCCC
CCCACCCTCC GGGCCATTCC CCTGAAACAG GTTCCCGCCA CCCAGGGAAA ACTGGATTTT
CACCGTCAGG AGGATGAAGA GACGGCACGA AACGCCCATG AAAACGCAGC TCTGGAAAGG
GATGCCTCCG CCGGATTTTC CTATCTGGGA ACACTCCGCC AGCAATTCGC CCTGTTTGAA
ACGCCGGAAG GCCTGGTTCT GATGCATCCC AAGGCGGCCC GGGAACGCAT CATATTTGAA
CGGCTGCGCG CACGCCGGGA AGCCCCCATG CCGTCCCAGC AGCTTCTGGA TCCGGTGGTG
CTGGATCTGG ACCCGCGGGA TTTTGCCGTC ATCCGGCAGT TCGCCCCGCA TTTTGACCAG
GCCGGCATGG CCGTTACGCC CTTTGGACAG AATACAATCA GAATAGAATC CATCCCCGCA
CTGCTGGAAC TGGAAAACGC ACGCGCTTTT CTCCTGGAGC TGGTGGACCG TCTCACCCAG
TCCGAATTCA GCCGAAATGC CAAACGCGTG GCTTATGAGA CCTTCATTGG GGAATTTGCC
AGAAAATCCG CCTGGAGGGA GCGCATTTCC CCTCACCGGG CCCCTGCCAT CCTAAAGGAT
TTGCTTGCCT GTGAAGTGCC GTACTGCACC CCGGGTGGCA AACCCACGCT GGTGAATTAT
TCCGTTCCGG AAATTAAACG TAAATTCGGC CTACAGGCAT AA
 
Protein sequence
MPSIHVMSPT LASQVAAGEV VERPASVVKE LVENSLDAGA KFVRVEIRRG GVGMIKVTDD 
GSGMSRADAE LCTKRHATSK LSSLEELFEI THLGFRGEAL PSIASVSRFK LCTRQQQDLE
GWEIRIDGGL EHEPRSSGVS PGTAIEVADL FYNTPARRKF LKSAETEASH VEHQIRLHAL
AYPQVRFAYK RDDQLVFDLP ATADLRVRIS ALTDAATAAA LIPIETTIGP GISITGFLLP
LSEARRTRKG QYVFMNTRPV EDQLINRAIR DGYGGFPTGL HPALFLYMEV EPALVDVNVH
PAKKEVRFRR SADVVNTIVE AIANTLQKHA RQEIHAAAAP EPERTLPPAH STTAPYGEIP
ARSTNPGSAF PAAARPAPAS SAAQPPLSSS AKQSHGAVPP PTLRAIPLKQ VPATQGKLDF
HRQEDEETAR NAHENAALER DASAGFSYLG TLRQQFALFE TPEGLVLMHP KAARERIIFE
RLRARREAPM PSQQLLDPVV LDLDPRDFAV IRQFAPHFDQ AGMAVTPFGQ NTIRIESIPA
LLELENARAF LLELVDRLTQ SEFSRNAKRV AYETFIGEFA RKSAWRERIS PHRAPAILKD
LLACEVPYCT PGGKPTLVNY SVPEIKRKFG LQA