Gene Amuc_0867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0867 
Symbol 
ID6274301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1035676 
End bp1036824 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content58% 
IMG OID642612922 
Productpeptidase M42 family protein 
Protein accessionYP_001877481 
Protein GI187735369 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.937418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones69 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTTGG AACTGCTTCG TCAGGTTTGC GTGACTCCGG GCGCTCCGGG GTTTGAAGAC 
AAAATCCGCG ATTTCATCAT CCAGGAAGTG GCCCCGCTGG TGGACGCCGT GCGCGTGGAC
AACATGGGCA GCGTGATTGC CATCGTGGAA GGCAAAAACA CGGAAAAAAC CATGATGGCC
GCCGCCCACA TGGATGAAAT CGGGTTCATG GTCCGTCACA TCGACGACAA GGGGTTCATC
AAATTCCTGC CACTGGGCGG CTTTGACGCC AAGACGCTGA CGGCCCAGCG CGTCATCGTC
CACGGTAAAA AAGACCTCAT CGGCGTCATG GGCGTGAAAC CCATCCACGT CATGTCCCCG
GCGGAACGTA CCAAGCTGCC GGAAGTGACC GACTTCTTCA TTGACCTGGG CATGAGCAAG
GAGGAAGTGG AAAAATACGT TTCCGTAGGC GACTCCGTTA CCCGTGAACG GGATTTGGTG
GAAATGGGGG ATTGCGTGAA CGTCAAATCT CTGGACAACC GCGCCGGATG CTACGTGCTG
ATTGAAGCCC TCCGCGCCAT CAAGGCTTCC AGGAAGAAAC CCTCCTGCAA CTTCGTGGCC
GCCTTTACCG TTCAGGAGGA AGTGGGCCTG AGGGGCGCGC AGGCCGGCAC GCTGGACATC
CAACCGGATT TCTCCATTGC CCTGGATGTC ACCATCGCCT GTGACATTCC CGGAACTCCG
GCGCACGACC AGGTTTCCCA CCTGGGCGCA GGCGCCGCCA TCAAGCTGTA TGACGGTTCC
GTCATTGCAG ACCGCCGCAT GGTCAAGTTC ATGAAGGCCA TGGCAGACGC CAACAAAATT
AAATGGCAGA CGGAAATGCT GCCGGCGGGA GGCACGGATG CCGGAGCCAT GCAGAAATTC
GTTCCGGGCG GTTCCATTGC CGGGGCCATT TCCGTTCCCA CCCGCAATGT GCACCAGGTT
ATTGAAATGG CTCACAAAGA CGACCTGGAC GCTTCCGTAG CGCTTCTGAC CGCCTGCGCC
ATGAACGTGG ACAAATGGGA CTGGTCCTGG AACTCCGTCA ACGAATGCCC GGCGGAAAAA
CCCGCCAAGG CCGCAAAAAC GGCAGCCAAG CCCGCCAAAG CCGCTGCCAA GAAGAAAAAG
GCCAAATAA
 
Protein sequence
MNLELLRQVC VTPGAPGFED KIRDFIIQEV APLVDAVRVD NMGSVIAIVE GKNTEKTMMA 
AAHMDEIGFM VRHIDDKGFI KFLPLGGFDA KTLTAQRVIV HGKKDLIGVM GVKPIHVMSP
AERTKLPEVT DFFIDLGMSK EEVEKYVSVG DSVTRERDLV EMGDCVNVKS LDNRAGCYVL
IEALRAIKAS RKKPSCNFVA AFTVQEEVGL RGAQAGTLDI QPDFSIALDV TIACDIPGTP
AHDQVSHLGA GAAIKLYDGS VIADRRMVKF MKAMADANKI KWQTEMLPAG GTDAGAMQKF
VPGGSIAGAI SVPTRNVHQV IEMAHKDDLD ASVALLTACA MNVDKWDWSW NSVNECPAEK
PAKAAKTAAK PAKAAAKKKK AK