Gene Amuc_0537 details


Gene Information

Locus tag: Amuc_0537
Symbol:
ID: 6275044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 632466
End bp: 633545
Gene Length: 1080 bp
Protein Length: 359 aa
Translation table: 11
GC content: 59%
IMG OID: 642612587
Product: peptidase M42 family protein
Protein accession: YP_001877156
Protein GI: 187735044
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
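
The start, end, and length fields above are consistent with each other if the coordinates are read as 1-based and inclusive, and if the protein length excludes the terminal stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that arithmetic, with hypothetical helper names `span_length` and `protein_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_len = span_length(632466, 633545)
print(gene_len)                  # 1080
print(protein_length(gene_len))  # 359
```

Both results match the Gene Length (1080 bp) and Protein Length (359 aa) fields of the record.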


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.349139
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATTA GCGACGACAG CCTTGAATTC TTAACGGAAC TTCTGGAAAC CCCCAGCCCT 
TCCGGCTTTG AAATAGATGC CCAGCGCATC TGGGCGGACG AACTGCGCAA ATATACGGAA
GACGTCCAGT GCGACACTTA CGGCAATACC TGGGCCGTCT TCCACGCGGA CGCGGAAGAA
GCCCCCACAT TGATGATTGA AGCCCACGCG GATGAAATCG GCTTCATGAT CCGCCATATC
ACCAAGGACG GCTTCCTGTA TGTGGAACGC GTAGGCGGCA CGGATACGGC CATCGCACGG
GGGCGCCGCG TGCGCTTCCT GGGTTCCCAG GGAGAAGTGA TGGGGGTGAC CGGAAACACG
GCCATCCACT TGCGGGAACC CGGAGAGAAG GAACCCAAAA TCTGGGAAAT TTACGTTGAT
GTAGGCGCCT CCTCCGACAA GGAAGTAGCG GAACTCGGTT TGCGCGTGGG CCATGTGGGC
GTTTACTGCG ACGGCCCCAT GCTGATGAAT GAAAACAAGC TGGTATGCCG GGCTCTGGAC
AACCGGCTGA GCGGCTTCAT CCTGTCGGAA ATAGCCCGCA AGCTGTGCAA GCTGAAAAAG
CCCGTCTCCT GGAACGTGGT GCTCGTCAAT GCCGTGCAGG AAGAAGTGGG CTGCATTGGC
GCGGGAATGA TTACCCACCG CCTGCGCCCG GATGCGGCTA TCTGCATAGA CGTGACTCAT
GCCACGGACT CGCCCGGACT GGACAAGGGC AAATTTGGCG ATATCAGGCT TGGCGGCGGC
CCTGCGGTCA TCCACGGCAC GGCCAACCAT CCCAATCTGG TGGCCCGTCT GGAAATCGTG
GCGGACAAGA ACAAAATACC CCTCCAGCAT GAAGCCGCCG GACGCCGCAC CGGAACGGAT
ACGGACAGTA TCTACATCTC CCGCGACGGC GTAGCCTCCG CGCTGGTGTC CGTCCCCCTG
CGCTATATGC ACTCCCCGGT GGAAACGGCC TCTCTGACAG ATGTGGAAAA TACAATCAAG
CTGCTGCTGG AATTGGTCAA ATCCCTGATG CCGGGAGACT CTTTCGGGCA CAAGCTGTAA
 
Protein sequence
MKISDDSLEF LTELLETPSP SGFEIDAQRI WADELRKYTE DVQCDTYGNT WAVFHADAEE 
APTLMIEAHA DEIGFMIRHI TKDGFLYVER VGGTDTAIAR GRRVRFLGSQ GEVMGVTGNT
AIHLREPGEK EPKIWEIYVD VGASSDKEVA ELGLRVGHVG VYCDGPMLMN ENKLVCRALD
NRLSGFILSE IARKLCKLKK PVSWNVVLVN AVQEEVGCIG AGMITHRLRP DAAICIDVTH
ATDSPGLDKG KFGDIRLGGG PAVIHGTANH PNLVARLEIV ADKNKIPLQH EAAGRRTGTD
TDSIYISRDG VASALVSVPL RYMHSPVETA SLTDVENTIK LLLELVKSLM PGDSFGHKL