Gene Amuc_0052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0052 
Symbol 
ID6275118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp69948 
End bp72809 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content58% 
IMG OID642612095 
ProductHyalurononglucosaminidase 
Protein accessionYP_001876679 
Protein GI187734567 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.110511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value0.10576 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGGTT TTCAACTTCT CCAATGTGGC TTGAGCGTGG CGGCCTTTCT GGCCGGCTCC 
GCTCTGGCGG CGACTCCTGC CGTATATCCG TCCCCCCAGC AGTCCAAATT CACCTCCCAG
ACGGTTGCTT TCTCCGGGAA GCCTTCCGTG ACAATACGTT CTGCCAAGGC GGGCGGGAGC
AAGCTGTTGG ACGGGGTTCC GGAAAAATCC GGAGCCTACA AGCTGGTGAT TTCTCCGCAG
GGAAAGGTGG GCATAGGAGC CCATGACGAA CGGGGGGCAT TTTACGCCAT GCAGACGCTG
CGGCAGCTGG GAACGAAGAC CGGCGGAGAA GGCGTGATTT TGCCCGTTGG GGAAATTACG
GATTGGCCGG ACATTGAATT CCGCGGAACG GTGGAAGGTT TTTACGGCAC TCCCTGGAGC
CATGAAGCCC GTCTGAGCCA GCTGCGCTTT TACGGGCAGA ACAAAATGAA CACATACATC
TACGGGCCGA AGGACGATCC CTACCATTCC TCCCCCCACT GGCGGGATCC CTATCCCGCA
GACCAGGCCG CCCAGATCAG GGAACTGGTG AAGGTGGCGA AGGAAAACCA TGTGGACTTT
GTCTGGGCCA TCCACCCCGG CAAAGACATT AAGTGGACGG AAGAAGACAT GAACAACGTC
ATCAAAAAGT TTGAGATGAT GTACAAGCTG GGCGTCCGTT CCTTTGCCGT GTTTTTTGAC
GACATCTTCG GAGAGGGCAA GCGGGGGGAC ATGCAGGCCC TTCTGCTGAA CAAAATCAAT
GACGAATTTG TCAAGGTGAA GAAAGACGTC ACTCCCCTGG TCATGTGCCC TACGGAGTAC
AACCGCGGCT GGGCCAATCC CAAGCCGGGA ACTTATCTGG ATATTCTGGG CGACCGCCTG
GACCCCTCCA TCCACGTTAT GTGGACGGGG AATTCCGTCT GCCATGACAT CACGCTGGAA
GGCCAGCAGT GGGTGAACAG GAGAATCAAG CGCCCTTCCT ACGTCTGGTG GAATTTCCCC
GTTACGGACT ACTGCCGTTC CAACCTGTGC ATGGGCCGCG TGTACGGACT TGCCACGGAA
CCGGGAGCCA GGGAATCCAT GGGCGGATTT GTCTCCAACC CCATGGATAA GCCGGAGGCC
TCCAAGGTTT CCCTCTTCGG TCTTGCGGAC TACTCCTGGA ATATCAACGG CTTTAAGTCG
GAGGAATCCT GGAAGGAGGG AGTCAGGCGC CTGTTCCCGA AGGCGGCGGA AGCCATGCAG
GTTTTTGTGA ACCACAACTC CGACCAGGGG CCTAACGGCC ACGGTTACCG GCGTGAGGAA
TCCGTGGAAA TAGAACCTGT GGTCAAGCGC GTGCTGGAAG CCGCCCGGGA AGGCAGGATA
GAGAAGGCTG ATGCGGCTCT GCTTAAAAAG GAATTCGCTC GCATGGCTTC CGCTGCGCCT
GTCATCCGCG CCAAGGCCGA CAATCCCCGG CTGATGAAGG AAATCGGCGC GTGGGTGGAC
GCCTTTGAAC AACTGGGGCG TGCTGGCCAG CATGCCGTGG CGGCGCTGGA GGAAAACAAC
GCCGGAGAGG CCGCGACCCA CCTGGTGCAG GCCACGCAGG CTTTGGCGGC CATGGATGGA
ATTTCCCGCC GCCATAACCA GGAAGGGCAG CTGTACCGTT CCGCGGTAAA GACCGGTTCC
CGGGTGATGG CCCCCGCCGT CAATGAACTG GCGGATATTG TCTCCAAAAA AGTTTTCCCG
GCCATTGCCG GCGCTCCGGC TCTTTCCCCC AAGCCTCTGG TCAAGGGGGG CAGCATGGAT
AAGGCGGAAC TCTTCTGTGA CGGGGACCGC GGCTCCTTCT GGCACTCCGG CGCTTATGGT
CAGCCGGGAG ACTGGTATGG CGTGGACTAC GGAATGCCCA TTCCCGTGCG GAGCGTGGAA
GTGCTGATGG GCCGCAACGA CAAGGACGGT GACTATGTAG AGAAAGGGCA GCTGGAAGGC
TCCCGCGACC TCAAGACCTG GAAGCCGCTG GGGCCGGAGA CCGCCGGAAT GCAGGTAGCC
TGGAAGGCTC CCAAGCCGGT TTTGCTCCGT GCCGTGCGCT ACCGCGTCAT TGAGCCGAAG
AAAACGGGCA ACGGACGGGC CGTCTGGACT GCCGTGAGGG AAATAGCCGT CAACACGCCT
CCTTCCGCTA TGGCTTCCTC CAACGTGGCT GGGCTGGAGG GCGTTTCCGT GCAGAAATCC
GACAAAATCG TGCGAATCAA CCGCGTAATG GAAACGCACA AGATGAAACC CGGAGAATTT
ATTTCCCTGC GGTTGGATGG CCCCACGGAC GCCACCTGGC TGGAAGTCAA CCTGGAACGG
GATGACGTCA ACTCCTGGGC TGAGGTAGTG CTGGATGTGG AAGGTTCCTC CAAACCCGTG
GTCCAGAAGC TGGACAAACA GGGCAAGAAC TTCATTGCCA GGGGAAATCA GCTCCCCAAG
GGGATCAAAG GCATGAAACT GGTCAATAAA AGCGGGAAGG AACAGGACAT TGTTCTGAAC
ATGTTTAAAT TTGACGTTCC TCCTTCTGAT CCTGGCACCA GCCTGGTCTC CCTGAGCGAC
AGAAACCTGA AGACGGTTTA CCGTGCCGAT AAGCCGTTGG ACGTAGTCGT TCCCAATCTG
GACAACCCCA GGGCTTCCAA GGTAGTCGTA GTGGGATCCG CCGCCTTCGC CATCCAGGCG
CGCCGCGGCG AGGGTGCCTG GACGCTGGTC GGCAAGAGAA ACGCCGGTCC CGGAGTCTCC
GAATTTGCCA TTCCTGCCGG AACTTCCGCC GTGCGCCTGA CCTACAAGGC TCCCCAGCCG
GATGCAATCA TTAATGAAGT GATTTTTTCC TCCAGGAAAT AA
 
Protein sequence
MNGFQLLQCG LSVAAFLAGS ALAATPAVYP SPQQSKFTSQ TVAFSGKPSV TIRSAKAGGS 
KLLDGVPEKS GAYKLVISPQ GKVGIGAHDE RGAFYAMQTL RQLGTKTGGE GVILPVGEIT
DWPDIEFRGT VEGFYGTPWS HEARLSQLRF YGQNKMNTYI YGPKDDPYHS SPHWRDPYPA
DQAAQIRELV KVAKENHVDF VWAIHPGKDI KWTEEDMNNV IKKFEMMYKL GVRSFAVFFD
DIFGEGKRGD MQALLLNKIN DEFVKVKKDV TPLVMCPTEY NRGWANPKPG TYLDILGDRL
DPSIHVMWTG NSVCHDITLE GQQWVNRRIK RPSYVWWNFP VTDYCRSNLC MGRVYGLATE
PGARESMGGF VSNPMDKPEA SKVSLFGLAD YSWNINGFKS EESWKEGVRR LFPKAAEAMQ
VFVNHNSDQG PNGHGYRREE SVEIEPVVKR VLEAAREGRI EKADAALLKK EFARMASAAP
VIRAKADNPR LMKEIGAWVD AFEQLGRAGQ HAVAALEENN AGEAATHLVQ ATQALAAMDG
ISRRHNQEGQ LYRSAVKTGS RVMAPAVNEL ADIVSKKVFP AIAGAPALSP KPLVKGGSMD
KAELFCDGDR GSFWHSGAYG QPGDWYGVDY GMPIPVRSVE VLMGRNDKDG DYVEKGQLEG
SRDLKTWKPL GPETAGMQVA WKAPKPVLLR AVRYRVIEPK KTGNGRAVWT AVREIAVNTP
PSAMASSNVA GLEGVSVQKS DKIVRINRVM ETHKMKPGEF ISLRLDGPTD ATWLEVNLER
DDVNSWAEVV LDVEGSSKPV VQKLDKQGKN FIARGNQLPK GIKGMKLVNK SGKEQDIVLN
MFKFDVPPSD PGTSLVSLSD RNLKTVYRAD KPLDVVVPNL DNPRASKVVV VGSAAFAIQA
RRGEGAWTLV GKRNAGPGVS EFAIPAGTSA VRLTYKAPQP DAIINEVIFS SRK