Gene Information |
Locus tag | Amuc_0052 |
Symbol | |
ID | 6275118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 69948 |
End bp | 72809 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642612095 |
Product | Hyaluronoglucosaminidase |
Protein accession | YP_001876679 |
Protein GI | 187734567 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
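The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(69948, 72809)
print(length)                     # 2862
print(protein_length_aa(length))  # 953
```

Both results agree with the Gene Length and Protein Length fields of the record.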
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.110511 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.10576 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGGTT TTCAACTTCT CCAATGTGGC TTGAGCGTGG CGGCCTTTCT GGCCGGCTCC GCTCTGGCGG CGACTCCTGC CGTATATCCG TCCCCCCAGC AGTCCAAATT CACCTCCCAG ACGGTTGCTT TCTCCGGGAA GCCTTCCGTG ACAATACGTT CTGCCAAGGC GGGCGGGAGC AAGCTGTTGG ACGGGGTTCC GGAAAAATCC GGAGCCTACA AGCTGGTGAT TTCTCCGCAG GGAAAGGTGG GCATAGGAGC CCATGACGAA CGGGGGGCAT TTTACGCCAT GCAGACGCTG CGGCAGCTGG GAACGAAGAC CGGCGGAGAA GGCGTGATTT TGCCCGTTGG GGAAATTACG GATTGGCCGG ACATTGAATT CCGCGGAACG GTGGAAGGTT TTTACGGCAC TCCCTGGAGC CATGAAGCCC GTCTGAGCCA GCTGCGCTTT TACGGGCAGA ACAAAATGAA CACATACATC TACGGGCCGA AGGACGATCC CTACCATTCC TCCCCCCACT GGCGGGATCC CTATCCCGCA GACCAGGCCG CCCAGATCAG GGAACTGGTG AAGGTGGCGA AGGAAAACCA TGTGGACTTT GTCTGGGCCA TCCACCCCGG CAAAGACATT AAGTGGACGG AAGAAGACAT GAACAACGTC ATCAAAAAGT TTGAGATGAT GTACAAGCTG GGCGTCCGTT CCTTTGCCGT GTTTTTTGAC GACATCTTCG GAGAGGGCAA GCGGGGGGAC ATGCAGGCCC TTCTGCTGAA CAAAATCAAT GACGAATTTG TCAAGGTGAA GAAAGACGTC ACTCCCCTGG TCATGTGCCC TACGGAGTAC AACCGCGGCT GGGCCAATCC CAAGCCGGGA ACTTATCTGG ATATTCTGGG CGACCGCCTG GACCCCTCCA TCCACGTTAT GTGGACGGGG AATTCCGTCT GCCATGACAT CACGCTGGAA GGCCAGCAGT GGGTGAACAG GAGAATCAAG CGCCCTTCCT ACGTCTGGTG GAATTTCCCC GTTACGGACT ACTGCCGTTC CAACCTGTGC ATGGGCCGCG TGTACGGACT TGCCACGGAA CCGGGAGCCA GGGAATCCAT GGGCGGATTT GTCTCCAACC CCATGGATAA GCCGGAGGCC TCCAAGGTTT CCCTCTTCGG TCTTGCGGAC TACTCCTGGA ATATCAACGG CTTTAAGTCG GAGGAATCCT GGAAGGAGGG AGTCAGGCGC CTGTTCCCGA AGGCGGCGGA AGCCATGCAG GTTTTTGTGA ACCACAACTC CGACCAGGGG CCTAACGGCC ACGGTTACCG GCGTGAGGAA TCCGTGGAAA TAGAACCTGT GGTCAAGCGC GTGCTGGAAG CCGCCCGGGA AGGCAGGATA GAGAAGGCTG ATGCGGCTCT GCTTAAAAAG GAATTCGCTC GCATGGCTTC CGCTGCGCCT GTCATCCGCG CCAAGGCCGA CAATCCCCGG CTGATGAAGG AAATCGGCGC GTGGGTGGAC GCCTTTGAAC AACTGGGGCG TGCTGGCCAG CATGCCGTGG CGGCGCTGGA GGAAAACAAC GCCGGAGAGG CCGCGACCCA CCTGGTGCAG GCCACGCAGG CTTTGGCGGC CATGGATGGA ATTTCCCGCC GCCATAACCA GGAAGGGCAG CTGTACCGTT CCGCGGTAAA GACCGGTTCC CGGGTGATGG CCCCCGCCGT CAATGAACTG GCGGATATTG TCTCCAAAAA AGTTTTCCCG GCCATTGCCG GCGCTCCGGC TCTTTCCCCC AAGCCTCTGG TCAAGGGGGG CAGCATGGAT 
AAGGCGGAAC TCTTCTGTGA CGGGGACCGC GGCTCCTTCT GGCACTCCGG CGCTTATGGT CAGCCGGGAG ACTGGTATGG CGTGGACTAC GGAATGCCCA TTCCCGTGCG GAGCGTGGAA GTGCTGATGG GCCGCAACGA CAAGGACGGT GACTATGTAG AGAAAGGGCA GCTGGAAGGC TCCCGCGACC TCAAGACCTG GAAGCCGCTG GGGCCGGAGA CCGCCGGAAT GCAGGTAGCC TGGAAGGCTC CCAAGCCGGT TTTGCTCCGT GCCGTGCGCT ACCGCGTCAT TGAGCCGAAG AAAACGGGCA ACGGACGGGC CGTCTGGACT GCCGTGAGGG AAATAGCCGT CAACACGCCT CCTTCCGCTA TGGCTTCCTC CAACGTGGCT GGGCTGGAGG GCGTTTCCGT GCAGAAATCC GACAAAATCG TGCGAATCAA CCGCGTAATG GAAACGCACA AGATGAAACC CGGAGAATTT ATTTCCCTGC GGTTGGATGG CCCCACGGAC GCCACCTGGC TGGAAGTCAA CCTGGAACGG GATGACGTCA ACTCCTGGGC TGAGGTAGTG CTGGATGTGG AAGGTTCCTC CAAACCCGTG GTCCAGAAGC TGGACAAACA GGGCAAGAAC TTCATTGCCA GGGGAAATCA GCTCCCCAAG GGGATCAAAG GCATGAAACT GGTCAATAAA AGCGGGAAGG AACAGGACAT TGTTCTGAAC ATGTTTAAAT TTGACGTTCC TCCTTCTGAT CCTGGCACCA GCCTGGTCTC CCTGAGCGAC AGAAACCTGA AGACGGTTTA CCGTGCCGAT AAGCCGTTGG ACGTAGTCGT TCCCAATCTG GACAACCCCA GGGCTTCCAA GGTAGTCGTA GTGGGATCCG CCGCCTTCGC CATCCAGGCG CGCCGCGGCG AGGGTGCCTG GACGCTGGTC GGCAAGAGAA ACGCCGGTCC CGGAGTCTCC GAATTTGCCA TTCCTGCCGG AACTTCCGCC GTGCGCCTGA CCTACAAGGC TCCCCAGCCG GATGCAATCA TTAATGAAGT GATTTTTTCC TCCAGGAAAT AA
|
Protein sequence | MNGFQLLQCG LSVAAFLAGS ALAATPAVYP SPQQSKFTSQ TVAFSGKPSV TIRSAKAGGS KLLDGVPEKS GAYKLVISPQ GKVGIGAHDE RGAFYAMQTL RQLGTKTGGE GVILPVGEIT DWPDIEFRGT VEGFYGTPWS HEARLSQLRF YGQNKMNTYI YGPKDDPYHS SPHWRDPYPA DQAAQIRELV KVAKENHVDF VWAIHPGKDI KWTEEDMNNV IKKFEMMYKL GVRSFAVFFD DIFGEGKRGD MQALLLNKIN DEFVKVKKDV TPLVMCPTEY NRGWANPKPG TYLDILGDRL DPSIHVMWTG NSVCHDITLE GQQWVNRRIK RPSYVWWNFP VTDYCRSNLC MGRVYGLATE PGARESMGGF VSNPMDKPEA SKVSLFGLAD YSWNINGFKS EESWKEGVRR LFPKAAEAMQ VFVNHNSDQG PNGHGYRREE SVEIEPVVKR VLEAAREGRI EKADAALLKK EFARMASAAP VIRAKADNPR LMKEIGAWVD AFEQLGRAGQ HAVAALEENN AGEAATHLVQ ATQALAAMDG ISRRHNQEGQ LYRSAVKTGS RVMAPAVNEL ADIVSKKVFP AIAGAPALSP KPLVKGGSMD KAELFCDGDR GSFWHSGAYG QPGDWYGVDY GMPIPVRSVE VLMGRNDKDG DYVEKGQLEG SRDLKTWKPL GPETAGMQVA WKAPKPVLLR AVRYRVIEPK KTGNGRAVWT AVREIAVNTP PSAMASSNVA GLEGVSVQKS DKIVRINRVM ETHKMKPGEF ISLRLDGPTD ATWLEVNLER DDVNSWAEVV LDVEGSSKPV VQKLDKQGKN FIARGNQLPK GIKGMKLVNK SGKEQDIVLN MFKFDVPPSD PGTSLVSLSD RNLKTVYRAD KPLDVVVPNL DNPRASKVVV VGSAAFAIQA RRGEGAWTLV GKRNAGPGVS EFAIPAGTSA VRLTYKAPQP DAIINEVIFS SRK
|
| |
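The gene sequence is listed in coding orientation (it begins with ATG), so it can be translated directly and compared with the protein sequence. The following is a minimal sketch, assuming the sequences are pasted as whitespace-separated blocks of ten characters as shown above. It uses the amino acid assignments of the standard genetic code, which translation table 11 shares; alternative start codons permitted by table 11 are not handled, which does not matter here because the gene starts with ATG. All names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGAATGGTT TTCAACTTCT CCAATGTGGC"))  # MNGFQLLQCG
```

Running `translate` on the full gene sequence and comparing the result with `clean` applied to the protein sequence checks the two fields against each other; `gc_percent` on the full gene sequence can be compared with the GC content field.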