Gene Amuc_0868 details

Gene Information

Locus tag: Amuc_0868
Symbol:
ID: 6274300
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1036915
End bp: 1038564
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 58%
IMG OID: 642612923
Product: Beta-N-acetylhexosaminidase
Protein accession: YP_001877482
Protein GI: 187735370
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
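
The coordinate and length fields above agree with each other, and the arithmetic is easy to check. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(1036915, 1038564)
print(length_bp)                  # 1650
print(protein_length(length_bp))  # 549
```

Both results match the listed Gene Length (1650 bp) and Protein Length (549 aa).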


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTCCA AGTGTACCTT TTCCGCCACG GTTTTCAGCC TGTTTTCCCT TTGCTGGGGC 
GCCCCATCCT CTCCGGTTCT CGAAGCGCCC CATACCATTC CCCTGCCCGC CGCCATGCGC
GTCCAAACCG GAGAAAGCGG GTTTTCCCTG AAAAACGGCG TCAGGCTCCC GGAAAAAAAT
CCTCTTTCCA GGCAGGCGGA ACGGATTTTC CGCGACAACG GGATCAACAC GGCCCTGGTT
AAAAACAACG CGGACATCAT CTTTACGGAA GACGCTTCCC TGGGCAGGGA AGGCTACCGC
CTTGCCGTAA CGCCGGATTC CATCTCCATT GCCTCCGGTT CCGTGAACGG AACCCTGTAT
GCCCTTCAAT CCCTCGTTCA AAGCATCGCT GCCGACAAAA ACGGAGCTCC GGCCCTGCCC
CGGATGGACG TAAAAGACCA GCCCCGCTTT TCATGGCGGG GCCTGATGGT AGACAGCTGC
CGCCACATGA TGCCCGTGCG GGACATCAAA AAAGTGCTGG ACCTGATGGA ACGGTATAAA
TTCAACACCC TGCACTGGCA CCTGACGGAC GACCAGGGGT GGCGTCTCCC AATCGCCAAG
TACCCCAGGC TGACAACCGT GGGAGGCGCC CGGGCTCAAT CCCCCGTCAT CGGCAACCGC
AATAAGGGAG ACGGCATCCC CTACTCCGGC CATTACACCG CAGATGAAAT CCGGGATGTG
GTGCGGTACG CCAGAGACCG GGGCATTACC GTCATTCCGG AAGTGGAAAT GCCAGGCCAT
GCCTCCGCAG CCATCGCCGC CTATCCGGAA CTGGGGAATA CGGACATCCC GGGTTATGAG
CCTAGGGTGC AGGAAACCTG GGGCGTGCAC TCCTATACCT TCTCCCCCAC GGAAAAAACC
TTCCGTTTTC TGGAAGACGT CATTGATGAA ATATGCGCCC TGTTCCCGGA CAGCCCCTAC
ATCCACATCG GAGGGGATGA AGCGCCCAAG AATCAGTGGA AACAGTCCCC CACGGCCCAG
CGGGTCATGA AGGACAACGG CCTGGCCAAT GAACACGAGC TCCAGAGCTA CTTCATCCGC
CGCGTGGAAA AAATGATCAA TAACCGCGGA AAAAGGCTCA TTGGCTGGGA TGAAATCCAG
GAAGGGGGCC TTTCCCCCAC CGCTACCATG ATGGTTTGGC GCAGCCAAAT GCCGCACATC
GCCGCACAAG CCCTGGCTCA AGGCAACGAT ATTGTGATGA CGCCCAACAG CCACCTGTAC
TTTGACTATG ACCAGGGGCC CGGAAAACCC GCTGCCCCCG AATACGAGAC GATTAATAAC
AATCAGCTGA CCTGGCAGCA TGTTTACGGA CTGGAACCGG TGCCTCAGGG AACGCCCCGG
GAACGGGAAA AGCAGGTGCT GGGCTGCCAG GCGAACATCT GGACGGAATA TATCCCGAAC
CTGCCGAAAT GGGAATACCA TGTCTTCCCC CGCGCCCTGG CGCTGGCGGA AGTTGCCTGG
ACCCCGCAGG AGCTAAAAAA TGAGAAAGAT TTCCGTAAAC GCCTCGACCG CCAGCTTCCC
TTCCTGGACG CCCGCGGCGT CAATTACAAA AGACCGGACA ATGGAGCCCC CGCACAGCCG
AAGGCCGTCA TTACGCGGGA ACGCCGTTAA
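
The gene sequence is displayed in space-separated groups of 10 bases, so it has to be cleaned before any computation. The following is a hedged Python sketch that strips the display whitespace and computes the GC fraction. The helper names `clean_sequence` and `gc_content` are illustrative, and only the first display line is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-base display groups."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence, copied from the record above.
first_line = "ATGATTTCCA AGTGTACCTT TTCCGCCACG GTTTTCAGCC TGTTTTCCCT TTGCTGGGGC"
seq = clean_sequence(first_line)
print(len(seq))                  # 60
print(f"{gc_content(seq):.0%}")  # GC fraction of this line only
```

Pasting all 28 display lines into `clean_sequence` gives a 1650-base string, and its `gc_content` can then be compared with the GC content listed in the Gene Information section.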
 
Protein sequence
MISKCTFSAT VFSLFSLCWG APSSPVLEAP HTIPLPAAMR VQTGESGFSL KNGVRLPEKN 
PLSRQAERIF RDNGINTALV KNNADIIFTE DASLGREGYR LAVTPDSISI ASGSVNGTLY
ALQSLVQSIA ADKNGAPALP RMDVKDQPRF SWRGLMVDSC RHMMPVRDIK KVLDLMERYK
FNTLHWHLTD DQGWRLPIAK YPRLTTVGGA RAQSPVIGNR NKGDGIPYSG HYTADEIRDV
VRYARDRGIT VIPEVEMPGH ASAAIAAYPE LGNTDIPGYE PRVQETWGVH SYTFSPTEKT
FRFLEDVIDE ICALFPDSPY IHIGGDEAPK NQWKQSPTAQ RVMKDNGLAN EHELQSYFIR
RVEKMINNRG KRLIGWDEIQ EGGLSPTATM MVWRSQMPHI AAQALAQGND IVMTPNSHLY
FDYDQGPGKP AAPEYETINN NQLTWQHVYG LEPVPQGTPR EREKQVLGCQ ANIWTEYIPN
LPKWEYHVFP RALALAEVAW TPQELKNEKD FRKRLDRQLP FLDARGVNYK RPDNGAPAQP
KAVITRERR
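
The protein sequence can be cross-checked against the gene sequence by translating it. The following is a minimal Python sketch, not the database's own method. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw_dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(raw_dna.split()).upper()  # drop display spaces and newlines
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 30 bases of the gene sequence above (both in frame).
print(translate("ATGATTTCCA AGTGTACCTT TTCCGCCACG"))  # MISKCTFSAT
print(translate("AAGGCCGTCA TTACGCGGGA ACGCCGTTAA"))  # KAVITRERR
```

The two outputs match the first and last residues of the protein sequence listed above.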