Gene Amuc_2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2109 
Symbol 
ID6275448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2569269 
End bp2570330 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content60% 
IMG OID642614171 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001878699 
Protein GI187736587 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.302973 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCCTG CCCTGATCGG CATAAGCGGC CATGAAGTGG GCGCGGAGGA GGAGGCTGCC 
ATCCGGCGTT TGCAGCCGGC CGGATTCATT CTGTTTTCCA GGAATATTGA TTCCGTGGAG
CAGGTGCGCG GTCTGACGGA GTCGCTGCGG AAACTTTGCC TCCACCATCC TGTCATTGCC
GTGGATCAGG AGGGGGGGCG GGTGGTTCGC ACCGCTTCCC TGGGCTTGAA TTTGCCCTCC
CCGGCTTCGC TGGCCCGGCT CGGTTCGGTT GGCGGCATCG TGGAACTGGG CGCGGTGACG
GCTTTAGCTC TCCGCTACCT GGGGGTGAAC CTGAATTTTG CCCCGGTGCT GGATATTTGC
CATGATCCGT CCGCAGCCAA CGCGCTGCCC GGCCGCTGCT GGGGAGACAA TGCGCAGGAC
GTTATTTCAC GCGGCGGCGT TTATGCCTCC AACCTGCGCC GGGGAGGCGT GCAGAGCTGC
GGCAAACATT TTCCCGGCAT GGGGCGTGCC CTGGCGGATC CCCATTTCAG CCTTCCCGTG
ATCGGCCTGG ATGAACGGGA GCTGTTCAAG ACGGATCTGC TGCCTTTTCT GGCTCTGTGT
CCGGCATTGT CCTCCATCAT GTCCGCCCAT ATCATGCTGC CTCAGATTGA TCCCGATTAT
CCCGCCACCT TGTCTGAACG GGTGATCAGG GGGCTTCTGC GTGACCGCCT CGGTTTCCGG
GGCGTGGTGT TTACGGATGA TTTGTGCATG GGGGCGATTA CGACGCAGTA TTCACCGGAT
GACGCCGCCT TCCTGTCCCT GAAGGCCGGA TGCGATCTTC CCCTGATTTG CCATGACCCT
CTTCCCTGGC TGGATGGGTT GGCTTCCCGC CAGGAAAGTT TGAACGCCTA TGACCGGTGG
GATTCTTTCA AACGGGTGGA AAAGCTGAGC GACTCCCTGT GTTTCCCGTT TCCGGAAAAG
GCTTCCCTGT GGGATTCCTG CCTCCGCCGT GCGGAGGCTC TATGCCGCCT GGAAGAGGAC
GGAAGAGAAA AACTGCCTTC ATCCCCAGTC CAGAAATATT GA
 
Protein sequence
MLPALIGISG HEVGAEEEAA IRRLQPAGFI LFSRNIDSVE QVRGLTESLR KLCLHHPVIA 
VDQEGGRVVR TASLGLNLPS PASLARLGSV GGIVELGAVT ALALRYLGVN LNFAPVLDIC
HDPSAANALP GRCWGDNAQD VISRGGVYAS NLRRGGVQSC GKHFPGMGRA LADPHFSLPV
IGLDERELFK TDLLPFLALC PALSSIMSAH IMLPQIDPDY PATLSERVIR GLLRDRLGFR
GVVFTDDLCM GAITTQYSPD DAAFLSLKAG CDLPLICHDP LPWLDGLASR QESLNAYDRW
DSFKRVEKLS DSLCFPFPEK ASLWDSCLRR AEALCRLEED GREKLPSSPV QKY