Gene Amuc_1815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1815 
Symbol 
ID6275605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2202069 
End bp2204255 
Gene Length2187 bp 
Protein Length728 aa 
Translation table11 
GC content55% 
IMG OID642613879 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001878414 
Protein GI187736302 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0101329 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.0951479 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCA TGAAGTTCCT TTTATTATCG TTTGCATGGG TGTGCATGGC TTGCGCCGGA 
GCATGGGGGC AGGATACGGC CCCGTCTTTC CCGGCTAACG GGGCCAATTA CAGGCTGTTT
CCGGCGGACC GGCCTCCGCT GGTTCCCAAA CCCCAGCAGC TGCGCTGGGA CGACAGGGCC
ATTCCCGTGC AGTCCGTACG CATTTTGGCT CCGTCTCCGT CCAGGACTTC CTATCCGGAA
CAGATGAAGT TCATTGTTTC CGAATTGAAA TCTTTTCTGG CGGAGCACTG CGTGAAGGTG
GCTCCGGACG GGACGTTTGC CGTTAAATTC GTCAAGGGGG ATGTGAAAGC CGGCACGGAA
AATTCCAAGC TGAAGGAGGA GGCTTATTCC CTCCGAGTAA CTTCCGGCGG CGCACTCATT
ACGGCGATGG ATACCAGAGG ATTCTATTAC GGCATGAAAA CGCTGGAGCA GCTTCTTTTG
CGCCGCGGCG GGACGACGAC CATTGCCGCC TGCGATATCG TGGACTGGCC GGATTTTGAA
ATCCGCGGAT TCATGAACGA TGTGGGACGC AATTACATGC CGCTGCCTCT GATTGCACGG
GAGCTGGATT CCATGGCGCA GCTCAAGCTG AATGTTTACC ACTTCCATTT TACGGAGAAC
CCCGGCTGGC GGCTGGAATC CAAAATTTAT CCGGAGCTGA ACGCCCCCGA AAATTATACG
CGCATGCCTG GCAAGTTTTA CACGCAGAAG GAGTTTAAGC AGCTGGTGGA GTACTGCCGC
CTGCGCAATA TCCTGCTGAT TCCGGAGATG GATATGCCGG GGCACAGCCA GATGTTCCGC
AAGGCGCTCA ACGTGAAGAT GAGCGATGAA AAAGCCACCA AAGCCCTGGT GGCCCTGATC
AAGGAGTTGT GTTCCCTGGT TCCCAAGGAG AAAATGCCCA TCATCCACAT TGGCACGGAC
GAGGTGCGCG GCAAGGATGA GCAGGTGAAC AATGAGATTC TTAAGGAGTA CATCCATGCA
GTGGAGTTCT GCGGCCGCAT TCCCATGAGA TGGCAGCCCG GCCTGACGCC GAAGGGCTAT
AACGGCTCCA TCCAGCAGTT ATGGTCCGGC CGCCAGAACC GTGGCGCATG GCCTACCGAC
GGAGCGAAGT ATGTTGATTC CCTGGAGACT TACCTGAACC ACCTTGATCC GTTTGAAACG
GCCATGACCA TGTATTTCCG CCGGGCATGC CCGTTTCGGA ATGCGGAAGG ACTGGGCATG
ATGCTGTGTT CTTTCCCGGA CCTGGAAATT ACGGATCCGC GCAACCAAGT TCTTCAGACG
CCCGTTTACG CCGGCATGGC GTTCGTTTCC GAACCTTTGT GGAATAATCC CCATGAGAAG
GTGCTGGGAG ACCCCAACCA GGACGAATAT ATGAAGTATT TTTCCAATCT GCCCGTGCAG
GGGGATCCTC TGCTGAAGGG GTTTGCGGAT TACGAGAACC GCGTGCTCGC CATCCGGGAC
CGTTTTTTCG TGGATAAGGA GTTTAATTAC GTACGGCAGG CGAATATTCC CTGGAAATTG
CTGGGGCCTA TTCCCAACGG CGGTAAGACG GAAAAGGAAT TCGCTCCGGA GGAGGACAAC
AAGGCAGGGA AGATGAGGGA TTCCTACGAG ATTGACGGCG TCACCTATGA GTGGTCCGGA
GACGATTACA CGGGGGCCAC CATCATTTTC AAGCATTACT GCGATTTTCC GACGCTGTTC
AATGGCGCAA AGATGGGAGC TTATCCCCAC AAAAACCACA CTTATTACGC GCAGACCTGG
ATTTATTCCC CCAAGGCGCA GACGGTGCCT TTCTGGATCA GCGGACATAC CTGGGCCACG
TCCGATTGGC GCAACGGTCC GGCGAGCGTT CCCGGCAAGT GGTTTCATGC GGATCCCAAA
TTTTTTGTGA ACGGCCGGGA GATTGCCCCC CCGCAATGGA AAAAGCCGCG TAACAGCGGC
GTGATGGTGG ATGAAAACTA CCATTTCCGG GAGCCTTCCA TGGTTCCTCT TAAGAAGGGT
TGGAACCGCG TGCTGGTAAA GAGCCCCAGC AACAATTCCG CGCGTCGGTG GATGTTCACA
TTCGTTCCGG TGCTGGTGAA CCCCAAGACG CCCGGCTGCA ATGTGAAGGA GTATCCCGGC
CTCAAATTTT CCACACGTCC GGAATAG
 
Protein sequence
MNRMKFLLLS FAWVCMACAG AWGQDTAPSF PANGANYRLF PADRPPLVPK PQQLRWDDRA 
IPVQSVRILA PSPSRTSYPE QMKFIVSELK SFLAEHCVKV APDGTFAVKF VKGDVKAGTE
NSKLKEEAYS LRVTSGGALI TAMDTRGFYY GMKTLEQLLL RRGGTTTIAA CDIVDWPDFE
IRGFMNDVGR NYMPLPLIAR ELDSMAQLKL NVYHFHFTEN PGWRLESKIY PELNAPENYT
RMPGKFYTQK EFKQLVEYCR LRNILLIPEM DMPGHSQMFR KALNVKMSDE KATKALVALI
KELCSLVPKE KMPIIHIGTD EVRGKDEQVN NEILKEYIHA VEFCGRIPMR WQPGLTPKGY
NGSIQQLWSG RQNRGAWPTD GAKYVDSLET YLNHLDPFET AMTMYFRRAC PFRNAEGLGM
MLCSFPDLEI TDPRNQVLQT PVYAGMAFVS EPLWNNPHEK VLGDPNQDEY MKYFSNLPVQ
GDPLLKGFAD YENRVLAIRD RFFVDKEFNY VRQANIPWKL LGPIPNGGKT EKEFAPEEDN
KAGKMRDSYE IDGVTYEWSG DDYTGATIIF KHYCDFPTLF NGAKMGAYPH KNHTYYAQTW
IYSPKAQTVP FWISGHTWAT SDWRNGPASV PGKWFHADPK FFVNGREIAP PQWKKPRNSG
VMVDENYHFR EPSMVPLKKG WNRVLVKSPS NNSARRWMFT FVPVLVNPKT PGCNVKEYPG
LKFSTRPE