Gene Amuc_2019 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2019 
Symbol 
ID6274491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2452124 
End bp2453638 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content56% 
IMG OID642614079 
ProductBeta-N-acetylhexosaminidase 
Protein accessionYP_001878610 
Protein GI187736498 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3525] N-acetyl-beta-hexosaminidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.00758955 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTAGAAT CTTTTCCGAA ACAGTTCTTC ATGATGTTTA AACTGCCCCT CATCCTGGCC 
TGCGCCATTT TTTCCGCCCA CATGGCATGT GCCGCCGCAG CGGACAAATA CAGCGTTATC
CCTGAACCGG AAAAAACGGA GCTGCAGCAC AACAGTACCA GAACCTTAAA ACTTCTTTCC
GACCAGGAGG CTCCGACCCT GGGAACGGAC GCCTACCGGC TCACAGTCAC CCCGCAGGGG
GCGCACCTTG CTTCCGGAGG AAGGGAAGGC AGAATTTACG GGCTGGCAAC CCTCCGCCAG
CTCCGGGACC AGCTGGCGGG ACAGCCGGAG GGCATCCCCT GCGGCGTCAT CACGGACAAG
CCGCGCTATC CGTGGCGCGG CCTCATGGTA GATCCCGCAC GCTTTTTCAT CCCCACGGCC
GATCTGAAAA AATTTGTGGA TATGATGGCC TACTACAAAT TCAACAAGCT CCAGATCCAC
TTGACGGACG ACCAGGGGTG GCGTCTTCCG GTGCCCGGCT ACCCCAAACT CAAAAGCATC
TCCTCCAAAC GGAAAGAAAG CATGCGCAAC GGAATCCCCC ATGAAGGGAT GTACACCAAA
CAGGAACTGA AAGAGCTGGT GGCGTACTGC GCAGCGCGCG GCATTGAGGT CATCCCTGAA
ATAGACGTGC CGGGGCACAA CCAAGCCCTG GCGGCAGCCT ACCCTGAATT CTTCTGCTTC
CCGAACCCGG ATACGAAAGT GAAGACCGAT GAAGGCGTCA CCCTCCACCT CATCTGCCCG
CATAAACCGG AAGTCTGGAA ATTTTATGCC GCTGTTTTCA AGGAACTCAA AGATATTTTC
CCGTCCGGCA TCGTCCATCT GGGCGGCGAT GAAGCACCCC TGGAAAAAAC CTGGGCCAAA
TGCCCCCTCA GCATCCAGTA CCGGGAGCAA AAAGGCATGA AGGACGTCCA CGAGGAATTG
AAGGAATTCA TCAAAAAAAT GTCCTCCATG CTGGCTGTTC ACGGCAAGCG CATCCAACTA
TGGTATGAAA AACCGTGGGC CAGGGCCAAC ATCTACAACA AAGGAGACAC CGTCTTCACC
TGGCGCATGG GACTGACACC GTCCACCATC ACGGAGACGA AAAAGCAGGG GCTCTCCCTG
ATCATTGCTG CCGGGGAATA CTGTTACCTG GACTATCCGC AACTTCCAGG GCAAAGCAAC
CGGGGATGGA TGCCCACCAC CACGCTGGAG CAAAGCTACA GGCTGGACCC CGCCTACGGC
AGACCAGAAA AGGAAACAAA CCATATCACC GGCGTTCAGG GCACCGTGTG GGGAGAACAT
CTCCCTACCC TGAACCACAT TCTCTACCGC GCCTATCCGC GTGCCTGCGC CATTGCGGAA
GCCGGCTGGT CACCGATGAA CGTGCGCTCC TGGGAAAACT TCCGGCGCAA GCTGGCCGAC
CACCGTCAAT TCATCCTCAA ACGCTTCAAT TATGATATGG AGCGCACCAA AGAAAACGAA
CCGCCTTTCA AATAA
 
Protein sequence
MVESFPKQFF MMFKLPLILA CAIFSAHMAC AAAADKYSVI PEPEKTELQH NSTRTLKLLS 
DQEAPTLGTD AYRLTVTPQG AHLASGGREG RIYGLATLRQ LRDQLAGQPE GIPCGVITDK
PRYPWRGLMV DPARFFIPTA DLKKFVDMMA YYKFNKLQIH LTDDQGWRLP VPGYPKLKSI
SSKRKESMRN GIPHEGMYTK QELKELVAYC AARGIEVIPE IDVPGHNQAL AAAYPEFFCF
PNPDTKVKTD EGVTLHLICP HKPEVWKFYA AVFKELKDIF PSGIVHLGGD EAPLEKTWAK
CPLSIQYREQ KGMKDVHEEL KEFIKKMSSM LAVHGKRIQL WYEKPWARAN IYNKGDTVFT
WRMGLTPSTI TETKKQGLSL IIAAGEYCYL DYPQLPGQSN RGWMPTTTLE QSYRLDPAYG
RPEKETNHIT GVQGTVWGEH LPTLNHILYR AYPRACAIAE AGWSPMNVRS WENFRRKLAD
HRQFILKRFN YDMERTKENE PPFK