Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2109 |
Symbol | |
ID | 6275448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2569269 |
End bp | 2570330 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642614171 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001878699 |
Protein GI | 187736587 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.302973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCCTG CCCTGATCGG CATAAGCGGC CATGAAGTGG GCGCGGAGGA GGAGGCTGCC ATCCGGCGTT TGCAGCCGGC CGGATTCATT CTGTTTTCCA GGAATATTGA TTCCGTGGAG CAGGTGCGCG GTCTGACGGA GTCGCTGCGG AAACTTTGCC TCCACCATCC TGTCATTGCC GTGGATCAGG AGGGGGGGCG GGTGGTTCGC ACCGCTTCCC TGGGCTTGAA TTTGCCCTCC CCGGCTTCGC TGGCCCGGCT CGGTTCGGTT GGCGGCATCG TGGAACTGGG CGCGGTGACG GCTTTAGCTC TCCGCTACCT GGGGGTGAAC CTGAATTTTG CCCCGGTGCT GGATATTTGC CATGATCCGT CCGCAGCCAA CGCGCTGCCC GGCCGCTGCT GGGGAGACAA TGCGCAGGAC GTTATTTCAC GCGGCGGCGT TTATGCCTCC AACCTGCGCC GGGGAGGCGT GCAGAGCTGC GGCAAACATT TTCCCGGCAT GGGGCGTGCC CTGGCGGATC CCCATTTCAG CCTTCCCGTG ATCGGCCTGG ATGAACGGGA GCTGTTCAAG ACGGATCTGC TGCCTTTTCT GGCTCTGTGT CCGGCATTGT CCTCCATCAT GTCCGCCCAT ATCATGCTGC CTCAGATTGA TCCCGATTAT CCCGCCACCT TGTCTGAACG GGTGATCAGG GGGCTTCTGC GTGACCGCCT CGGTTTCCGG GGCGTGGTGT TTACGGATGA TTTGTGCATG GGGGCGATTA CGACGCAGTA TTCACCGGAT GACGCCGCCT TCCTGTCCCT GAAGGCCGGA TGCGATCTTC CCCTGATTTG CCATGACCCT CTTCCCTGGC TGGATGGGTT GGCTTCCCGC CAGGAAAGTT TGAACGCCTA TGACCGGTGG GATTCTTTCA AACGGGTGGA AAAGCTGAGC GACTCCCTGT GTTTCCCGTT TCCGGAAAAG GCTTCCCTGT GGGATTCCTG CCTCCGCCGT GCGGAGGCTC TATGCCGCCT GGAAGAGGAC GGAAGAGAAA AACTGCCTTC ATCCCCAGTC CAGAAATATT GA
|
Protein sequence | MLPALIGISG HEVGAEEEAA IRRLQPAGFI LFSRNIDSVE QVRGLTESLR KLCLHHPVIA VDQEGGRVVR TASLGLNLPS PASLARLGSV GGIVELGAVT ALALRYLGVN LNFAPVLDIC HDPSAANALP GRCWGDNAQD VISRGGVYAS NLRRGGVQSC GKHFPGMGRA LADPHFSLPV IGLDERELFK TDLLPFLALC PALSSIMSAH IMLPQIDPDY PATLSERVIR GLLRDRLGFR GVVFTDDLCM GAITTQYSPD DAAFLSLKAG CDLPLICHDP LPWLDGLASR QESLNAYDRW DSFKRVEKLS DSLCFPFPEK ASLWDSCLRR AEALCRLEED GREKLPSSPV QKY
|
| |