Gene Information |
Locus tag | Amuc_1669 |
Symbol | |
ID | 6274577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2021876 |
End bp | 2023519 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642613727 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001878268 |
Protein GI | 187736156 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
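The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed values) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, excluding one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2021876, 2023519)
print(length)                     # 1644
print(protein_length_aa(length))  # 547
```

Both results agree with the Gene Length and Protein Length fields of the record.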
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.254995 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.452028 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTTCA GACATGTTTC CCATGTGTTT CTTTTTCTGT TTTCCGGCTT ATGCGGCACG GCCGTGAATG CTCAGGATCA GATAATACCC CGGCCCGTGT CCGTGAAGGT AGAGGCCAAG GGGGCGGCAA CGCCCGTGAA GCTGGGTGAA GGAATGCGCA TTATATGCAA GGAGAAGGAC GGGGAGTTCC GGCGCCAGGC TCACTTGTTG CAGCAGTTTT TGTCACGCGG AACCGGATTG GCTCTGGACG GGGCTGGCGG TACGGGAATA ATCCGCGTTG TGAAGGACGC TGCCCTGAAG CAGTACGGGC CGGAGGCATA CCGTTTGGAA GCGGCTCCCG GAAATATCGT CATCAGCGCC GCGACTCCCA AAGGAGTTTA TTATGCGGGG CAAAGTCTGG CGCAGATGCT GCCTGCCGCC TTTTTCAACA ATGATGCGGA CAAAAAGAGC GTAAGTTGGA ATGTAGCGGA GAAGCCTTTT TCCATATTGG ATTATCCCCG TTTTGCCTGG CGAGCCTTCA TGCTGGATGA AGCACGGCAT TTTTTTGGAG AGGAGGAGGT GAAGAGGCTG ATTGACCAAA TGGGGCTCCT GAAAATGAAT GTCCTGCACT GGCATTTGTC GGATGATGCC GGATGGCGCA TCCAGATTAA AAAATATCCC AAACTGACAT CCGTGGGAGG CAAGAGGAGA GATACGGAGA TAGAGACGTG GGGAAGCGGC AAGTATGAAG GCAGGCCCCA TGAAGGCTTT TATACGCAGG AACAGGTCAG GCGCATTGTC CGGTATGCGG CCGACAGGAA TATAACGATT GTTCCTGAAA TAGATATTCC GGGCCATTCC GCCGCCGCCA TCGTTTCCTA TCCGGAGTTG AAATTGTCCG CCAGGCCTTT CGCGGAAGTG CCCGTGAGTT TCAATGACGG AGCGGCGTTT GATCCCACCA GCGAACGTAC CTACCAGTTC CTGGGAGACA TCATGACAGA GCTGGCTTCC CTCTTTCCCG GAGGTATCAT TCACATCGGC GGTGACGAGG TGCGGTACAA AAAGTACTGG GAAGGCGTGC CCCACATTGA GGCGTTCATG AAGAAAAAAG GCATTAAGAC TTTTCCCGAC CTTCAAATCA TGTTTACGAA TCGGATTTCC GGCATGCTGG CTGGGATGGG GCGCCGGATG ATGGGGTGGA ATGAAATACT GGGGTCCGAC GTGCACAATG ACGGCGGCCG GGGAGCTGCG CTCGGCAAGC TGGATGCAAA TGCTATCATT CATTTCTGGT ATGGGTCCGA TAAAATTGCC GCCAAGGCGA TCAGGGAAGG CCGCCAGGTG GTGAATTCCA CTTCCCACAT GACTTACATT AACAAAGGAT ATGACAAGCT CCCTCTGTCC AGGTCCTATT CTTTTGAGCC CGTTTTTGCG GGATTGAACC CGGAGCAGCA GAAGAATGTC ATCGGGCTGG GCTGCCAGGT CTGGACGGAA TGGATTGCGG ACGTGGAGAA ACTGCACCGT CATGTATTTC CCCGTATTGC CGCTTATGCG GAGACGGGGT GGACCCGGAA GGAGGATAAG AATTTTCAGG ATTTCCAGAG GCGTTTGACG GGGTATGAAA AAATTTTGGA TGCCCAGGGC ATCCGGCACG GTGAAAAGGA GTAG
|
Protein sequence | MMFRHVSHVF LFLFSGLCGT AVNAQDQIIP RPVSVKVEAK GAATPVKLGE GMRIICKEKD GEFRRQAHLL QQFLSRGTGL ALDGAGGTGI IRVVKDAALK QYGPEAYRLE AAPGNIVISA ATPKGVYYAG QSLAQMLPAA FFNNDADKKS VSWNVAEKPF SILDYPRFAW RAFMLDEARH FFGEEEVKRL IDQMGLLKMN VLHWHLSDDA GWRIQIKKYP KLTSVGGKRR DTEIETWGSG KYEGRPHEGF YTQEQVRRIV RYAADRNITI VPEIDIPGHS AAAIVSYPEL KLSARPFAEV PVSFNDGAAF DPTSERTYQF LGDIMTELAS LFPGGIIHIG GDEVRYKKYW EGVPHIEAFM KKKGIKTFPD LQIMFTNRIS GMLAGMGRRM MGWNEILGSD VHNDGGRGAA LGKLDANAII HFWYGSDKIA AKAIREGRQV VNSTSHMTYI NKGYDKLPLS RSYSFEPVFA GLNPEQQKNV IGLGCQVWTE WIADVEKLHR HVFPRIAAYA ETGWTRKEDK NFQDFQRRLT GYEKILDAQG IRHGEKE
|
| |
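The gene and protein sequences above are related by translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code (the tables differ in which start codons are allowed). The sketch below is a minimal illustration under stated assumptions: the helper names `clean`, `translate`, and `gc_percent` are hypothetical, ambiguous bases such as N are not handled, and alternative start codons are not remapped to M (this gene starts with ATG, so that does not matter here). The sequences are written in space-separated blocks of 10, so whitespace is stripped first.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); translation table 11
# uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in the record above.
first_blocks = "ATGATGTTCA GACATGTTTC CCATGTGTTT"
print(translate(first_blocks))  # MMFRHVSHVF
```

The output matches the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be checked in the same way.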