Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0397 |
Symbol | |
ID | 6274810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 479336 |
End bp | 481504 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642612448 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001877017 |
Protein GI | 187734905 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.571244 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 78 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTTGT TCAGAGTGTT CAGCATTCCC GTACGGTTTT TTATTGCCTT GCTGATTTAT TTTTCATGGG GGCCGCTGGC CGGAGGGGTG GTTACCGCGC CTGTGCCGGA GTTTTCTTCC CCCGCGCTTA TTCCATATCC TTCCAAAGTC GTCAGGGGCG CGGGCGAGGC AGGTTTCAGG AGCGTTCATG TGAAGGTTGG TTCGGATGTT CCTGGCAGGG ATGATTTGAT GAAGGAGATC AAGGACATAT TCAGGACTTC CGGCATTCAG GGTTCCCTGA ATAGTGGGGG GGCTGCTGAA GGTGCCATGA CATGGGAATT GTTTCCGGAC GCCAGGATGA AAGGAGAGGG GTATGACCTG TCCGTCGGAT TGGGAAAAGC AACCGTCAGG GCAGGAAGTT TTGGCGGTTT TTTTAACGCT CTTCAAACGC TGCGGCAACT GGTTTTCAAT AGAGAAGGGG AATTTGCCAT GCCTGTTGTG CGTATTAGTG ACAAACCCGC TTTCGTCCTG CGGGGAATTA TGCTGGATGT GGGGCGTTAT TACATGTCTC CTGCCTTGAT CAAGGAGGTG ATGCGCCGTC TTTCCCGCTA TAAGATTAAT ACGCTGCATC TTCATTTGAC GGATGATCCC GCATGGAGGC TGGAAGTGAA GAAGTATCCG GCCCTGACGG ATGGGGCTTT TCATTGGAAG TCCCGGCTCC CGGGGCGGTT TTATACACAG GCTCAATTGA AGGATCTGAC GGATTATTGC GCCCGGTTGA ATATCCAGGT GATTCCGGAG ATTGACATGC CGGGACACAG CCAGCCTTTT GCCAGGGCCA TGAAGACCGG CATGCAGACG GAGAAGGGTG TTTCCATATT GAAGGATGTC GTGGACGAGG CCGTTTCCCT GTTTCCCGGG AGGTTTTTCC ACATGGGATC CGACGAGGCG CACATTTCCA TGAAGGATTT TATCCCACGT ATGGCGGAGC ATATCCGCGG GAAGGGCAAG GAGGTGGTCG TGTGGTCTCC GGGCGGCCCG CACGATAAGG ATTCCGTATT GATGTGCTGG GGAGAGAACG AGGCCGGCGC CAGGATGGAT AAGAATATGA AGCGCATCGA CAGCAATGGT TTTTACATTG ACTGGGCGGA TTCCCAGTCC GGCGTGTACC AGGTGTTTTT CCAGCAGCCC TGCGAGGTGC CTCAAGGGGA TGATAAGGCG TTGGGGGCCA TCATGCCCGT GTGGTGCGAC GGAAACCTGA GCAGTGAGAG GCGGGTGCTG GAACAGTATC CGTTTTATCC TTGTGCATTG ACGTTTGCAG AGCGTGTCTG GCGCGGAAGC GCTACCAAGA GGAGGGATTA TATGGCCCAG CTTCCCCCCA GGGGAACGGA CGGCTGGAAG GAATTCCGGG AGTTTGAACA GCGTTTGGCT TTCCATCGTG ATCATTTTTT CCAAGGCGTT CCTTTTGCCT ATGTGAAACA GGCGGACGTT GCCTGGAGCC TGGTGGGGCC ATTCGACCAC CGGGGGAAGA ATGATACCTC TTTTGAACCG GAAAGAAGAA TAGCCCCTTC CTACAGGGAC GGGGACAGGA TACTGGCCTG GAAGAAGACT CCTGTTTACG GCGCCGCCGT ACATGTCAGG CATCTTTTTG CGATGTTCAA CATGCACCGG AACCAGTACC GGACGGACCA CTGGCCCACG CTGATGTCAA GGGAGGTTGG AAAGGAGGAT GGAACCTGCT ATGCTCTGAC TTTCATCCGC AGCCCCAGGG AGCAGGAGGT GTGGCTGATG TTTGGCCTGA ACGGCATGTG GGGGCATTCA GGCGGGTACC GCAGCGCACG CGCCCCTGAA CAGGGAAGCT GGGATTTTTC CGGCGGGGAC GTATGGTTGA ACGGAAGGCG CGTGAACCCT CCCAGGTGGC CTTTCAAGAG CCTGCCCTGG ACGGGGTGGG GAAAGGGGCG CATTGAAGAA GCTCCTCTGA CCTGGGAAGG GTATTTTTTC CGCCCTCCTG TAAAGATCAA GCTCCGCAAG GGGTTGAACC GTGTATTGAT CCGCAGCGTG TTCGGGCACT GGAAGGGAGA CGACGGCCAG AGAAGCTGGT TTTTCTGCTG CATTCCCGTC CTGTGGGACG GCATTCATTA CCGGGAGGTT CCCGGCCTGG AATATGACCC CCGTCCGGAT GCCCGCTGA
|
Protein sequence | MLLFRVFSIP VRFFIALLIY FSWGPLAGGV VTAPVPEFSS PALIPYPSKV VRGAGEAGFR SVHVKVGSDV PGRDDLMKEI KDIFRTSGIQ GSLNSGGAAE GAMTWELFPD ARMKGEGYDL SVGLGKATVR AGSFGGFFNA LQTLRQLVFN REGEFAMPVV RISDKPAFVL RGIMLDVGRY YMSPALIKEV MRRLSRYKIN TLHLHLTDDP AWRLEVKKYP ALTDGAFHWK SRLPGRFYTQ AQLKDLTDYC ARLNIQVIPE IDMPGHSQPF ARAMKTGMQT EKGVSILKDV VDEAVSLFPG RFFHMGSDEA HISMKDFIPR MAEHIRGKGK EVVVWSPGGP HDKDSVLMCW GENEAGARMD KNMKRIDSNG FYIDWADSQS GVYQVFFQQP CEVPQGDDKA LGAIMPVWCD GNLSSERRVL EQYPFYPCAL TFAERVWRGS ATKRRDYMAQ LPPRGTDGWK EFREFEQRLA FHRDHFFQGV PFAYVKQADV AWSLVGPFDH RGKNDTSFEP ERRIAPSYRD GDRILAWKKT PVYGAAVHVR HLFAMFNMHR NQYRTDHWPT LMSREVGKED GTCYALTFIR SPREQEVWLM FGLNGMWGHS GGYRSARAPE QGSWDFSGGD VWLNGRRVNP PRWPFKSLPW TGWGKGRIEE APLTWEGYFF RPPVKIKLRK GLNRVLIRSV FGHWKGDDGQ RSWFFCCIPV LWDGIHYREV PGLEYDPRPD AR
|
| |