Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1032 |
Symbol | |
ID | 6274083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1224627 |
End bp | 1226183 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642613081 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001877639 |
Protein GI | 187735527 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATCAA TTTTGCTAGC CGCAGCGTTC CTGGGCTCCC TCTGCTGGGC CGGAACCAAT CCCTACAACA TTATTCCGGA GCCCGTCAAC GTGACGACAA CTTCCGGAAC TACCAAAAAC CTCAAAATCG TCCATGAGCA AAAAGTCGCC GGACTGGGCA ATGAAGGGTA TGCCATGAAA CTAACGCCCG GCGGCGTGGA ACTCCGTTAT ACCACGCCCA ACGGGAAGGC CATGGCCATG GCGACCCTGT TCCAGCTCCA GGACCAGCTT TCGGATACTC CTGAGGGACT TCCCTGCGGC AGCATCCAGG ATTCCCCCGA CTTCGGCTGG CGCGGCATGA TGGTTGACGT AGGCCGCTAC CACTATCCCA TGAAGGAGAT CTACAATTTT GTGGACGCCA TGCATTATTA CAAATACAAC GTCCTGCATC TCCATCTGAC GGAAGACCAG GGCTGGCGTC TCCCCGTTCC GGGCTACGAC AAGCTCCGCA CCATCGGCGC CGTCCGCCCC TCCGCTCCGG AAAGCCAGAA CAACTCCCTG CTGGCCAATG AAGGCATGTA TACCAAGAAG GAGCTCCAGG ACCTGGTAGC CTACTGCAAA GCGCGCGGCA TCCAGGTACT GCCGGAAGTG GAAATGCCGG GTCATAACAT GGCCCTGGCC GCGTCCTATC CCGAATTCTG CTGCAACACC AAACGGGCCC AGGTATGGAC GCACGGCGGT GTTTCCTCCA AGCTGATTTG CCCGCAGAAA CCGGCCACTA AAAAGTTTCT TAAGGATACC TTCAATACCG TCCAGCAGAT ATTCCCTTTC CCGTACATCC ACATCGGAGG TGACGAATGC CCCATGGGGG ACTGGAAGAA GTGCCCGGAC TGCCAGGCCG CCCGAGCCAA AAAGGGCCAG GGGGATAATG TGGAAGCCCA GATGAGCGAT TTCACGAAAA GCCTGACGGC CATGCTCGCC AAGCACCGGA AAAAGCCCAT CCTGTGGTAT GACATCAACA AGAGCTATTA CCACAAGGGG GAAACCGTCA TGTCCTGGCT GCCGGGAGAA TTCCCGCGTT GCATTGATAA GACGAAGGAA CAGGGCATCG ACCTCATCGT CACCCCCCAG TTCAAGTATT ATCTGGCGCG TACCCAGATG AAATTCCCGG CGGACGACGT GCGCGCCCGG CCCGGTGGAG CTCCCATCCT GCTGAAAGAC TGCTACAACT TCGATCCCCG CAACGGACGG GACAAGAATG ACGTCAAGCA CATCAAGGGA ATCAACCTCT GCATGTGGGC GGAATGGATT CCCTCCGGCG AATTGCTGAT GTACATGACC TACCCCCGCG CCATGGCTGT TTCCGAAACC GCGTGGGGCA GCCACAAGAA CCGTCCAAGC CTGGAAGAGT TTGAAAAGAA AATGGAAACC CACAAGAAAC ATTTCCAGAA GCGTTTCGGC TATACTCTGG AACGCACTGT GGAAAACAAA CCCTACCGGG AAAAATTCAT CACCCAAGAG GAAATCGAAC GTATTAACGA GAATTATAAA AAGGGCCAGC AAAACGCGGA CAAATAG
|
Protein sequence | MKSILLAAAF LGSLCWAGTN PYNIIPEPVN VTTTSGTTKN LKIVHEQKVA GLGNEGYAMK LTPGGVELRY TTPNGKAMAM ATLFQLQDQL SDTPEGLPCG SIQDSPDFGW RGMMVDVGRY HYPMKEIYNF VDAMHYYKYN VLHLHLTEDQ GWRLPVPGYD KLRTIGAVRP SAPESQNNSL LANEGMYTKK ELQDLVAYCK ARGIQVLPEV EMPGHNMALA ASYPEFCCNT KRAQVWTHGG VSSKLICPQK PATKKFLKDT FNTVQQIFPF PYIHIGGDEC PMGDWKKCPD CQAARAKKGQ GDNVEAQMSD FTKSLTAMLA KHRKKPILWY DINKSYYHKG ETVMSWLPGE FPRCIDKTKE QGIDLIVTPQ FKYYLARTQM KFPADDVRAR PGGAPILLKD CYNFDPRNGR DKNDVKHIKG INLCMWAEWI PSGELLMYMT YPRAMAVSET AWGSHKNRPS LEEFEKKMET HKKHFQKRFG YTLERTVENK PYREKFITQE EIERINENYK KGQQNADK
|
| |