Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1835 |
Symbol | |
ID | 6275505 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2229943 |
End bp | 2231967 |
Gene Length | 2025 bp |
Protein Length | 674 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642613899 |
Product | Exo-alpha-sialidase |
Protein accession | YP_001878434 |
Protein GI | 187736322 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4409] Neuraminidase (sialidase) |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCTTG GCCTGTTGTG CGCACTGGGC CTGTCTATTC CCTCCGTTCT CGGCAAGGAA AGCTTTGAGC AGGCCAGGCG TGGCAAATTT ACAACGCTTT CCACCAAATA CGGCCTTATG TCCTGCCGGA ACGGTGTGGC GGAAATCGGA GGAGGGGGAA AATCCGGAGA AGCCTCCCTG CGGATGTTCG GCGGACAGGA TGCTGAATTG AAACTGGACT TGAAGGATAC GCCTTCCAGG GAAGTCCGGC TTTCTGCCTG GGCGGAGCGA TGGACCGGGC AGGCCCCTTT TGAATTTTCC ATTGTGGCCA TAGGGCCGAA TGGAGAAAAG AAAATTTATG ACGGCAAGGA TATCAGGACG GGCGGGTTTC ATACCAGGAT AGAAGCCAGT GTTCCTGCCG GAACGCGTTC CCTGGTGTTC AGGCTTACTT CTCCGGAAAA CAAGGGAATG AAGCTGGACG ACCTGTTTCT TGTTCCCTGT ATTCCCATGA AAGTGAATCC GCAGGTGGAG ATGGCCTCTT CCGCTTACCC GGTGATGGTG CGTATCCCGT GCAGCCCCGT TCTTTCTCTG AATGTCCGGA CGGACGGCTG CCTTAATCCT CAGTTCCTGA CAGCTGTCAA TCTGGATTTT ACGGGTACGA CGAAGCTTTC CGACATTGAG TCCGTGGCTG TAATACGGGG GGAAGAGGCC CCTATCATCC ATCATGGGGA AGAGCCGTTC CCGAAAGACT CTTCCCAGGT TCTTGGTACA GTAAAGCTTG CCGGTTCCGC CAGACCCCAG ATTTCTGTGA AGGGGAAAAT GGAGCTGGAG CCCGGAGACA ATTACCTGTG GGCTTGCGTG ACGATGAAAG AAGGAGCCTC CCTGGACGGC AGGGTGGTGG TGCGTCCGGC CAGCGTTGTG GCGGGCAATA AACCGGTGAG GGTTGCCAAT GCGGCTCCCG TGGCGCAGCG CATCGGCGTG GCCGTAGTCA GGCATGGGGA TTTCAAATCA AAATTCTACC GTATTCCCGG TCTGGCCCGT TCCAGGAAGG GGACCCTGCT GGCCGTGTAC GATATCCGGT ACAACCATTC CGGAGACCTT CCGGCCAACA TTGATGTGGG CGTAAGCCGC TCTACGGACG GAGGCCGCAC CTGGTCTGAT GTCAAAATCG CCATTGATGA TTCCAAGATT GACCCCTCTC TGGGGGCTAC CAGGGGCGTA GGGGATCCGG CCATTCTGGT GGATGAAAAG ACGGGGCGCA TCTGGGTGGC CGCCATATGG AGCCACAGGC ATTCCATCTG GGGCAGCAAG TCCGGAGACA ATTCTCCGGA GGCCTGCGGA CAGCTGGTGC TGGCCTACAG CGATGACGAT GGCCTGACCT GGTCCAGTCC GATCAATATC ACGGAACAAA CCAAGAACAA GGATTGGCGC ATTTTATTTA ATGGCCCCGG CAATGGCATT TGCATGAAAG ACGGCACGCT GGTCTTCGCC GCCCAGTACT GGGACGGCAA AGGGGTGCCG TGGTCCACCA TTGTTTATTC CAAAGACCGG GGAAAAACCT GGCACTGCGG CACGGGCGTC AACCAGCAGA CGACGGAAGC CCAGGTGATT GAGCTGGAAG ACGGCTCCGT CATGATCAAC GCCCGATGCA ACTGGGGCGG TTCCCGCATC GTGGGCGTTA CGAAAGACCT GGGCCAAACG TGGGAAAAAC ACCCCACCAA CCGCACTGCC CAGCTGAAGG AACCGGTCTG CCAGGGCAGC CTGCTTGCCG TGGACGGCGT TCCGGGCGCG GGCAGAGTGG TTCTGTTTTC CAATCCCAAT ACCACATCCG GACGTTCCCA CATGACGTTG AAAGCTTCTA CGAATGATGC CGGGTCATGG CCGGAAGACA AATGGCTTCT TTATGATGCC CGCAAAGGCT GGGGATATTC CTGCCTGGCG CCGGTAGATA AGAACCATGT GGGCGTGCTG TACGAATCCC AGGGGGCGCT GAACTTCCTG AAAATTCCCT ATAAGGATGT TCTTAACGCA AAAAATGCGC GCTGA
|
Protein sequence | MGLGLLCALG LSIPSVLGKE SFEQARRGKF TTLSTKYGLM SCRNGVAEIG GGGKSGEASL RMFGGQDAEL KLDLKDTPSR EVRLSAWAER WTGQAPFEFS IVAIGPNGEK KIYDGKDIRT GGFHTRIEAS VPAGTRSLVF RLTSPENKGM KLDDLFLVPC IPMKVNPQVE MASSAYPVMV RIPCSPVLSL NVRTDGCLNP QFLTAVNLDF TGTTKLSDIE SVAVIRGEEA PIIHHGEEPF PKDSSQVLGT VKLAGSARPQ ISVKGKMELE PGDNYLWACV TMKEGASLDG RVVVRPASVV AGNKPVRVAN AAPVAQRIGV AVVRHGDFKS KFYRIPGLAR SRKGTLLAVY DIRYNHSGDL PANIDVGVSR STDGGRTWSD VKIAIDDSKI DPSLGATRGV GDPAILVDEK TGRIWVAAIW SHRHSIWGSK SGDNSPEACG QLVLAYSDDD GLTWSSPINI TEQTKNKDWR ILFNGPGNGI CMKDGTLVFA AQYWDGKGVP WSTIVYSKDR GKTWHCGTGV NQQTTEAQVI ELEDGSVMIN ARCNWGGSRI VGVTKDLGQT WEKHPTNRTA QLKEPVCQGS LLAVDGVPGA GRVVLFSNPN TTSGRSHMTL KASTNDAGSW PEDKWLLYDA RKGWGYSCLA PVDKNHVGVL YESQGALNFL KIPYKDVLNA KNAR
|
| |