Gene Amuc_1835 details


Gene Information

Locus tag: Amuc_1835
Symbol:
ID: 6275505
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2229943
End bp: 2231967
Gene Length: 2025 bp
Protein Length: 674 aa
Translation table: 11
GC content: 57%
IMG OID: 642613899
Product: Exo-alpha-sialidase
Protein accession: YP_001878434
Protein GI: 187736322
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4409] Neuraminidase (sialidase)
TIGRFAM ID:
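
The Start bp, End bp, Gene Length, and Protein Length values above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2229943
end_bp = 2231967
gene_length_bp = 2025
protein_length_aa = 674


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions the record is internally consistent: 2231967 - 2229943 + 1 = 2025 bp, and 2025 / 3 = 675 codons, one of which is the stop codon, leaving 674 residues.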


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCTTG GCCTGTTGTG CGCACTGGGC CTGTCTATTC CCTCCGTTCT CGGCAAGGAA 
AGCTTTGAGC AGGCCAGGCG TGGCAAATTT ACAACGCTTT CCACCAAATA CGGCCTTATG
TCCTGCCGGA ACGGTGTGGC GGAAATCGGA GGAGGGGGAA AATCCGGAGA AGCCTCCCTG
CGGATGTTCG GCGGACAGGA TGCTGAATTG AAACTGGACT TGAAGGATAC GCCTTCCAGG
GAAGTCCGGC TTTCTGCCTG GGCGGAGCGA TGGACCGGGC AGGCCCCTTT TGAATTTTCC
ATTGTGGCCA TAGGGCCGAA TGGAGAAAAG AAAATTTATG ACGGCAAGGA TATCAGGACG
GGCGGGTTTC ATACCAGGAT AGAAGCCAGT GTTCCTGCCG GAACGCGTTC CCTGGTGTTC
AGGCTTACTT CTCCGGAAAA CAAGGGAATG AAGCTGGACG ACCTGTTTCT TGTTCCCTGT
ATTCCCATGA AAGTGAATCC GCAGGTGGAG ATGGCCTCTT CCGCTTACCC GGTGATGGTG
CGTATCCCGT GCAGCCCCGT TCTTTCTCTG AATGTCCGGA CGGACGGCTG CCTTAATCCT
CAGTTCCTGA CAGCTGTCAA TCTGGATTTT ACGGGTACGA CGAAGCTTTC CGACATTGAG
TCCGTGGCTG TAATACGGGG GGAAGAGGCC CCTATCATCC ATCATGGGGA AGAGCCGTTC
CCGAAAGACT CTTCCCAGGT TCTTGGTACA GTAAAGCTTG CCGGTTCCGC CAGACCCCAG
ATTTCTGTGA AGGGGAAAAT GGAGCTGGAG CCCGGAGACA ATTACCTGTG GGCTTGCGTG
ACGATGAAAG AAGGAGCCTC CCTGGACGGC AGGGTGGTGG TGCGTCCGGC CAGCGTTGTG
GCGGGCAATA AACCGGTGAG GGTTGCCAAT GCGGCTCCCG TGGCGCAGCG CATCGGCGTG
GCCGTAGTCA GGCATGGGGA TTTCAAATCA AAATTCTACC GTATTCCCGG TCTGGCCCGT
TCCAGGAAGG GGACCCTGCT GGCCGTGTAC GATATCCGGT ACAACCATTC CGGAGACCTT
CCGGCCAACA TTGATGTGGG CGTAAGCCGC TCTACGGACG GAGGCCGCAC CTGGTCTGAT
GTCAAAATCG CCATTGATGA TTCCAAGATT GACCCCTCTC TGGGGGCTAC CAGGGGCGTA
GGGGATCCGG CCATTCTGGT GGATGAAAAG ACGGGGCGCA TCTGGGTGGC CGCCATATGG
AGCCACAGGC ATTCCATCTG GGGCAGCAAG TCCGGAGACA ATTCTCCGGA GGCCTGCGGA
CAGCTGGTGC TGGCCTACAG CGATGACGAT GGCCTGACCT GGTCCAGTCC GATCAATATC
ACGGAACAAA CCAAGAACAA GGATTGGCGC ATTTTATTTA ATGGCCCCGG CAATGGCATT
TGCATGAAAG ACGGCACGCT GGTCTTCGCC GCCCAGTACT GGGACGGCAA AGGGGTGCCG
TGGTCCACCA TTGTTTATTC CAAAGACCGG GGAAAAACCT GGCACTGCGG CACGGGCGTC
AACCAGCAGA CGACGGAAGC CCAGGTGATT GAGCTGGAAG ACGGCTCCGT CATGATCAAC
GCCCGATGCA ACTGGGGCGG TTCCCGCATC GTGGGCGTTA CGAAAGACCT GGGCCAAACG
TGGGAAAAAC ACCCCACCAA CCGCACTGCC CAGCTGAAGG AACCGGTCTG CCAGGGCAGC
CTGCTTGCCG TGGACGGCGT TCCGGGCGCG GGCAGAGTGG TTCTGTTTTC CAATCCCAAT
ACCACATCCG GACGTTCCCA CATGACGTTG AAAGCTTCTA CGAATGATGC CGGGTCATGG
CCGGAAGACA AATGGCTTCT TTATGATGCC CGCAAAGGCT GGGGATATTC CTGCCTGGCG
CCGGTAGATA AGAACCATGT GGGCGTGCTG TACGAATCCC AGGGGGCGCT GAACTTCCTG
AAAATTCCCT ATAAGGATGT TCTTAACGCA AAAAATGCGC GCTGA
 
Protein sequence
MGLGLLCALG LSIPSVLGKE SFEQARRGKF TTLSTKYGLM SCRNGVAEIG GGGKSGEASL 
RMFGGQDAEL KLDLKDTPSR EVRLSAWAER WTGQAPFEFS IVAIGPNGEK KIYDGKDIRT
GGFHTRIEAS VPAGTRSLVF RLTSPENKGM KLDDLFLVPC IPMKVNPQVE MASSAYPVMV
RIPCSPVLSL NVRTDGCLNP QFLTAVNLDF TGTTKLSDIE SVAVIRGEEA PIIHHGEEPF
PKDSSQVLGT VKLAGSARPQ ISVKGKMELE PGDNYLWACV TMKEGASLDG RVVVRPASVV
AGNKPVRVAN AAPVAQRIGV AVVRHGDFKS KFYRIPGLAR SRKGTLLAVY DIRYNHSGDL
PANIDVGVSR STDGGRTWSD VKIAIDDSKI DPSLGATRGV GDPAILVDEK TGRIWVAAIW
SHRHSIWGSK SGDNSPEACG QLVLAYSDDD GLTWSSPINI TEQTKNKDWR ILFNGPGNGI
CMKDGTLVFA AQYWDGKGVP WSTIVYSKDR GKTWHCGTGV NQQTTEAQVI ELEDGSVMIN
ARCNWGGSRI VGVTKDLGQT WEKHPTNRTA QLKEPVCQGS LLAVDGVPGA GRVVLFSNPN
TTSGRSHMTL KASTNDAGSW PEDKWLLYDA RKGWGYSCLA PVDKNHVGVL YESQGALNFL
KIPYKDVLNA KNAR
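
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how the two could be compared, the code below strips that layout, translates the DNA codon by codon, and computes a GC percentage. It uses only the Python standard library. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The amino acid assignments are those of the standard genetic code, which translation table 11 shares; the alternative start codons that table 11 allows are not handled here, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Amino acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence, and its last fifteen bases.
print(translate("ATGGGGCTTG GCCTGTTGTG CGCACTGGGC"))  # MGLGLLCALG
print(translate("AAAAATGCGC GCTGA"))                  # KNAR
```

The two printed fragments match the first ten and last four residues of the protein sequence listed in this record. Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison with the complete protein sequence and the reported GC content.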