Gene Amuc_0625 details


Gene Information

Locus tag: Amuc_0625
Symbol:
ID: 6274199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 733948
End bp: 735207
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 57%
IMG OID: 642612676
Product: Exo-alpha-sialidase
Protein accession: YP_001877243
Protein GI: 187735131
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4409] Neuraminidase (sialidase)
TIGRFAM ID:
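
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS whose length includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(733948, 735207)
print(length)                     # 1260
print(protein_length_aa(length))  # 419
```

Under those assumptions the values agree with the listed gene length of 1260 bp and protein length of 419 aa.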


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.377281
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.254881
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATGGT TGTTGTGCGG CAGGGGAAAA TGGAATAAGG TGAAGAGGAT GATGAATTCC 
GTATTCAAGT GTTTGATGAG TGCCGTATGC GCCGTGGCAT TGCCGGCGTT CGGGCAGGAA
GAGAAAACCG GTTTCCCTAC GGACAGGGCT GTGACCGTAT TCAGCGCCGG GGAGGGTAAT
CCCTATGCGT CCATCCGTAT TCCCGCCCTG CTCAGTATCG GCAAGGGCCA GCTTCTGGCA
TTCGCCGAAG GACGGTACAA AAATACCGAC CAGGGGGAGA ACGATATTAT CATGAGCGTC
AGCAAGAATG GCGGGAAGAC CTGGTCCCGT CCCCGGGCGA TAGCCAAGGC CCATGGCGCC
ACGTTCAATA ATCCGTGCCC CGTTTATGAT GCCAAAACCA GGACCGTGAC TGTCGTATTC
CAGCGTTACC CTGCCGGGGT CAAGGAGCGG CAGCCCAATA TCCCGGACGG ATGGGATGAT
GAAAAGTGCA TCCGCAATTT CATGATTCAG AGCAGGAACG GAGGTTCTTC CTGGACGAAG
CCGCAGGAGA TCACGAAGAC GACCAAGCGT CCTTCCGGAG TGGATATTAT GGCGTCCGGC
CCGAATGCGG GAACCCAGCT GAAGAGCGGC GCCCACAAGG GCCGCCTGGT GATTCCGATG
AATGAAGGGC CGTTCGGCAA ATGGGTGATT TCCTGCATTT ACAGCGATGA CGGCGGCAAG
AGCTGGAAGC TGGGCCAGCC GACTGCCAAT ATGAAGGGCA TGGTGAACGA GACGTCCATT
GCGGAAACGG ATAACGGCGG CGTTGTGATG GTTGCGCGCC ATTGGGGCGC AGGCAATTGC
CGCCGTATTG CGTGGTCCCA GGATGGCGGG GAGACCTGGG GACAGGTGGA GGACGCTCCG
GAGCTGTTTT GCGACAGTAC CCAGAATTCC CTGATGACGT ATTCCCTGAG CGACCAGCCT
GCCTATGGCG GCAAAAGCCG CATTCTGTTT TCCGGGCCCA GTGCGGGCCG GCGCATTAAG
GGACAGGTGG CCATGAGCTA TGACAACGGC AAGACCTGGC CGGTGAAGAA ATTGCTGGGC
GAGGGCGGTT TTGCCTATTC CAGCCTTGCC ATGGTGGAAC CCGGCATCGT TGGGGTGCTT
TATGAGGAGA ACCAGGAGCA TATTAAAAAG CTGAAGTTTG TTCCCATTAC CATGGAATGG
CTGACGGACG GAGAAGACAC AGGGCTGGCT CCCGGCAAAA AAGCTCCTGT TCTCAAGTAG
 
Protein sequence
MTWLLCGRGK WNKVKRMMNS VFKCLMSAVC AVALPAFGQE EKTGFPTDRA VTVFSAGEGN 
PYASIRIPAL LSIGKGQLLA FAEGRYKNTD QGENDIIMSV SKNGGKTWSR PRAIAKAHGA
TFNNPCPVYD AKTRTVTVVF QRYPAGVKER QPNIPDGWDD EKCIRNFMIQ SRNGGSSWTK
PQEITKTTKR PSGVDIMASG PNAGTQLKSG AHKGRLVIPM NEGPFGKWVI SCIYSDDGGK
SWKLGQPTAN MKGMVNETSI AETDNGGVVM VARHWGAGNC RRIAWSQDGG ETWGQVEDAP
ELFCDSTQNS LMTYSLSDQP AYGGKSRILF SGPSAGRRIK GQVAMSYDNG KTWPVKKLLG
EGGFAYSSLA MVEPGIVGVL YEENQEHIKK LKFVPITMEW LTDGEDTGLA PGKKAPVLK
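
The gene and protein sequences above are printed in space-separated blocks of ten. The following is a minimal sketch of how such text might be joined, translated, and checked for GC fraction. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (differing only in the permitted start codons); all helper names are hypothetical.

```python
from itertools import product

# Standard amino-acid assignments with codons enumerated in T, C, A, G order.
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three blocks of the gene sequence give the first protein block.
first_blocks = clean_sequence("ATGACATGGT TGTTGTGCGG CAGGGGAAAA")
print(translate(first_blocks))  # MTWLLCGRGK
```

Passing the full gene sequence through `clean_sequence` and then `translate` or `gc_fraction` would allow a comparison against the protein sequence and the GC content listed in the record.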