Gene Amuc_0235 details


Gene Information

Locus tag: Amuc_0235
Symbol:
ID: 6275288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 292855
End bp: 294519
Gene Length: 1665 bp
Protein Length: 554 aa
Translation table: 11
GC content: 56%
IMG OID: 642612283
Product: alpha-glucan phosphorylase
Protein accession: YP_001876859
Protein GI: 187734747
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0058] Glucan phosphorylase
TIGRFAM ID: [TIGR02094] alpha-glucan phosphorylases
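
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that relationship, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino-acid count for a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(292855, 294519)
    print(length)                  # 1665
    print(protein_length(length))  # 554
```

Under these assumptions, the values reproduce the Gene Length (1665 bp) and Protein Length (554 aa) fields listed in the record.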


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value: 0.400789
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAAT CATTTTTACC CACCATTTTC GAGCACCCGT ATGAAATAGA CCCCAAATAT 
GGGAAAAGCG TTGCGTACTT CTCCATGGAA TACGCCATTG ATAATTCATT TAAAATCTAT
TCCGGCGGCC TGGGCTATCT GTCCGGCTCC CATATGCGCA GTGCGCATGA TCTGCGCCAG
AACCTCGTAG GCGTCGGCAT TCTGTGGAGC TACGGCTACT ACAACCAGAT CCGTGCGGAG
GACGGATCCA TGGCCACGCA ATACATGCGT AAAAACTACC CGTTCCTGGA AGATCACAAC
ATCAAATTCC TCATTCATGT CTGCGGAGCG CCCGTCTGGG TAAAAGCCTA CTTCCTGAAT
CCGGAAACAT TCGGCACTGC GCCCATGTTC TTCCTTTCCA CGGACCTGGA GGAAAATGAC
GAGGAAAGCC GCAATATCTC CCGCCGCCTG TACGACGCCA ACGGCTTCAC CCGCATCGCC
CAGTACGTAC TGCTGGGCAA AGGAGGCGCC CGCCTTTTTG ACGAACTGGG CATTGAACCG
GAAATCTACC ATCTAAACGA AGCCCACGGC CTGGCTGCGG CCTTCCACGT GCTTGCCAAG
ACGGGCAGCG TGGAAGAAGT GCGCAAGCGT TTCGTTTTCA CTACCCATAC CCCGGAAGAA
GCCGGCAACG AAAAGATGGA CGTAAACACC ATGAATACGT TCTCCTTCTT CGACGGGCTC
TCCATGGAAC AGGTGCGCGC CGCCGTAGGC ATGACGGACA ATACGTTCAA CTACACGCTG
GCCGCCCTGC GCCTCTCCCA CATCGCCAAC GGCGTTTCCA AACTCCATGG AGAAGTATCC
CGCCAGATGT GGAAGGACTA TGAAGGCATA TGCCCCATCA TCCATATCAC GAATGCCCAG
AACCAGAAAT ACTGGCAGGA TCCGGAACTG GCGGAAGCTT TCCAGGCCCG CAACAAGGAA
GCGTTCATCC GCCGCAAGCG GGCTCTGAAA AAAGCCCTCT TCCGCATGGT GGGTGAACAA
ACGGGCCGCG TCTTTGACCC GAACTGCCTC ACCATCGTAT GGGCTCGCCG TTTTGCGGCC
TACAAGCGTC CGGACCTGGT CACCGGCAAT CCCACGATGT TCGAACGCAT GCTCCAGCGC
ACCAACTATC CGGTGCAGTT CATCTGGGCC GGCAAACCCT ACCCGATGGA TCATGGCGCT
ATTGAAATCT TCAACCGCCT GAATGACTTG ACGGCCAAAT ATCCCCGTTC CGCCGTGCTC
ACCGGTTATG AACTGGGCCT GAGCCGCTAC CTGAAAAACG GCTCCGACGT ATGGCTCAAC
AACCCCGTGG TAACCCGGGA AGCATCCGGC ACCTCCGGCA TGTCCGCCGC CATGAACGGC
TCCATCTCCG TCTCCACCAA TGACGGCTGG ATCTGTGAAT TCGCCAAGGA CGGAGAAAAC
TGCTTCGTCA TTCCGGAAGC GCCCGCCCAT CTCTCCCCGG AAGCCCGCGA CCGCTCCGAC
CGCGATAACT TCTACAACAT TCTGGACGAC AAGATTCTTC CCCTCTACTA CGATCACAAC
GACAAGTGGA TGGACATCGT TTTCAATGCG ATGACAGACA TCTATCCCGA ATTCGACTCC
GACCGCATGG CGGATCAGTA CTACACGGAA ATGTACAACA GCTAA
 
Protein sequence
MAKSFLPTIF EHPYEIDPKY GKSVAYFSME YAIDNSFKIY SGGLGYLSGS HMRSAHDLRQ 
NLVGVGILWS YGYYNQIRAE DGSMATQYMR KNYPFLEDHN IKFLIHVCGA PVWVKAYFLN
PETFGTAPMF FLSTDLEEND EESRNISRRL YDANGFTRIA QYVLLGKGGA RLFDELGIEP
EIYHLNEAHG LAAAFHVLAK TGSVEEVRKR FVFTTHTPEE AGNEKMDVNT MNTFSFFDGL
SMEQVRAAVG MTDNTFNYTL AALRLSHIAN GVSKLHGEVS RQMWKDYEGI CPIIHITNAQ
NQKYWQDPEL AEAFQARNKE AFIRRKRALK KALFRMVGEQ TGRVFDPNCL TIVWARRFAA
YKRPDLVTGN PTMFERMLQR TNYPVQFIWA GKPYPMDHGA IEIFNRLNDL TAKYPRSAVL
TGYELGLSRY LKNGSDVWLN NPVVTREASG TSGMSAAMNG SISVSTNDGW ICEFAKDGEN
CFVIPEAPAH LSPEARDRSD RDNFYNILDD KILPLYYDHN DKWMDIVFNA MTDIYPEFDS
DRMADQYYTE MYNS
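
The gene and protein sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The sketch below, in Python, strips the whitespace, translates a DNA string codon by codon, and computes a GC fraction. It is an illustration only: the function names are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs mainly in which codons may act as start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate DNA codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence shown above.
    first_line = (
        "ATGGCGAAAT CATTTTTACC CACCATTTTC "
        "GAGCACCCGT ATGAAATAGA CCCCAAATAT"
    )
    dna = clean_sequence(first_line)
    print(translate(dna))  # MAKSFLPTIFEHPYEIDPKY
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.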