Gene Amuc_1008 details

Gene Information

Locus tag: Amuc_1008
Symbol:
ID: 6274113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1200228
End bp: 1204163
Gene Length: 3936 bp
Protein Length: 1311 aa
Translation table: 11
GC content: 55%
IMG OID: 642613057
Product: glycoside hydrolase family 31
Protein accession: YP_001877615
Protein GI: 187735503
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1501] Alpha-glucosidases, family 31 of glycosyl hydrolases
TIGRFAM ID:
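
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(1200228, 1204163)
print(length)                     # 3936
print(protein_length_aa(length))  # 1311
```

Both values agree with the Gene Length and Protein Length fields in this record.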


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.590426
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAACT CAATAGACCA TGGCATGTTC CGGAAGTCCG TCCTGCTGGC TTTCGGATTG 
TCCATTTCCT TCTCGGGCTT CGCGCTTGCC GCATCGGAAG AACCGGTTTC GGAGCAACAG
GCCCAGGCCG TTGAAGGCGC CAGAAAAATC AATCCTGCCG CCGTGGAAGT GTTGCTTGAC
AACAACCGCC GGATGACGCT CGATTTTTAC GGAGACAATG TTTTCCGGAT TTTCCGGGAT
GACAGAGGCG GTATTATCCG AGATCCCAAG GCGGAACCGG AAGCCCGCAT TCTGGCGGAC
AGTCCCCGGA GGCCGGTTTC CCGGCTGGAT GTGGACCAGC AGGGCGATGC GGTTGTCATC
ACTACGGGGA AGGTCAGGAT TGAGATAAAT AAAAAGACCT CTCTTTTCAA GGTGATCAAT
CTGAAGGATC ATTCCGTGGT GGTGGAGCAG GCGTCCCCCG TGCTTTTTGG AAAGGGCAAA
ACCAGCTTTT CCCTGAAGGC GAAGCCGGAC GAATATTTTT ACGGCGGCGG CGTACAGAAC
GGCCGTTTTT CCCACAAGGG AAAGGTCATT TCCATAGAAA ACCAGAACAG CTGGACGGAT
GGCGGCGTGG CTTCCCCCAC TCCCTTTTAC TGGTCCACCG GCGGGTACGG CGTCATGTGG
CATACCTTCA AGAAAGGGCA GTATGATTTC GGTTCCAGGG AAGAGAATCT GGTGAACCTC
TCGCATGATG AAAATTATCT GGATGTCTTT TTCATGGTCA GCGACGGGCC GGTCAGCCTT
TTGAGGGATT TCTACCAGCT TACTGGCGCT CCCGTTCTGC TGCCCAAGTT CGCCTTTTAC
CAGGGCCACC TGAACGCCTA TAACCGGGAT TACTGGAAGG AAGATGAAAA AGGCATCCTG
TTTGAGGACG GGAAACGGTA CAAGGAAAGC CAGAAGGATA ACGGAGGCAT TAAGGAATCC
CTGAACGGGG AATTGAATAA TTACCAGTTT TCCGGCCGCG CCGTCGTGGA CCGTTACAAG
GCCCATGACA TGCCGCTGGG ATGGCTTCTG CCGAACGACG GCTACGGAGC CGGGTACGGC
CAGACGGATA CCCTGGACGG CAATATTGCG AATTTGAAGA GCCTGGCGGA CTACGCCAGG
AAAAACGGCG TGGAAATCGG TTTGTGGACC CAGTCCGACC TGCATCCCAA GCCGGAAATC
AGCGCTTTGC TCCAGCGCGA TATCGTGAAG GAAGTGCGGG ACGCCGGAGT GCGCGTGCTG
AAGACGGACG TCGCCTGGGT AGGCGCGGGC TATTCCTTCG GTCTGAACGG TATTACGGAC
GTAGCCCAGA TCATGACTTA CTACGGGAAT AACAGCCGCC CGTTCATTAT TTCCCTGGAC
GGCTGGGCCG GAACCCAGCG GTATGCCGGC ATTTGGTCGG GCGACCAGAC GGGCGGCGTC
TGGGAGTATA TCCGTTTCCA TATTCCCACC TACATCGGCT CCGGCCTTTC CGGGCAGCCC
AATATCTGTT CGGACATGGA CGGCATTTTC GGCGGAAAGA ACCCGTTGGT GAACGTCCGC
GATTTCCAGT GGAAAACGTT CACCCCCATG GAACTGAACA TGGACGGCTG GGGAGCCAAT
GAGAAGTATC CCCATGCCTT CGGGGAACCT TACACCTCCA TCAACCGATG GTACCTCAAG
CTCAAGTCGG AACTGCTTCC GTACGCTTAC AGCATTGCGG AGGAATCAGT TTCCGGCCTG
CCCATGATCC GGGCCATGTT CCTGGAATAT CCCAATCCCT ACACGCTGGG GAAAGCGACG
CAGTACCAGT TCCTCTACGG CCCTTACTTC CTGGTGGCAC CCGTTTACCA GGAAACCAGG
GCGGATAAGG AGGGGAATGA CGTCCGCCAT GGCATTTACC TGCCGGAAGG GCAGTGGATT
GATTATTTCA CCGGGGATTT GTATGAGGGC GGCAAGATTT ACAATGATGT TGACGCTCCT
TTGTGGAAGC TGCCCGTGTT CGTTAAAAAC GGCGCGATCA TCCCTATGGC GAACCCGAGC
AACAATGTCT CGGAAATCAA TCCCAATCTG CGCATTTACG AACTTTATCC GCACGGCAGC
ACCTCCTTTG CCACGTATGA CGACGATGGC GTGACGGAGG AATACAGAAC CGGCAGGGGC
GTCCGCACTC TGGTGGAGTC CAGGGTGGAC GGCAAAAACA ACGTGACCGT TACTGTTCAT
CCGGCGGTGG GGGATTTCGA CGGCTTCCAG AAAAAGAAGG CCACGGAATT CCGCATCAAT
GTGACGCAGA AGCCGTCCGG GGTTTCCGCC AAAATCGGGG GAAACAGCAT CAACCTGGCG
GAAGCGAATT CCCTGGAGGA TTTCAAATCC AGGGAGAATG TTTATTTTTA TGACCGGGCT
CCCAACCTGA ACAGGTTCGC AACGAAAGGC AGCGATTTTG AAAAAAAGGT GATCGCCGGG
AATCCGCAAT TGCTGGTGAA GCTGGCCGCA GCAGATATCA CGGCGGCCCC TACGGTGCTT
TCCGTGAAAG GGTTCCGGTT TGAACCGGCT GAAAAGTACC GCCTTTCTTC CGGAGCCCTG
ACCGCTCCCG CCAATGCTGC GGTGACGGAG GAAAATGCGG CGGCCTATAC CCTGAAGCCC
ACGTGGGACG CCGTCCCGAA CGCCGATTTT TATGAAATCG AGTTTGGAGG AATGCTGTAC
ACCACCATCA AGGGAACGGA ATTCCTGTTT GAGGATTTGG AGGCCGAGAC TCCTTATTCC
TTCAAGGTGC GGGCCGCCAA CCGGGACGGC CATTCCGCGT GGACGGCCGT CAGCGCGAAG
ACGAAAGCCA ATCCGCTTGA GTTCGCCATT CCGGGAATCG AAGGTGAAAC GAGCGTGGAA
AACCAGGGCA GCTCCCTGGC GAAGCTGTTT GATTTCAAGG AAAAGGATGT GTGGCACACG
AAGCATGATG CGAAGGCTGT TCCGTTTGAC CTGGTCATGG ACCTGAAAAC GATTAACAGG
CTTGAGAAGT TTCATTACGT TCCCCGCGAG GACGGCGGCA ACGGAACGCT GCTGAAAGGC
GCTGTTTATT ACGGCATGGA CCGGGAGAAC TGGACGAAGG CAGGAACGTT CCAGTGGGAT
AAAAATGGAG ACGTGAAGAT ATTCGGCTTC AAGGATGCTC CCACGGCGCG CTATATCAAG
CTTCATGTCA CGGAAAGCGC GGGTGATTAC GGCTCCGGCA GGGAAATCTA CGTTTTCAAG
GTTCCGGGTA CGGAAAGCTA TTTGCCGGGC GATATCAACA ACGACGGCAA GATTGACCGG
AATGACCTGA CTTCCTACAT CAACTATACG GGCCTGCGGA AAGGCGATTC CGACTTTGAA
GGCTACATCA GTAACGGGGA CATCAACAAG AATGGCCTTA TTGACGCTTA CGATATTTCC
GTGGTAGCCA CCCAGCTGGA AGACGAAGCC GACCAGCCGC AGGAAGAAGC CGACAAGAAA
GATGAGGAAG AGGATAAGGC GGGAGGGGAC GACGAGAAGA AAAAATCCGC CGAGGAGGCA
AAGAAGAAAG TCCGCGTGGA CGGAACGCTT TTGCTTAGTG CGGATAAGAA AAGGTATTCC
AGGGGAGATG CCGTCAAGGT GTCCGTGAAG GGCCGGAACC TCCGGCTGGT GAATGCCCTG
AGCTTCGCTC TTCCGTATGA CCCCAAGGAT TTGGAATTCG TGGGCGTGGA GGTGAAGAAC
ATGAAGAACA TGGAGAACCT GACGAACGAC AGGCTTCATA CGAACGGGAC CAAGGCCCTG
TACCCCACCT TCGTCAACAT CGGAGACAAA GAGCCTGTGG AAGGAACCTC CGAACTGTTT
GTGCTGAAAT TCAAGGCTCG TCGCGATATG AAGTTCCAGC CGAAGCTCAC GGACGGCATG
GTGGTGGATA AAAAGCTGAA CACCAAAAAA TTGTAA
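
The gene sequence above is laid out in blocks of 10 bases separated by spaces. As a minimal sketch of how such a listing might be rejoined and its GC content computed (for comparison with the GC content field), the hypothetical helpers below assume the input contains only nucleotide letters and whitespace.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative input, not the full gene:
example = join_blocks("ATGC ATGC\nGGCC")
print(example)              # ATGCATGCGGCC
print(gc_percent(example))  # 8 of 12 bases are G or C
```

Applied to the full listing, the joined sequence should have the length given in the Gene Length field, and the percentage can be rounded for comparison with the reported value.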
 
Protein sequence
MGNSIDHGMF RKSVLLAFGL SISFSGFALA ASEEPVSEQQ AQAVEGARKI NPAAVEVLLD 
NNRRMTLDFY GDNVFRIFRD DRGGIIRDPK AEPEARILAD SPRRPVSRLD VDQQGDAVVI
TTGKVRIEIN KKTSLFKVIN LKDHSVVVEQ ASPVLFGKGK TSFSLKAKPD EYFYGGGVQN
GRFSHKGKVI SIENQNSWTD GGVASPTPFY WSTGGYGVMW HTFKKGQYDF GSREENLVNL
SHDENYLDVF FMVSDGPVSL LRDFYQLTGA PVLLPKFAFY QGHLNAYNRD YWKEDEKGIL
FEDGKRYKES QKDNGGIKES LNGELNNYQF SGRAVVDRYK AHDMPLGWLL PNDGYGAGYG
QTDTLDGNIA NLKSLADYAR KNGVEIGLWT QSDLHPKPEI SALLQRDIVK EVRDAGVRVL
KTDVAWVGAG YSFGLNGITD VAQIMTYYGN NSRPFIISLD GWAGTQRYAG IWSGDQTGGV
WEYIRFHIPT YIGSGLSGQP NICSDMDGIF GGKNPLVNVR DFQWKTFTPM ELNMDGWGAN
EKYPHAFGEP YTSINRWYLK LKSELLPYAY SIAEESVSGL PMIRAMFLEY PNPYTLGKAT
QYQFLYGPYF LVAPVYQETR ADKEGNDVRH GIYLPEGQWI DYFTGDLYEG GKIYNDVDAP
LWKLPVFVKN GAIIPMANPS NNVSEINPNL RIYELYPHGS TSFATYDDDG VTEEYRTGRG
VRTLVESRVD GKNNVTVTVH PAVGDFDGFQ KKKATEFRIN VTQKPSGVSA KIGGNSINLA
EANSLEDFKS RENVYFYDRA PNLNRFATKG SDFEKKVIAG NPQLLVKLAA ADITAAPTVL
SVKGFRFEPA EKYRLSSGAL TAPANAAVTE ENAAAYTLKP TWDAVPNADF YEIEFGGMLY
TTIKGTEFLF EDLEAETPYS FKVRAANRDG HSAWTAVSAK TKANPLEFAI PGIEGETSVE
NQGSSLAKLF DFKEKDVWHT KHDAKAVPFD LVMDLKTINR LEKFHYVPRE DGGNGTLLKG
AVYYGMDREN WTKAGTFQWD KNGDVKIFGF KDAPTARYIK LHVTESAGDY GSGREIYVFK
VPGTESYLPG DINNDGKIDR NDLTSYINYT GLRKGDSDFE GYISNGDINK NGLIDAYDIS
VVATQLEDEA DQPQEEADKK DEEEDKAGGD DEKKKSAEEA KKKVRVDGTL LLSADKKRYS
RGDAVKVSVK GRNLRLVNAL SFALPYDPKD LEFVGVEVKN MKNMENLTND RLHTNGTKAL
YPTFVNIGDK EPVEGTSELF VLKFKARRDM KFQPKLTDGM VVDKKLNTKK L