Gene Amuc_0480 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0480 
Symbol 
ID6275422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp568821 
End bp571205 
Gene Length2385 bp 
Protein Length794 aa 
Translation table11 
GC content54% 
IMG OID642612530 
ProductGlycosyl hydrolase family 98 putative carbohydrate binding module 
Protein accessionYP_001877099 
Protein GI187734987 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.133222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGCA ATCTCTCTTT TTCTCTGATG GAAGCTTCTG GACGATCCAT ATTTTTTCTG 
ATAGAAGGAA TACGGGAACA GAGTATAAAA AATATGTTCA GCCGGATGTT TTCATGGAGT
TTTGTAGTTG CAGCGTGTTT GGCGGGATTA TTCCCGGCAC AGTCCCAGGG CGAGGAAAAA
GCTGCCCAAT CCGGAACCAT AGCCGTAAAA GTTCCTGCCT CCAGCCTGCT GATGACTCGC
CAGGAGACGG GGGAAACCCG GTTGGACCGG TCTTTCAGTA ATGCGGGGTT GAGCATAGGC
GGCAAAAAAT ATGCCACGGG AATCGGTACG CACGCTACGT CCATGATTCC CCTGCCCGTT
CCGGAAAACC CGAAAGTTTT GAGGCTGGAA GGTGCATGCG GAATTGATGA CGGTGCGGAT
GGAGACGGAA GCGTGGAATT CCGCGTGATG AGCGGGTCCG AGGTTTTATG GAGTTCCGGC
GTGATGAGGA GGGGGATGGC CGCGAAGAAG TTTTCCATTC CGGTGGCGGA AAACGGCATA
CGGCATCTTT ACCTGATGGC CGACCGGGTG GACAATAATT CTTACGACCA TGCGGACTGG
GTAGATCTGG CCTGGAAGAC GACAGGAAGT GGGCAGGGGA TGAAGGGAGC TGTGGTGAAT
GCCTCTGAAT TCGGCATGGT TCCCGGCGTC AGAAAAGATC AGGGGCCGGC GCTGCGGGCT
GCCGTTTCCG CTCTCCGGAG GCAGGGCGGG GGCGTTTTGA ACATCCCCAG AGGCATTTAC
CATTTTTATC CGGAGGGGGC TTTGAACATG AGTTTCCATA TTTCCAACCA TGACCAGCCG
CTGATTCATC CGGTGTGTGT GCCTCTGGCC GATTTGCGGA ATGTCCGCGT GGAGGGCAAT
GGTTCCCTGT TTCTTTTTCA CGGCAAGGTG GTTCCCCTCC TGGTGATGGA CAGCGAAAAT
GTCAGCATCA ACCGTTTATC CGTGGATTAC GAACGGTCCT GGTGCACGGA GGCGCGCGTC
GTGAAGACGG ATGACCGGTT CACGGAAGTG GAAATAGACA AAAAGGCCTA CCCTTACGAA
ATCCGGAACA ACAGGTTCGT TTTCCAGGGG AAAGGCTGGG AAGAAGGAAT GGGAAGCTGC
ATGGCCTTTG AGAAAGGGAC GGGACATATT ATCGCCAATA CATCGGACAT CGGCTGGAAC
GGGCATGTGG AGCCGCTGGG CGGGAGCCGT CTCCGTCTGT CCTGGAACCT CAGGCAGAAG
GGAATCAAGC CGGGGGATAC GCTGGTTCTT CGGAATTATA ACCGCCCCCA TCCGGGATGC
GTGGTGTACC GGGCTCGGAA AACATCACTG AATGACGTAT CTCTGCACCA GAGCTCAGGC
ATGGCCCTGC TGGTCCAGCG ATCGGAGGAC TTTCATATGA AAGGCGGCGG AGTCATGGTC
AGGAAAGGCA CAGGGCGCGT ACACACCGCC GGAGCGGACG CAACCCATTT TTCCAATACC
CGGGGCGGAA TCGTGGTGGA GAAAGCCCTG TTCGAAGGGA TGATGGATGA TGCCATCAAT
GTTCATTCCA CCTGCCTGGG CGTGATGGAA GTAGTGGACA GCCATACCCT GAAGTGCAAA
TACATGCATC GGCAGGCGGT GGGGTTTGAG GTGTTTCTTC CCGGTGAAAA AATCCGTTTC
ATCAACGGTC CTACGCTGGA ACCCGGCGGA ACGGCCACTG TGAAGACGGC AGTGAAAAAG
AATTCCGCGG AGATGGTGAT TACGGTGGAA GAGCCGCTCC CCTCCTCCGT CAGGGCCGGG
GATGCCGTGG AGAATGCGGA TTTTTACCCT TCCGTGGTTT TCCGCAACAA TATCGTCCGC
AACAACCGTG CCAGAGGATC TCTTTTTACA ACGCCGGAGA GAGTGCTGGT GGAGGGCAAT
TTGTTTGACC ATTCCTCCGG CTCCGCCATT CTATTGGCAG GGGATGCCCA GGGGTGGTAT
GAAAGCGGCG CCTGCCATGA AGTGGTGATC CGCAAAAATA CATTCATTAA CAACCTGACC
TCGCGTTACC AGTTTACGAA TGCCATCATT TCCATTTACC CGGAAGTCAA GCAGCTGGAC
AGGCAGAGGG ATTACTATCA TCGCAATGTG CTGATAGAAA ATAATGTTTT CAAGACGTTC
GATGTGCCGC TGCTGTTCGC CATTTCCACG GACAACCTCA AGTTCATCAA TAATAAGGTC
ATTTACAATG ACGAGTTTAA GGGATGGGGG CAGAAACCTT TCCAATTCAG AAGATGCGCC
AATATTCTGA TTAAAGATAA CAAGGTGCTG CCTCCCCGCA CATGGACCCT TGAGGACTGC
AAGCTGGAAA ATACCCCATC AGATCAGGTC CGCTTTGGTG GATAA
 
Protein sequence
MEGNLSFSLM EASGRSIFFL IEGIREQSIK NMFSRMFSWS FVVAACLAGL FPAQSQGEEK 
AAQSGTIAVK VPASSLLMTR QETGETRLDR SFSNAGLSIG GKKYATGIGT HATSMIPLPV
PENPKVLRLE GACGIDDGAD GDGSVEFRVM SGSEVLWSSG VMRRGMAAKK FSIPVAENGI
RHLYLMADRV DNNSYDHADW VDLAWKTTGS GQGMKGAVVN ASEFGMVPGV RKDQGPALRA
AVSALRRQGG GVLNIPRGIY HFYPEGALNM SFHISNHDQP LIHPVCVPLA DLRNVRVEGN
GSLFLFHGKV VPLLVMDSEN VSINRLSVDY ERSWCTEARV VKTDDRFTEV EIDKKAYPYE
IRNNRFVFQG KGWEEGMGSC MAFEKGTGHI IANTSDIGWN GHVEPLGGSR LRLSWNLRQK
GIKPGDTLVL RNYNRPHPGC VVYRARKTSL NDVSLHQSSG MALLVQRSED FHMKGGGVMV
RKGTGRVHTA GADATHFSNT RGGIVVEKAL FEGMMDDAIN VHSTCLGVME VVDSHTLKCK
YMHRQAVGFE VFLPGEKIRF INGPTLEPGG TATVKTAVKK NSAEMVITVE EPLPSSVRAG
DAVENADFYP SVVFRNNIVR NNRARGSLFT TPERVLVEGN LFDHSSGSAI LLAGDAQGWY
ESGACHEVVI RKNTFINNLT SRYQFTNAII SIYPEVKQLD RQRDYYHRNV LIENNVFKTF
DVPLLFAIST DNLKFINNKV IYNDEFKGWG QKPFQFRRCA NILIKDNKVL PPRTWTLEDC
KLENTPSDQV RFGG