Gene Amuc_0824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0824 
Symbol 
ID6274353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp969353 
End bp973147 
Gene Length3795 bp 
Protein Length1264 aa 
Translation table11 
GC content56% 
IMG OID642612874 
Productglycoside hydrolase family 2 TIM barrel 
Protein accessionYP_001877438 
Protein GI187735326 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000954733 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.00757756 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGAACGA CCAGATCCTG GCTTTTATCC TGTACCGCGC TGGCGGTAAG TTCCGCCATG 
TGCGGCCTGG GAGCACCGGG CGATTCCGCC CCCGCGCCAG CCCCCACAGG GCTGGAATGG
GAACAGGAAC AAAACCTGCA TTTGAATAAG GAAGCTCCAA CCGCCTTTTT CGCGTCTTTC
AGCGATTTGC AGTCCGCCTT GAAAGTGCTG CCTGAAAACA GTAAATGGCG CAGGTCCCTG
AACGGTCAGT GGAAGTTCCA CTGGGCGAAG GATCCCCAGA GCCGCCCGGC CGATTTTTAC
AAGCCGGATT ACGACGTAAA GGACTGGAAG GAGATTAAGG TGCCTTCCTC CTGGCAGACT
CAGGGTTACG GCACCCCCAT TTATTCCAAT CAGCCGTATC CCTTTGAACG CTCCTGGCCT
TATGTAATGA AGGAGCCTTC CAACAAGAAT TATACGTCCT ACAAGGAACG GAACCCCGTA
GGTTCCTACC GCCGCACTTT TGAAGTACCT GCGGACTGGG ACGGCAGGGA AGTGTACATG
CAGTTTGACG GAGTGGATTC CTTCTTCTAC CTCTGGATCA ACGGGCAGTA TGTAGGTTTT
TCCAAAGATT CCCGGAATCC GGCCCGTTTT GACATCAGCC CCTACCTTAA GAAGGGGGAG
AATGTGGTGG CCGCAGAGGT GTACCGCCAT TCCGACGGAG CATATCTGGA ATGCCAGGAC
ATGTTCCGCC TGTCCGGCAT TTTCCGCAAT GTTTCCATCT TTGCGCTGCC GAAAGTTCAC
ATCCGCGACT TTTTCGCACA GGCCAATCCG GTGGACCAGA GGGATTGGGC TTTGAATATT
GACCATGCCA AACCGGGGAC CGTGGACGGC GATTGGCGCC TTCAGGTGGA TGTGGATGTT
CGCAATCTGT TTCCGGCAAC GGAAAAGCTG GACGGCTGCA CGGTTTCCAT GGCCCTGTAT
GACGCTGCCG GAAAACTGGT AGAACCTGTC AAGCCCAAGG ATGCGCCATA TGACGGCGTG
TTGGAAAAGC CCTTGCGCAT TACCGGCATG AAGGATTTTA AAACTTCCCT GCTGGGCATT
TATTCCAAAC CCAGACTATG GTCTGCGGAG GATCCCAACC TGTACACCCT GGTACTGACG
CTGAAGCGTG ACGGGAAGAC GGAAGAAATG GTTTCCTCCC GCGTGGGTTT CCGCAATGTG
GTGATTAAGG ACAGCGTGTT CCTGGTAAAC GGCCAGCCGG TGAAGGTAAA GGGCGTGAAC
CGCCATGAAA GCCATCCGGA AACAGGGCAT TACGTGACTC CGGAGCAGAT GGAGGAAGAA
GTGCGGATGA TGAAACGCGC CAATATCAAC CACGTGCGCT GTTCCCATTA TCCCGCGGAT
CCTTATTTTT ATTACCTCTG TGACAAGTAC GGCATTTACG TGCAGGATGA GGCCAACATC
GAGTCCCACG GCTATTACTA TGGCAAGGAG TCCCTTTCCC ATCCCATTGA GTGGATGCCG
GCCCACGTGG ACCGCATCAT GGCGATGGTG GAGCGCAACA AGAACCATCC CTGCGTGATT
ATGTGGTCCC TGGGGAATGA AGCCGGCCCC GGCCAGAATT TCCGCAGTGC GGAAAAGATG
GTGAAGGCCA GGGATATGTC CCGCCCCACC CACTACGAAC GCAATAATGA CATTGTGGAC
TTGGGGTCCA ACCAGTATCC GTCCGTGGAC TGGACGCGTT CCATGGCGGG CAACAAGGAT
TTCCCCAAGC CCTACTATAT TTCCGAGTAC GCACACAACA TGATGAATGC CATGGGCAAC
CTGGCGGATT ACTGGGAGGC CATCGAGTCT TCCGACCGCA TTATGGGCGG CGCCATCTGG
GACTGGGTGG ACCAGGGCTT GTACAAGACC CTGCCGAATG GAGAAAAGAT GCTCTGCTAT
GGCGGCGATT TCAACGACCA TCCCAATAGC GGCCAGTTTG TGTTCAACGG CACCATCCTG
TCGGATCGTA CGCCGGAACC GGGTTATTTT GAAGTCAAGC ACGTCTATCA GAATATTTCC
ACATCTTTGA CGGACGACGG CAGGATTTCC ATTTTCAACA AGAATTTCTT TACGGACCTT
TCTTCATACG ACATTACCTG GACCCTCACG GAAAACGGCA ATGCAGTGGC TGAAGGCAGG
TTGGACACGC CTCCGGCCGG TCCCAGAGAA AAGATTGTGG TTCCCATTCC GGACATTCCC
CAGTTGAAAA ACCGGAAACC GGGGGTGGAG TATGCCTTGC GCATAAGTTA CAAGCTGAAG
AAGGACAGGG GGTGGGCCAA GAAGGGATAT GAATTGGCCT TTGACCAGCT CCAGCTTCCC
GTGCAGGGAG ATCTGCCTGT GTTCAAGGCT CCTGCGGGCA AAGTCAGCCT CAGCACGGAC
AAGCATACCG TTTCCGGCAA GGATTTTTCC GTGCAGTTTG ACGCGTCCAC CGGGGAACTG
GCCCAGTTCA CGGTAAACGG CAAGCCTCTG TTTAAAACGC CCATGGCGGT GAACGCCCTG
CGCGCCGCCT CCAGCAATGA GCCGGGCGTC ATGGCCAAGA GCATGGCTAA CGGCCTCCGT
GAACTGAAGC ATGAACTGCT CAGTTACGAA GCCATTGATA ACGGCAATAG CGTCACCGTC
AAGCAATCCA TCAAGGTAAG CGGCAAACAG GCTGAAAACA TCAGCGGCTA CGGCGATACC
AAGACCACCA TCACGGCCAG GAAGCAACCC CTGAACGATA CGAACACCCA TTTCATCAAT
AATTTGGAAT GGACCATCTA TGCGGATGGA ACTGTCGTCT GCCAGTCCGT ACTGCTTCCG
CGCGGCAATC CCCTGGAACT GCTGCGCCTG GGATACGAAC TCCAGTTGCC GGCGAATATG
GACAACGTAG CCTATTACGG GCGCGGGCCG GAAGAAAACT ATGCGGACCG CAAGAGCGGC
ATGCCTCTGG GCGTGTATAA AACGACAGCC TGGGATTCTT TCTTCCCGTA CGGCAGACCG
CAGGATTGCG GCAACCATGA GGATACCCGC TGGGTGGCCG TTACGGACGA CAAGGGGAAC
GGCCTGCTTT TCGGTTCCGT GGGCGCACCG TTCGCTTTCT CCGCCCTTCC GTATACCACC
ACGGATTTGA TCCTGGCAAA CCACCCCGTG GAACTGCCGA AGACGACGGA TAAGACCGTT
CTGGTTCTCT CTTCCGCCAC GCGCGGCCTG GGGGGTGCTT CCTGCGGTCC CGGCCCTATG
GGCAGGGACA TCATCAAGGC CAACAAGCCC TACCCGATGT CCTTCTTTAT GCGGCCCATT
ACCGCCAAGT CCTACAAGGG GGAAATCCGC GTGCCTGCGG CCCGGCTGGA TATGACCATG
CTGACCCGCA CGGACAAGTA TACGGTCAAG AGTGTAACCA GCCAGGAGCA GGGCGAAGCG
GACGCCGAAT TCGCCATTGA CGGTGATCCC GGCACCTTCT GGCACTCTGA ATACAATAAA
ACCGTGACCA AACATCCGCA TGTACTGGCC GTGGACCTGG GTAAGGAGCG GGAATTCTCC
GGAATCACTT ATCTTCCACG TCAGGATGGC AGCAGCAATG GCCGCGTGAA AGATTATTCC
GTGGACGTGA GCACGGACGG AGAGAAATGG CAGCCTGCCG CCAAGGGCTC CTTCCCGGAC
AGTGCTGACC TGCAGGAAGT GAAATTCCAA GCTCCCGTCA AGGCGCGTTA TTTCCGCTTC
TCCGCCCTTA GTGAGGCGCA GGGGCGGGAT TACGCCGCCG TAGCGGAACT GGATATCATT
CCCGTTAAGA AATAA
 
Protein sequence
MRTTRSWLLS CTALAVSSAM CGLGAPGDSA PAPAPTGLEW EQEQNLHLNK EAPTAFFASF 
SDLQSALKVL PENSKWRRSL NGQWKFHWAK DPQSRPADFY KPDYDVKDWK EIKVPSSWQT
QGYGTPIYSN QPYPFERSWP YVMKEPSNKN YTSYKERNPV GSYRRTFEVP ADWDGREVYM
QFDGVDSFFY LWINGQYVGF SKDSRNPARF DISPYLKKGE NVVAAEVYRH SDGAYLECQD
MFRLSGIFRN VSIFALPKVH IRDFFAQANP VDQRDWALNI DHAKPGTVDG DWRLQVDVDV
RNLFPATEKL DGCTVSMALY DAAGKLVEPV KPKDAPYDGV LEKPLRITGM KDFKTSLLGI
YSKPRLWSAE DPNLYTLVLT LKRDGKTEEM VSSRVGFRNV VIKDSVFLVN GQPVKVKGVN
RHESHPETGH YVTPEQMEEE VRMMKRANIN HVRCSHYPAD PYFYYLCDKY GIYVQDEANI
ESHGYYYGKE SLSHPIEWMP AHVDRIMAMV ERNKNHPCVI MWSLGNEAGP GQNFRSAEKM
VKARDMSRPT HYERNNDIVD LGSNQYPSVD WTRSMAGNKD FPKPYYISEY AHNMMNAMGN
LADYWEAIES SDRIMGGAIW DWVDQGLYKT LPNGEKMLCY GGDFNDHPNS GQFVFNGTIL
SDRTPEPGYF EVKHVYQNIS TSLTDDGRIS IFNKNFFTDL SSYDITWTLT ENGNAVAEGR
LDTPPAGPRE KIVVPIPDIP QLKNRKPGVE YALRISYKLK KDRGWAKKGY ELAFDQLQLP
VQGDLPVFKA PAGKVSLSTD KHTVSGKDFS VQFDASTGEL AQFTVNGKPL FKTPMAVNAL
RAASSNEPGV MAKSMANGLR ELKHELLSYE AIDNGNSVTV KQSIKVSGKQ AENISGYGDT
KTTITARKQP LNDTNTHFIN NLEWTIYADG TVVCQSVLLP RGNPLELLRL GYELQLPANM
DNVAYYGRGP EENYADRKSG MPLGVYKTTA WDSFFPYGRP QDCGNHEDTR WVAVTDDKGN
GLLFGSVGAP FAFSALPYTT TDLILANHPV ELPKTTDKTV LVLSSATRGL GGASCGPGPM
GRDIIKANKP YPMSFFMRPI TAKSYKGEIR VPAARLDMTM LTRTDKYTVK SVTSQEQGEA
DAEFAIDGDP GTFWHSEYNK TVTKHPHVLA VDLGKEREFS GITYLPRQDG SSNGRVKDYS
VDVSTDGEKW QPAAKGSFPD SADLQEVKFQ APVKARYFRF SALSEAQGRD YAAVAELDII
PVKK