Gene Information |
Locus tag | Amuc_0824 |
Symbol | |
ID | 6274353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 969353 |
End bp | 973147 |
Gene Length | 3795 bp |
Protein Length | 1264 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642612874 |
Product | glycoside hydrolase family 2 TIM barrel |
Protein accession | YP_001877438 |
Protein GI | 187735326 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
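The Start bp, End bp, Gene Length, and Protein Length rows above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(969353, 973147)
print(length, protein_length_aa(length))  # 3795 1264
```

Under these assumptions the computed values agree with the reported 3795 bp and 1264 aa.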
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000954733 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.00757756 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGAACGA CCAGATCCTG GCTTTTATCC TGTACCGCGC TGGCGGTAAG TTCCGCCATG TGCGGCCTGG GAGCACCGGG CGATTCCGCC CCCGCGCCAG CCCCCACAGG GCTGGAATGG GAACAGGAAC AAAACCTGCA TTTGAATAAG GAAGCTCCAA CCGCCTTTTT CGCGTCTTTC AGCGATTTGC AGTCCGCCTT GAAAGTGCTG CCTGAAAACA GTAAATGGCG CAGGTCCCTG AACGGTCAGT GGAAGTTCCA CTGGGCGAAG GATCCCCAGA GCCGCCCGGC CGATTTTTAC AAGCCGGATT ACGACGTAAA GGACTGGAAG GAGATTAAGG TGCCTTCCTC CTGGCAGACT CAGGGTTACG GCACCCCCAT TTATTCCAAT CAGCCGTATC CCTTTGAACG CTCCTGGCCT TATGTAATGA AGGAGCCTTC CAACAAGAAT TATACGTCCT ACAAGGAACG GAACCCCGTA GGTTCCTACC GCCGCACTTT TGAAGTACCT GCGGACTGGG ACGGCAGGGA AGTGTACATG CAGTTTGACG GAGTGGATTC CTTCTTCTAC CTCTGGATCA ACGGGCAGTA TGTAGGTTTT TCCAAAGATT CCCGGAATCC GGCCCGTTTT GACATCAGCC CCTACCTTAA GAAGGGGGAG AATGTGGTGG CCGCAGAGGT GTACCGCCAT TCCGACGGAG CATATCTGGA ATGCCAGGAC ATGTTCCGCC TGTCCGGCAT TTTCCGCAAT GTTTCCATCT TTGCGCTGCC GAAAGTTCAC ATCCGCGACT TTTTCGCACA GGCCAATCCG GTGGACCAGA GGGATTGGGC TTTGAATATT GACCATGCCA AACCGGGGAC CGTGGACGGC GATTGGCGCC TTCAGGTGGA TGTGGATGTT CGCAATCTGT TTCCGGCAAC GGAAAAGCTG GACGGCTGCA CGGTTTCCAT GGCCCTGTAT GACGCTGCCG GAAAACTGGT AGAACCTGTC AAGCCCAAGG ATGCGCCATA TGACGGCGTG TTGGAAAAGC CCTTGCGCAT TACCGGCATG AAGGATTTTA AAACTTCCCT GCTGGGCATT TATTCCAAAC CCAGACTATG GTCTGCGGAG GATCCCAACC TGTACACCCT GGTACTGACG CTGAAGCGTG ACGGGAAGAC GGAAGAAATG GTTTCCTCCC GCGTGGGTTT CCGCAATGTG GTGATTAAGG ACAGCGTGTT CCTGGTAAAC GGCCAGCCGG TGAAGGTAAA GGGCGTGAAC CGCCATGAAA GCCATCCGGA AACAGGGCAT TACGTGACTC CGGAGCAGAT GGAGGAAGAA GTGCGGATGA TGAAACGCGC CAATATCAAC CACGTGCGCT GTTCCCATTA TCCCGCGGAT CCTTATTTTT ATTACCTCTG TGACAAGTAC GGCATTTACG TGCAGGATGA GGCCAACATC GAGTCCCACG GCTATTACTA TGGCAAGGAG TCCCTTTCCC ATCCCATTGA GTGGATGCCG GCCCACGTGG ACCGCATCAT GGCGATGGTG GAGCGCAACA AGAACCATCC CTGCGTGATT ATGTGGTCCC TGGGGAATGA AGCCGGCCCC GGCCAGAATT TCCGCAGTGC GGAAAAGATG GTGAAGGCCA GGGATATGTC CCGCCCCACC CACTACGAAC GCAATAATGA CATTGTGGAC TTGGGGTCCA ACCAGTATCC GTCCGTGGAC TGGACGCGTT CCATGGCGGG CAACAAGGAT TTCCCCAAGC CCTACTATAT TTCCGAGTAC GCACACAACA TGATGAATGC CATGGGCAAC 
CTGGCGGATT ACTGGGAGGC CATCGAGTCT TCCGACCGCA TTATGGGCGG CGCCATCTGG GACTGGGTGG ACCAGGGCTT GTACAAGACC CTGCCGAATG GAGAAAAGAT GCTCTGCTAT GGCGGCGATT TCAACGACCA TCCCAATAGC GGCCAGTTTG TGTTCAACGG CACCATCCTG TCGGATCGTA CGCCGGAACC GGGTTATTTT GAAGTCAAGC ACGTCTATCA GAATATTTCC ACATCTTTGA CGGACGACGG CAGGATTTCC ATTTTCAACA AGAATTTCTT TACGGACCTT TCTTCATACG ACATTACCTG GACCCTCACG GAAAACGGCA ATGCAGTGGC TGAAGGCAGG TTGGACACGC CTCCGGCCGG TCCCAGAGAA AAGATTGTGG TTCCCATTCC GGACATTCCC CAGTTGAAAA ACCGGAAACC GGGGGTGGAG TATGCCTTGC GCATAAGTTA CAAGCTGAAG AAGGACAGGG GGTGGGCCAA GAAGGGATAT GAATTGGCCT TTGACCAGCT CCAGCTTCCC GTGCAGGGAG ATCTGCCTGT GTTCAAGGCT CCTGCGGGCA AAGTCAGCCT CAGCACGGAC AAGCATACCG TTTCCGGCAA GGATTTTTCC GTGCAGTTTG ACGCGTCCAC CGGGGAACTG GCCCAGTTCA CGGTAAACGG CAAGCCTCTG TTTAAAACGC CCATGGCGGT GAACGCCCTG CGCGCCGCCT CCAGCAATGA GCCGGGCGTC ATGGCCAAGA GCATGGCTAA CGGCCTCCGT GAACTGAAGC ATGAACTGCT CAGTTACGAA GCCATTGATA ACGGCAATAG CGTCACCGTC AAGCAATCCA TCAAGGTAAG CGGCAAACAG GCTGAAAACA TCAGCGGCTA CGGCGATACC AAGACCACCA TCACGGCCAG GAAGCAACCC CTGAACGATA CGAACACCCA TTTCATCAAT AATTTGGAAT GGACCATCTA TGCGGATGGA ACTGTCGTCT GCCAGTCCGT ACTGCTTCCG CGCGGCAATC CCCTGGAACT GCTGCGCCTG GGATACGAAC TCCAGTTGCC GGCGAATATG GACAACGTAG CCTATTACGG GCGCGGGCCG GAAGAAAACT ATGCGGACCG CAAGAGCGGC ATGCCTCTGG GCGTGTATAA AACGACAGCC TGGGATTCTT TCTTCCCGTA CGGCAGACCG CAGGATTGCG GCAACCATGA GGATACCCGC TGGGTGGCCG TTACGGACGA CAAGGGGAAC GGCCTGCTTT TCGGTTCCGT GGGCGCACCG TTCGCTTTCT CCGCCCTTCC GTATACCACC ACGGATTTGA TCCTGGCAAA CCACCCCGTG GAACTGCCGA AGACGACGGA TAAGACCGTT CTGGTTCTCT CTTCCGCCAC GCGCGGCCTG GGGGGTGCTT CCTGCGGTCC CGGCCCTATG GGCAGGGACA TCATCAAGGC CAACAAGCCC TACCCGATGT CCTTCTTTAT GCGGCCCATT ACCGCCAAGT CCTACAAGGG GGAAATCCGC GTGCCTGCGG CCCGGCTGGA TATGACCATG CTGACCCGCA CGGACAAGTA TACGGTCAAG AGTGTAACCA GCCAGGAGCA GGGCGAAGCG GACGCCGAAT TCGCCATTGA CGGTGATCCC GGCACCTTCT GGCACTCTGA ATACAATAAA ACCGTGACCA AACATCCGCA TGTACTGGCC GTGGACCTGG GTAAGGAGCG GGAATTCTCC GGAATCACTT ATCTTCCACG TCAGGATGGC AGCAGCAATG GCCGCGTGAA AGATTATTCC GTGGACGTGA 
GCACGGACGG AGAGAAATGG CAGCCTGCCG CCAAGGGCTC CTTCCCGGAC AGTGCTGACC TGCAGGAAGT GAAATTCCAA GCTCCCGTCA AGGCGCGTTA TTTCCGCTTC TCCGCCCTTA GTGAGGCGCA GGGGCGGGAT TACGCCGCCG TAGCGGAACT GGATATCATT CCCGTTAAGA AATAA
|
Protein sequence | MRTTRSWLLS CTALAVSSAM CGLGAPGDSA PAPAPTGLEW EQEQNLHLNK EAPTAFFASF SDLQSALKVL PENSKWRRSL NGQWKFHWAK DPQSRPADFY KPDYDVKDWK EIKVPSSWQT QGYGTPIYSN QPYPFERSWP YVMKEPSNKN YTSYKERNPV GSYRRTFEVP ADWDGREVYM QFDGVDSFFY LWINGQYVGF SKDSRNPARF DISPYLKKGE NVVAAEVYRH SDGAYLECQD MFRLSGIFRN VSIFALPKVH IRDFFAQANP VDQRDWALNI DHAKPGTVDG DWRLQVDVDV RNLFPATEKL DGCTVSMALY DAAGKLVEPV KPKDAPYDGV LEKPLRITGM KDFKTSLLGI YSKPRLWSAE DPNLYTLVLT LKRDGKTEEM VSSRVGFRNV VIKDSVFLVN GQPVKVKGVN RHESHPETGH YVTPEQMEEE VRMMKRANIN HVRCSHYPAD PYFYYLCDKY GIYVQDEANI ESHGYYYGKE SLSHPIEWMP AHVDRIMAMV ERNKNHPCVI MWSLGNEAGP GQNFRSAEKM VKARDMSRPT HYERNNDIVD LGSNQYPSVD WTRSMAGNKD FPKPYYISEY AHNMMNAMGN LADYWEAIES SDRIMGGAIW DWVDQGLYKT LPNGEKMLCY GGDFNDHPNS GQFVFNGTIL SDRTPEPGYF EVKHVYQNIS TSLTDDGRIS IFNKNFFTDL SSYDITWTLT ENGNAVAEGR LDTPPAGPRE KIVVPIPDIP QLKNRKPGVE YALRISYKLK KDRGWAKKGY ELAFDQLQLP VQGDLPVFKA PAGKVSLSTD KHTVSGKDFS VQFDASTGEL AQFTVNGKPL FKTPMAVNAL RAASSNEPGV MAKSMANGLR ELKHELLSYE AIDNGNSVTV KQSIKVSGKQ AENISGYGDT KTTITARKQP LNDTNTHFIN NLEWTIYADG TVVCQSVLLP RGNPLELLRL GYELQLPANM DNVAYYGRGP EENYADRKSG MPLGVYKTTA WDSFFPYGRP QDCGNHEDTR WVAVTDDKGN GLLFGSVGAP FAFSALPYTT TDLILANHPV ELPKTTDKTV LVLSSATRGL GGASCGPGPM GRDIIKANKP YPMSFFMRPI TAKSYKGEIR VPAARLDMTM LTRTDKYTVK SVTSQEQGEA DAEFAIDGDP GTFWHSEYNK TVTKHPHVLA VDLGKEREFS GITYLPRQDG SSNGRVKDYS VDVSTDGEKW QPAAKGSFPD SADLQEVKFQ APVKARYFRF SALSEAQGRD YAAVAELDII PVKK
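The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction for the nucleotide sequence; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example runs only on the first three displayed blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the Gene sequence field, as displayed.
prefix = clean_sequence("ATGAGAACGA CCAGATCCTG GCTTTTATCC")
print(len(prefix), round(gc_fraction(prefix), 2))
```

Applied to the complete Gene sequence field, the cleaned length would be expected to match the Gene Length row, provided the displayed sequence is complete.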