Gene Information |
Locus tag | Amuc_1667 |
Symbol | |
ID | 6274629 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2017753 |
End bp | 2020329 |
Gene Length | 2577 bp |
Protein Length | 858 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642613725 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_001878266 |
Protein GI | 187736154 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
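The coordinate and length fields in the table above can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 2017753
end_bp = 2020329
gene_length_bp = 2577
protein_length_aa = 858

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# An unspliced CDS of N bases holds N / 3 codons. Assumption: the last
# codon is the stop codon and is not counted in the protein length.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)
# prints: 2577 859 0 858
```

Under these assumptions the span matches the listed gene length, and 859 codons minus one stop codon gives the listed protein length of 858 aa.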
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.611357 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.331093 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAAGA TGATGGCCTG CATGCTGGGC GCTGCCCTGT GTTTTTCCAT GGCCGGAGCG GAAACAGGCG TTTCCTCTGC GGAAAAAGCT TCACCCCTGT TCGGGAGGAT GATCTGGAAT CGCGGCTGGG AGTTCCGGCT TGAGGACGAG CCCGGAAAGA GCGGCTGGCA GACGGTTCAT CTGCCCCATA CGTTCAGTCT GCCCTACTTT ATGTCCGATT CCTTTTACAC GGGGTTCGGC TCCTACCGCA AGAAGCTTGT CATGCCGGAG GAATGGCAGG GGAAGAAGGT GTTTCTGGAT TGCGGCGCAG CTTTCCAGGT GGCGGAGGTG AAGGTGAATG GGAAGACTGC GGGAAGGCAT GAAGGGGGGT ATACGGCCTT CCGGTCGGAT TTGACGCCTT TCCTGAAGCC CGGCGAGAAC TGGGTGGAGG TACGGGTGGA TAACCGCTGG AGCCCCCGCA TTGCGCCTCG GGCGGGGGAG CATACTTTTT CCGGAGGACT GTACCGGAAC GTGCATCTGC ACGTGTGTTC TCCCGTGTAT TTGCCCGCGC ACGGAGTATG GGTTCAGACG CCGGAGGTGA CGGCGGCCCG CGGGAAGGTC CGGATTTTGA CGGAAGTGAA GAATGATACC GGGACAGTGA GGAAAGTGAC GGTGCGGCAT ACTGTGCGGG ATGAGCAAAG CGGCTGCGTG GTGCTGCGGG GAGAGAAGGG CGCGGAACTG GGGGCCGGGG AAGATTCTCG CGTACAGGCT GACCTGCCTC CTCTGGCTTC TCCCAGGTTG TGGAGCCCTG CGGCCCCCAA TATGTACCGT GTGACGACGG AATTGTTGGA CGAGAAGGGG AAGGTGCTGG ACCGGGCTGA AAATCCCCTG GGTTTCCGTA CTCTGGAGCT GACGGCGGAC CAGGGTATGC TGATTAACGG GAAACCTGTT TATCTGCAGG GGGCCAATGT GCACCAGGAC CACGCCGGGT GGGGGGATGC CGTGACGGAT GCCGGGGCCC GCCGTGATGT ACGGCTGATG AAGAACGCCG GGTTTAATTT CATCCGCGGC TCCCATTACC CTCATTCCCA GGCTTTTCTG GATGCTTGCG ACCGGGAGGG CATGTGCATG CTGAATGAAG GCATTTTCTG GGGAATGGGC GGGTTCAAGG AGCATGACAG ATACTGGAAT TGCGATGCCT ATCCTGCGGA AAAGAAGGAT CGTATTGCGT TTGAGGAAAG CTGCATGCGC CAGGTGAGGG AGATGGTGCT TCAATTCCGC AATCATCCTT CCATCGTCAT CTGGAGCATC AGCAATGAAC CGTTTTTTAC CAGGCATGCG CCGGAGGCCA GGGCTTTGTG CAACAGGCTC ATTACCTTGG TGAAGGAGTT GGACCCGACG CGCCCCGTGT GCGCCGGGGG CGGGCAGCGC GGAAATTTTG ACAAGCTGGG GGACATGGCC GCCTTGAACG GGGACGGTTC CCATGTGAAG ACGCCCGGCA GGCCCAGCAT GGTGACGGAG TATGGTTCCG TGAGCTGCCG GAGGCCGGGG GCGTATGCGC CGGGCTGGGG GGATATGAAG AAGGATAAGG AAGCGGGGAT TCGCTATCCC TGGCGCGTGG GAGAGGCGGT CTGGTGCGGC TTTGACCACG GCAGCATCTG GCCTTCCGGA GGGCGCATGG GCATTGTGGA TTACTTCCGC ATTCCCAAGC GGGCCTGGTA CTGGTACAGG AATGCGTTGA GGAATATTCC TCCTCCGGAG TGGCCTGTAG AGGGGACTCC GGCACAGGTG AAACTGTCCG CGGACAAGAA GGTTATCAGC 
CCGGCGGACG GCACGGACGA TGTGCATGTG ACCGTGAAAG TGGCGGACGC CGCCGGAAGG CAGATATCCA ATGCCGTGCC GGTGACGCTT ACGGTGGTAT CGGGTCCCGG GGAGTTCCCT ACCGGCAAAA GCATTACGTT CACGCCCGGA ACGGATATTG ATTTAATTGA CGGCTGTGCG GCGATTGAGT TTCGGTCTTA TTATGCCGGG AAGACAGTGA TCAGGGCGTC TTCTCCCGGA TTGAAGGGGG ATAGCCTCCA AATCGTTTGC CGGAATGCTC CGGCTTACGT AGCGGGCAGG AGCGCGGAAA CGCGGGAAAG GCCCTACAAG CGTTTTAGCG CAAAGGAGAG GGATATCCAG CTGGCGCGGT ACGGACGGCC CGAATCCGGA GAGAAAGCCA ATTTGGCTGT ATTGAGGCCC TGTTCCGCTT CCTCCGGATT TCAGGAGACG ATGAAGGCTT CCGACGGTGA CGATGTTTCT GCATGGCATC CGTCCGTGGA AGACAAAGCG CCGTGGTGGC AGCTGGACAT GGAGTTTGAA TTCCGCCTGG ACCGGGTGGA GGTGAAGGCA GCCGGAAGCT GGAAGGGGCC GGCTCCCTCG GTTCAGGTCA GCAGGGACGG AAATAGCTGG AAGGACATTA AAGCCGTTTT GCGCAAGGAT GGGACGGAGC TGACCGCAAT GTGTCCGGAA GGGACCGGCG CCAGGTACGT ACGCATCCGC CTGTCTCCGG GGCAGGGAAT TGCGGAAGTG GCCGTATGGC CGGCGGATGC GTCATGA
|
Protein sequence | MRKMMACMLG AALCFSMAGA ETGVSSAEKA SPLFGRMIWN RGWEFRLEDE PGKSGWQTVH LPHTFSLPYF MSDSFYTGFG SYRKKLVMPE EWQGKKVFLD CGAAFQVAEV KVNGKTAGRH EGGYTAFRSD LTPFLKPGEN WVEVRVDNRW SPRIAPRAGE HTFSGGLYRN VHLHVCSPVY LPAHGVWVQT PEVTAARGKV RILTEVKNDT GTVRKVTVRH TVRDEQSGCV VLRGEKGAEL GAGEDSRVQA DLPPLASPRL WSPAAPNMYR VTTELLDEKG KVLDRAENPL GFRTLELTAD QGMLINGKPV YLQGANVHQD HAGWGDAVTD AGARRDVRLM KNAGFNFIRG SHYPHSQAFL DACDREGMCM LNEGIFWGMG GFKEHDRYWN CDAYPAEKKD RIAFEESCMR QVREMVLQFR NHPSIVIWSI SNEPFFTRHA PEARALCNRL ITLVKELDPT RPVCAGGGQR GNFDKLGDMA ALNGDGSHVK TPGRPSMVTE YGSVSCRRPG AYAPGWGDMK KDKEAGIRYP WRVGEAVWCG FDHGSIWPSG GRMGIVDYFR IPKRAWYWYR NALRNIPPPE WPVEGTPAQV KLSADKKVIS PADGTDDVHV TVKVADAAGR QISNAVPVTL TVVSGPGEFP TGKSITFTPG TDIDLIDGCA AIEFRSYYAG KTVIRASSPG LKGDSLQIVC RNAPAYVAGR SAETRERPYK RFSAKERDIQ LARYGRPESG EKANLAVLRP CSASSGFQET MKASDGDDVS AWHPSVEDKA PWWQLDMEFE FRLDRVEVKA AGSWKGPAPS VQVSRDGNSW KDIKAVLRKD GTELTAMCPE GTGARYVRIR LSPGQGIAEV AVWPADAS
|
| |
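The gene and protein sequences above can be related by translating the gene codon by codon. The following is a minimal sketch, not the method used by the database: the `translate` helper is hypothetical, and it relies on translation table 11 assigning the same amino acid to each codon as the standard code (the two tables differ in which codons may act as start codons). Only the first three blocks and the last 15 bases of the gene sequence are used as input.

```python
# Compact codon table with bases ordered T, C, A, G at each position.
# Assumption: the codon-to-amino-acid assignments of translation
# table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    # Drop the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk over complete codons only; trailing bases are ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        protein.append(AMINO_ACIDS[index])
    return "".join(protein)


# First three blocks of the gene sequence (30 bases, 10 codons).
first_blocks = "ATGCGCAAGA TGATGGCCTG CATGCTGGGC"
# Last 15 bases of the gene sequence (5 codons, ending in the stop).
last_bases = "GCGGATGC GTCATGA"

print(translate(first_blocks))  # prints: MRKMMACMLG
print(translate(last_bases))    # prints: ADAS*
```

The first result matches the first block of the protein sequence, and the second matches its final four residues followed by the stop codon.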