Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0290 |
Symbol | |
ID | 6275131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 347442 |
End bp | 350402 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642612344 |
Product | glycoside hydrolase family 2 sugar binding |
Protein accession | YP_001876913 |
Protein GI | 187734801 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCACCA ATATTACACG CACGACCCTC TGCATTACTG CGTTCAGCAT CGCCAGCCTC ATGGCGGCGC CTCTGAATGC AACCAAAACG GAGAGCCTGG ATTGGAACTG GAAATTCGCC CGTTTCGGGA AAATGCCGGA TGGCAGTACG CAACCGGAAC CGGGAAAAGC CATGGGATTC GCCACTGCCA CCAGTGAAGA ATCCGGCAAT CCGGCGGACA ATGCCGTGGA CGGGGACAAG TCCACCCGCT GGTGTGCCGC CAGTGGCAAA AGCGGAGAAA AAATCACCGT GGACATGGGA CGCCCCGTAG ATGTAAAAAC CGTTAACATC CTGTGGGAAA AACAAAGCAA CCATCTTTTC AAGCTGGAAG GCTCCGGTGA CGGAAAACGC TGGGCAACTA TTGAAGACAA AACTTCCGGG CAAAACGACT CCAAGGAAGA CACGGTAGAA AACAAAACCG GCAAACCGCG ATACTTCCGC ATCACCGTCA CGGGCAACAA CCAGAGCAAC TGGGCCAGCA TCCGTGAAAT CACCTTTAAA AACGACAAGG GGGAAATTAT CCGCCCTCAG GCCGCCGCCG GAACCAGTAA GGCGGACAAT CCCTCCAGCC CCTCTTTCAA CGACAAAAAC TGGCGTTCCT TGAACCTGCC GCACGACTGG GGCGTGGAAG GACCCTTCCG GATGGAAATT GAAAACAGAA CCGGAAAACT CCCCTGGGTC GGCATTGGCT GGTACCGCAA AACGCTGGAA ATCCCGGCGG ACGCCAAGGG CAACCAATTC TATCTGGACT TTGACGGCGT TATGTCCCGC CCCAAAATTT ATGTGAACGG ACACCTGGCC GGCGAATGGA AATACGGTTA CAGCTCCTTC CGCGTGGACA TCACGCCCTT CCTGAAATTC GGGCAGCAAA ATACCATTGC CGTCAGAGTG GACAATCCCC CCAGCACCTC CCGATGGTAT CCGGGCGGCG GCATCTACCG CCATGTGTGG CTCACGGAAT CCAACCCTGT GCACATCGAA CACTGGGGCG TTTTCGTCAA AACTCCGGAA ATCACCAAAT CCGCCGCCAA GGTAGAAGTG GACACCACGG TGAAAAACAC CACGGACAAA GCCGTCATCC CCACTGTTAC TGAAGAAATC CTGGACGGAG GTAAAATCGT AGCCTCCACA ACCACCAAAG GGGAAGAAAT TCCCGCCGGG GAAAAGGGCA AAATCACCAG TACGCTGACG CTCAAAAACC CCACTCTGTG GACGCTAAAC GCTCCCCATC TGTATAAGAT GAAAACCACG GTCAGGATGG GAGACAAAGT CATAGACCAA AAATTCACCA ACTTCGGCGT AAGAACCGTT GAATGGAAAC CCACGGGATT CTACCTTAAC GGGGAGCGCG TGCAGCTCAA GGGCGTTTGC CAGCACCATG ACCTGGGACC GCTCGGCTCC GCCGCCCACA CGCGAGGCTA TGAACGCCAG ATTGAAATCC TGAAGGAATT CGGCGTCAAC TCCATCCGCA CGTCCCACAA CCCGCCTGCG CCGGAAGTGC TGGACCTGTG CGATAAAATG GGCATCCTGG TCATTGACGA ACTTTTCGAC GTATGGCAAT GCTCCAAAGA AGGCGTCAAC AACGAATCCT TTAACGAATG GCATGAACGG GACGTGGTTA ACCTCTGCCA CCGGGACCGA AACCACCCCT GTGTCATTGC ATGGAGTTCG GGAAATGAAG TTCCGGAACA GGGAATGAAA AATCTGCACC ATATCTCCCA AACCCTGACG GATCTTTTCC ACCGGGAAGA CCCCACGCGT AAAGTGACTT CCGGCTGCAA CAACGCCAAT GCCGCACGCA ACGGCTTTGG GGACACCCTG GATGTTTACG GCTATAACTA CAAGCCCTGG GCCTACAAGG ACTTCGCCAA GGACCGCCCC CACCAGCCGT TCTATGGTGC GGAAACCGCC TCCTGTGTCA GCTCCCGCGG AGAATACTTC TTCCCCGTGG ACTGGAACAA AGGCAAGGGA TTCTACCTCT ACCAGGTCAG TTCCTATGAC CTGTACGCCC CCGGCTGGGC CAACCGTCCG GATGTGGAAT TCGCCGCTCA GGAAGACAAT CCCAACAGCG CGGGAGAATA TGTATGGACG GGCTTTGACT ACATTGGGGA ACCCACCCCG TACAATCTGG ACGCCACCAA CGCCCTGAAC GTGCCGGAAG GGCCGGAACG CGAAAAGCTG ATGGCGGAAC TCAAAAAACT GGGAGACCGC GCCCCCTCCC GCAGCTCCTA CTTCGGCATC GTGGACCTGT GCGGCTTCAA AAAGGACCGC TTCTACATCT ACCAGGCCCA CTGGAGGCCG GATCTCAAGA TGGCGCACAT CCTGCCGCAC TGGAACTGGC CGGAACGCAA GGGGCAGGTA ACGCCCGTGC ATGTCTACAC CAGCGGGGAT GAAGCGGAAC TCTTCCTGAA TGGGAAATCC CAGGGCGTCC GCAAAAAGGG CACCGGGGAA AAGGACCGCT ACCGCCTCGT GTGGGAAGAC GTTAAATACA CGCCCGGCAC CCTCAAAGTA GTCGCCAAAA AGGACGGTAA AATCTGGGCT ACGGACACGG TAACCACTAC CGGAAAACCT GCGGCGCTCA CCCTCAAGCC GGACCGCAAT GAAATCAAGG GAGACGGCTA TGACCTGTCT TATGTCACCG TAGCCGTCCG CGACGCCCAG GGCCGTATGG TGCCCCGAAG CAAAAACCAG CTCACCTTCA AGGTAAGCGG CCCCGCGGAC ATCGCCGGCA TCTGCAACGG TGATCCCACG GACTTCACCA CCATGGCGAA TCCGGAAAAC AAGAAAATCA TGAAAATCAA GGCCTTCAAT GGTCTTGCCC AGGTCATTCT GCGCTCCCGC AAGGGAGAAT CCGGAAAAGT GACGCTCCAA GTCATCTCCA ACGGACTCAA GCCGGCTCAG ACAACTGTGA CGGTCAAATA A
|
Protein sequence | MFTNITRTTL CITAFSIASL MAAPLNATKT ESLDWNWKFA RFGKMPDGST QPEPGKAMGF ATATSEESGN PADNAVDGDK STRWCAASGK SGEKITVDMG RPVDVKTVNI LWEKQSNHLF KLEGSGDGKR WATIEDKTSG QNDSKEDTVE NKTGKPRYFR ITVTGNNQSN WASIREITFK NDKGEIIRPQ AAAGTSKADN PSSPSFNDKN WRSLNLPHDW GVEGPFRMEI ENRTGKLPWV GIGWYRKTLE IPADAKGNQF YLDFDGVMSR PKIYVNGHLA GEWKYGYSSF RVDITPFLKF GQQNTIAVRV DNPPSTSRWY PGGGIYRHVW LTESNPVHIE HWGVFVKTPE ITKSAAKVEV DTTVKNTTDK AVIPTVTEEI LDGGKIVAST TTKGEEIPAG EKGKITSTLT LKNPTLWTLN APHLYKMKTT VRMGDKVIDQ KFTNFGVRTV EWKPTGFYLN GERVQLKGVC QHHDLGPLGS AAHTRGYERQ IEILKEFGVN SIRTSHNPPA PEVLDLCDKM GILVIDELFD VWQCSKEGVN NESFNEWHER DVVNLCHRDR NHPCVIAWSS GNEVPEQGMK NLHHISQTLT DLFHREDPTR KVTSGCNNAN AARNGFGDTL DVYGYNYKPW AYKDFAKDRP HQPFYGAETA SCVSSRGEYF FPVDWNKGKG FYLYQVSSYD LYAPGWANRP DVEFAAQEDN PNSAGEYVWT GFDYIGEPTP YNLDATNALN VPEGPEREKL MAELKKLGDR APSRSSYFGI VDLCGFKKDR FYIYQAHWRP DLKMAHILPH WNWPERKGQV TPVHVYTSGD EAELFLNGKS QGVRKKGTGE KDRYRLVWED VKYTPGTLKV VAKKDGKIWA TDTVTTTGKP AALTLKPDRN EIKGDGYDLS YVTVAVRDAQ GRMVPRSKNQ LTFKVSGPAD IAGICNGDPT DFTTMANPEN KKIMKIKAFN GLAQVILRSR KGESGKVTLQ VISNGLKPAQ TTVTVK
|
| |