Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0036 |
Symbol | |
ID | 6275170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 45594 |
End bp | 51467 |
Gene Length | 5874 bp |
Protein Length | 1957 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642612077 |
Product | YD repeat protein |
Protein accession | YP_001876664 |
Protein GI | 187734552 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.13873 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAC GGCAGTTTCA GCATCATGTG GATTCTCTGC CTGATAATGC TCTCACAGTG TCCGGAGGTG GGCCGGCTCC GTTTGACAAG GAGGCCGGGG CGTTTAGCGG CAGCTGGGAC TGGGGGATTA TTGTTGATCC GCTCGGTCCG GGGGAAACGG CAGAGTGCCG TTTAATGCTG GGGGTAGATG ATAACGGTAC TCTGACCGTT GACGGTGAAG GAATTTTCAT TCCAGGGGAG GGTAAATACC ATGGAGGCAC TTATAGGGAA AAAAGCTTGA GCTTTCCCAT TGAACCGGGA CCGCACCGTG TACACATGAC TTATGAGAAC GTTGCTGTTC CTCCTGAATG GACTAATCTG GCAATCCTTA ATTATTCCAT TGAAGTCATG GTCTCGGATG GCACCTCTTC ATCCTCCTAT CAGCCGGATG AAGGTTATCC GGAACCTGTG AGTACGGAAG ATGAGGGAGA AGACGTGCCG TGTGAATGCG GGGGGGACGG CAATGGCTCC CAAAGCAGTT CCAGTTCTCC CTGTCCGGAG GAAGACAACG GAGAAGGGGA TAACGGTGAA GATGAGGCCC CCTGCACCTG TGAAAACAAT GAGGGCGGCA GCAGCAGCTC TTCGGCGCAC GGCGTGCGCG GTTCCTCCAC CCTTTATGGT TCGTCGGCTG CCGGCAGGCG CGTGGTCGCC CGGACGCTCA AGACGGAAAT GGTCTGGCGT ACCAATTTTG GCTCTTTCCG GGGCATGACG GGGTTGCCGC AGGGGTTGCT GGACATAGTA GGGTACACAT TCTCGTCTGA ACTCTGGTCC CCTCGTGCGC TCCATTACTG GCATCCGATG ACTCATGAAA TCATTTCGGC GCCGCTATCG GGAATTGGAG CGAATACCGC TTTCCAAATC AGGAGCGGCG GAACTGACAT TAATTACTAC TGCTACGCGG ATGGTGGCGG CAGCGGGGTT GACAGTGTTG CCCCCATTGC GGGATCCGCC AAGCGCGGAG GGTCTGCGTG GTTCGGGAGG AGGTCCGCTT CCTCAGCAAA GGCGGGAGAA TTCTGCCTCA GTATCCGTTC AACCGCGGGC AATACTGTTA ACTATACCGC AGGTGTGGCC TCTTCCTTGA AATACTCCAC CGGCTACACC GCTAAAAATG GCGCGTCCTA CACAAGAGAA GAATTTGACG GAAAGCTGGA TATCGTGCGG GCTTCGGATG GAAGCATCCG GCAGATATGG AACCTGTGGG ATGGGCTTGC CAACATCGAA AATGTTACGG AATCCGGATA TCGCATCGCC CTCTACCTCC CCGAACAGGT GGATGGGAAA AACAGCTCCA CCGGCCTTTA TCCCGTTACG GGCGACCCAT TTAAAACATT CACGATCACC GGGGACGCGG CAGCGGGCAG GCTTGCCGTC ACCGAACAAA CTGCCGGGCG CGCGCCCTTC ACCACCCGCT ACTGGCAGGG AACTGACGGA GCATGGAACA TGTCTCAGGG GGAAGGAGAA GATTCCATCT TCACTCTGAA GGAAAAGCAG ATTGTCTCTC CCGGAACGTG GAAACTCATC ACTACCATCC AGAGAGGGGA AAACGGTATC CCTATCTCCC GTGTCTGTGA GACTTATGCC GTCACCCGCA ATGGCAATTT GTGCACCAGC CGTATTGAAG GCTACGGAAC GGATTACGCC CGGGAGACGA CCTACGAGTA CACAGGCATG GGAAAAATCG CGAAGGAAAC TGCTCCTGAC GGAAGTGTCA AAACATGGGC GTACGACCGT TTTGGGCGTG AAATCGTGTC CAGTGTCCCC TGGGCGGAAG CTGGAGATAA AGTCACTTAC ACCACTTATC GGGATCAGAC GCAGGCGGAT CCGGATATTC TCACCCAATG GGTGACGTTG ACGGCTACTG CGGCGGAACT CTGGCGAACG GATTATACTT ACATTGAGGA AAATCATGTG CGGCGCGTGG AAAAACGCAC CACCGCATTG GGGGAAGAAA ATGTACGGCT GGAAGTGAAG GAATCCTGGC TCGGCACGGC TCCCAATGTT TATGCCCGCG GACGCTTGAA AATGAAACAG GACATGGGCG GAATTCAGAC GCATTACGCT TATGAAGAGA CGGACCAGTA CGGAGCCCTT TACAAGGTGA CTGCGGAAAC GCGGATAGCG GGTGCGTGCG TGCCCGGCCT GAGTTCACGG AAGGTTACTT ACGTTTCCGG CCAGGGCAAT AACATGCGTT ATGAGCAATA CGTACAGCTT GCCGATGGTG TTTGGTCAAT GACGGATGCG TCTTCTTATG AATATGATGT TGAGAACCGT TGGGTAAAGC GCACCCGCGC CAATGGACGC GTTTACGAGC GCATGATGAC CTGTTGCGGC CCTTTATGGG AAAAGGATGA GAATGGCGTG ACGACTTCCT ATTCCTACAA TACGGCCCGC CAGCTGACAG AAACCATCCG TTCAGCCATT GCAGACGGGG AAACCATGAT CACTCCGGAA ACCATCACGA GTTACACTCG TGATGCCCTC GGTAAAATTA CCTCCCTGCG CAGGGATACA GGCCCAATGA CGACGGTGGA AAGCCGGGAA TACGATTTGC TGGGGCGTCT GGTAAAAGAA ACCGATATTC TGGGGCGCTC CACCGTTCGC TCCTATAGCG GGGATGGCTT GGTAGAAACT GTTACCACCC CCGCCGGCGC TACCCTCATC ACCCGGAAAA GTGCTTCCGG AACAATATTG CGGCGTTACG GCACCGGGCA GCAGGATATC CTCTATACTG TTGAGGTCAC GGAAGAAGGG ATCCGCACGA CGGAGGCTGT GCCGGGCGGG GAGGGGGGCG ATCCCCACGT CGTGACGGGA AGCTCCACTG TCAACGGCTT TGGCGACCTT GTACGGGTCG CTGCCGCCAA TACCTTGAAT GGGGAAAACG TGCGCACGCT TGCATACGAT AACAAGGGGC GCCTCATCAG GGAACAGCTG GCGGATATGG CGCCCGCCCT CTACGCGTAT GATGCTTTTG GGAACAGGAC CAGGGCAATC GTTGCCCTGG AGGATGACCC GACCATCCTG AACTCCCGCA TTACTGACTA TGCTTATGTG ACCGAGAGCC GTGAAGACGG AGTCTATAGT CTTGTTTCCA TAACAAGCTA TACTTCCTCA GGTACTCCTG TTCTTCAAAA GCGGGCGGCC CTGCTCTCTG CCCTGAGCCC TGTCCTTGCC GGCAAAACCG TCATGACGGA TTCCCGCGGC CATGATGCCG TGGAATGGAT GGAATATGGC GGCGGTTCTG TTCGCCTGCG GAAGAAAACC GTGCCCGGCG TCGAGAGCGT CTTCCTCACG CGCATCGTGG ACGGATTCAC GACGGCGGTT ACGGGATTTG ACGGTGCAAC TGTCCTTCGG CGGCGCACCT ATACGGAAAC GGGCATCACC TACGCGGACA CAGATGTGCG CGGGAATGTC TTTACCTCCG TCTGCGACAT CGCCGGCCGC ATCATCTCCG GCACGGATGC CGCGGGCAAC ACGACCACTT ACGCTTATGG CCAGCCTTTT GATCTGCCTA CCTGCGTTAC TAATCCTCTT GGCAAAACGG CCTGTTCTTT CTATGACATC CGGGGCCGCA AGGAAGCCGA ATGGGGGACA GCCGTACAGC CCGCCGTATA TGCTTATGAC GCCGCCGGCC ATATGGTTAG CCTTTCCACT TTCCGTGTTC CCGGAGATGT CATTACCACT GATCCCCGCC TCCGGACGGA TGGGGACATC ACCACATGGA CGTATGATAT CGCTACGGGG CTGGTTATCC GCAAAACCTA TGCCGACGCC ACCCATGTGG ATACCGTTTA CGATATGCTC AACCGCGTTG CCGCTACGAC GGACGCGCGA GGAACCGTTG CTTCGCGCTC CTACGCCCCC CTTACTGGGG AGCTCGTTTC CATTACCTTT AATGACGACG GTTTTACCCC CTCCATCAGC AGCCTCTACA ATCATCTGGG GCAGCTGACG CAGATTGACG ACGCTTCCGG AACGCGCATG TTCACGTACA ACCAGTATAA CGAACAGGAA ACGGAAACGA CGGCTGGCCT TGCGGCAAGC GTCTTAACCC TCCGCCGTGA CGGAGTGGGG AGGCCTGCGG GCTACTGTCT GGATTATGCG GGCTCCCCCG CCTTGCAGAC GGCATGGGCC TATGACGCCT ACGGAAGGCT ATCCTCCGTT TCGCTGAACG CCGTCGGGAA GCCCTTCACC TATGGCTACA ATGAGGAAAC CGGCCTGCTG GACACTCTTG ATTACCCCAA CACCCTGAAA CGGTGGCGCA CTTTGGAAGA AAAGCGCGAC CTCCCGGTGA AGATTGACTA CCTGCGCCCC GGCAGCGCCA ACTACCCGGC TAAAACCGAC TACTCTTACG ACATACTGGG GCGCCCGGTC ACGAAGAAAG ACTACTTTAA CGCGCCTGCA CCCGACCTGA CGCACACCTG CGCCTACGAT GACAGGAATG AACTTGTGAG CGATGCGATG AGCCGGGGCG GCACATACAG CTACTCCTAC GATAACATAG GCAACCGCAA AACGTCCCTG GAGGGAACGG ATTCCCTTCC CACTACATAC GTTGCCAACC GGGTCAATCA ATATACGGAT ATCACCGAAG GTGAGGAGGC TCCTTTTGTG CCGAATTATG ACGCCGACGG CAACCAGACG AAGCTCCGGA CCGCCACCGG GGAATGGGAA GCCTCCTACA ACGCGTTGAA TCAGGCGGTC AGCTTCATAC AGGGGGACAG GCGTATTGAA TGCGTATACG ATTATCTGAA CAGGCGGGTT GAAAAATCCG TCTATGAGGG AGAATCGCTT ATGTCAAGGA AACGGTTCAT CTATCACGGG TACCTGCAAA TCGCGGAACT GGATGCCACG GAGGTTTTGG AGTCTGTGGC GCCCGTCCTG CGTAAAACGT ACCTGTGGGA TCCGCAGGAA CCGGTGGCCA CGCGCATTCT GGCCATGGGC GTCTTTGATG AAACGGGAGC CTACGTGGAA GATCTTTACT ACACGCATGA CGCATTGAAG AACACGACGG CGCTCTTCGG CATCAAGGCG GGGCGCCGAG CCTTGTACGA ATACGGCCCG TACGGCTCTG CCGTGAAGAT GGAAGGAAAT GCGGCGGAGT TGAATCCGTT CCGGTTCTCC AGCGAGTATG CTGATGACGA GCTGGGGTTG GTTTACTACA ATTACCGCTA TTATAATCCC CAAAATGGTA GGTGGATCAG CAGAGATCCT ATGACAGAAA AAGAGAGCTA TCTTTTGTAT GGATATGTTA ATAATATGCC TACCTTATAT TCTGACGAAT TAGGATTAGC GCGTACAATA ACTACAAATA AGGATGATTG TTCGATCAAT GTGAGCCTGA ATATCGTGAT ATATCCGAAA GGTGGAGATT CAATTAATAA TATAGAAATG CGTACTACAG CACAGCGAAT AAAACAATCG ATTGAAAGCA ATTGGAATGG ATATGAGAAA GGATGCTGTG TAGTTAATGT GACAGCGGAT GTGTCTGTTC AGAGCAGGAA AAGCAGATGG CTTTATAGAT TTCTCAATAG TGATGAGAAT AATATAGAAA TAACATCAGA TTCATCTCAT CGTTCATATG TTAACGGAGT TGGAGGCAGA TATGGAGTAT GGGGTTCTCA GGCCGTCCCA TGGGTTTATG CGCATGAAGC CGGCCATCTT ATGGGGCTTT CTGACGATTA TCAAGATGTC GCCAATTCAG ATGGCTCCTT AACTTCCGTT CCTAACGCGG GACATGAAGG ACATATAATG GGGGAATATG GAGGAAAAGC CAATCAACAT GAAATAGATG CTATTTTAAA AAATATCGAA TGTCCATGCG ATGAAAATCA ATAA
|
Protein sequence | MKKRQFQHHV DSLPDNALTV SGGGPAPFDK EAGAFSGSWD WGIIVDPLGP GETAECRLML GVDDNGTLTV DGEGIFIPGE GKYHGGTYRE KSLSFPIEPG PHRVHMTYEN VAVPPEWTNL AILNYSIEVM VSDGTSSSSY QPDEGYPEPV STEDEGEDVP CECGGDGNGS QSSSSSPCPE EDNGEGDNGE DEAPCTCENN EGGSSSSSAH GVRGSSTLYG SSAAGRRVVA RTLKTEMVWR TNFGSFRGMT GLPQGLLDIV GYTFSSELWS PRALHYWHPM THEIISAPLS GIGANTAFQI RSGGTDINYY CYADGGGSGV DSVAPIAGSA KRGGSAWFGR RSASSAKAGE FCLSIRSTAG NTVNYTAGVA SSLKYSTGYT AKNGASYTRE EFDGKLDIVR ASDGSIRQIW NLWDGLANIE NVTESGYRIA LYLPEQVDGK NSSTGLYPVT GDPFKTFTIT GDAAAGRLAV TEQTAGRAPF TTRYWQGTDG AWNMSQGEGE DSIFTLKEKQ IVSPGTWKLI TTIQRGENGI PISRVCETYA VTRNGNLCTS RIEGYGTDYA RETTYEYTGM GKIAKETAPD GSVKTWAYDR FGREIVSSVP WAEAGDKVTY TTYRDQTQAD PDILTQWVTL TATAAELWRT DYTYIEENHV RRVEKRTTAL GEENVRLEVK ESWLGTAPNV YARGRLKMKQ DMGGIQTHYA YEETDQYGAL YKVTAETRIA GACVPGLSSR KVTYVSGQGN NMRYEQYVQL ADGVWSMTDA SSYEYDVENR WVKRTRANGR VYERMMTCCG PLWEKDENGV TTSYSYNTAR QLTETIRSAI ADGETMITPE TITSYTRDAL GKITSLRRDT GPMTTVESRE YDLLGRLVKE TDILGRSTVR SYSGDGLVET VTTPAGATLI TRKSASGTIL RRYGTGQQDI LYTVEVTEEG IRTTEAVPGG EGGDPHVVTG SSTVNGFGDL VRVAAANTLN GENVRTLAYD NKGRLIREQL ADMAPALYAY DAFGNRTRAI VALEDDPTIL NSRITDYAYV TESREDGVYS LVSITSYTSS GTPVLQKRAA LLSALSPVLA GKTVMTDSRG HDAVEWMEYG GGSVRLRKKT VPGVESVFLT RIVDGFTTAV TGFDGATVLR RRTYTETGIT YADTDVRGNV FTSVCDIAGR IISGTDAAGN TTTYAYGQPF DLPTCVTNPL GKTACSFYDI RGRKEAEWGT AVQPAVYAYD AAGHMVSLST FRVPGDVITT DPRLRTDGDI TTWTYDIATG LVIRKTYADA THVDTVYDML NRVAATTDAR GTVASRSYAP LTGELVSITF NDDGFTPSIS SLYNHLGQLT QIDDASGTRM FTYNQYNEQE TETTAGLAAS VLTLRRDGVG RPAGYCLDYA GSPALQTAWA YDAYGRLSSV SLNAVGKPFT YGYNEETGLL DTLDYPNTLK RWRTLEEKRD LPVKIDYLRP GSANYPAKTD YSYDILGRPV TKKDYFNAPA PDLTHTCAYD DRNELVSDAM SRGGTYSYSY DNIGNRKTSL EGTDSLPTTY VANRVNQYTD ITEGEEAPFV PNYDADGNQT KLRTATGEWE ASYNALNQAV SFIQGDRRIE CVYDYLNRRV EKSVYEGESL MSRKRFIYHG YLQIAELDAT EVLESVAPVL RKTYLWDPQE PVATRILAMG VFDETGAYVE DLYYTHDALK NTTALFGIKA GRRALYEYGP YGSAVKMEGN AAELNPFRFS SEYADDELGL VYYNYRYYNP QNGRWISRDP MTEKESYLLY GYVNNMPTLY SDELGLARTI TTNKDDCSIN VSLNIVIYPK GGDSINNIEM RTTAQRIKQS IESNWNGYEK GCCVVNVTAD VSVQSRKSRW LYRFLNSDEN NIEITSDSSH RSYVNGVGGR YGVWGSQAVP WVYAHEAGHL MGLSDDYQDV ANSDGSLTSV PNAGHEGHIM GEYGGKANQH EIDAILKNIE CPCDENQ
|
| |