Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0735 |
Symbol | |
ID | 6273783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 865444 |
End bp | 871260 |
Gene Length | 5817 bp |
Protein Length | 1938 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642612786 |
Product | YD repeat protein |
Protein accession | YP_001877352 |
Protein GI | 187735240 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.206584 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0550996 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGAAC AATCCTTTGA TAACTATGTC AATTCCCTGA CGGGCATGAG CAATTCCAAT AGCGGAGCTC CCTCCGCGAT CGACGAACAA GCGCCAACGG ACGCTTTTGA AAGAAGCTGG GACTGGGATT TTGAAGTCAG GGAGCTGGGG CCTGATGAAA CGGCCGAATG CAAGGTGACC ATGGGCGCCG ACGATCTGGC TAACTTGACC GTGGATGGAG AAGAACGGCT GGACATCGGC CCGCGCGGAC AGTACGGAGG CGGCAGCTAC GAGCCGCAGA CGGCTTCTTT CAGCATTGAG CCCGGAATGC ATCGGGCACA TGTGGACTAT AGCAATATTT CCATTCCCAA TGCCAATAAC AACATTGCCA AATTCACCTT TGACCTGAAG GTGGAAATTA CCAACCGGCA AACGGGTTCT TCTTCTTCCT ATGTGCCCCC GGAGACGGAA ACGGAGCCAG TGGACAACAA CGACGAGGGC GATGATGATC CATGCGGAGG CTCCAGCAGT GGAAGCAGCA GTTCTCCCAA CAGCAGTTCC AGCAATCCGT GCCCGAATGG CGACAACGGC GGAGATGAGG ATGAGATTGA TCCGGACAAT CCATTTTCCC CGGATGACTG CATGGACAAC CGGGGCGGTT CCCCCGGCGC GTCTGCCGTG CGCAGCCTGG TTTCCTCCGC CTCCGCCTAT GGAAAGTTCT CTTCCGCAGG CAAACGGGTG ACTGCCCAGA CGCGGAAAAC CAGCATGGTA TGGCGCACCA GCTTCGGTTC CTTCCGCGGT ATGGAAGGCG TGCCGTACGG GATGCTGGAG ATCGTGGCTT ATAACTTCTC TTCCAGGTTG TGGACGCCCG CAGCCCTGCA ATACCTGCAT CCCATGGCCA GCTGTATTTT GCCCCCTTCC GGTCGGGAGC TGGGGGCTGA CATGGCATTC CAGATCCGGA ACGGAGGCAC CCGCGCCAAC TATTACTGCT ATGCCGGCGC GGCCAGTGCA GGCTCCATCG GCGGCTCGCA GAAAAGGACG GGTTCCGTTT CCATGGCGTA TGCCGCGGCA GAGGGGCGTG CCGTCTCGGC TTCCGCCAGC GCGGCGGAAA TGAGGGTCAG CAACGCCAGG GGCAATACAG TCATCTACGG CGGTTCTTCC GTTTCCGCTT TGGGTGCGGC CTCCGGCTAC CGGACCAAGC TGGGTTCTTC TTGGACAGCT CAGGATTTTG CCAATTACCT GGACATCGTC CGAAGCGCGG ATGATGTCAT CCGCCAGGTC TGGAACCTGT GGGACGGTCT GGCCAATATT GAAAACGTGA CGGATACGGG CTATGTGATC GCCTTTTATC TGCCCGAGCA GGTGGGGGCC AAAAATGTCT CCACCGGTCT TTACGCCGTT ACGGGGACGC CTTTTAAAAT CTTCACCATT GAGGGCAATA CGGAAACCGG CAAGCTCACC GTCACGGAGC AGGCGGAAGG GCGCGCGCCT TACGTCACCC GCTACTGGCA GGGGACGGGA GGGGCCTGGT GCATGTCCCA GGGGGAAGGG GAAGACTCTA TTTTCACGAT CCGTGAAAGG CAGGAGGTTT CTTCGGGGAT TTGGAAACTC TTCACTACCG TGCAGCGCGG GGAAAACGGC ACTCCTATTT CCCGGGTGTG CGAAACCTAT GAACAGGGCC GCAGCGGCAA CCTGTGCACC AGCCGTATTG AGGCTTATGG AACCGACTAT GCCCGTGAAA CGACTTACGC TTATAACGCC GTGGGCAAAC TGATCCGGGA GACGGCTCCC GACGGAAGCG AGAAGACCTG GTCCTACGAC GCCTTCGGGC GTGAAACCGT CCGGATGGAA CCCTGGGCGG GCGGGGGAAG GAAGGGCACC TACACCTATT ACCGCTGCTC CGACCATGCC GATCCGGATA TTGCGCACCA GTACGTGGTG CTCACTATGA ACGCCGCACG GCTGGCGGAT ACGCATTACA CCTATACGGA AGCCAACCAT GTGCGCCGTG TGGAAAAACG CACCACGGCG TTGGGTGCGG AGGGAGAACA GCTGGAAGTG ACGGAAACGT GGCTGCCGGC GGCGCCCAAC GAATACGCCC GGGGGAGGCT GAAGATGAAG CAGTCCGCCA GCGGCGTGCA GACGGTCTAT GGCTATGAAG CGGCCAGCCA GTACGGCGCC CTCTACAGGG AGACCAGGGA AACGCAGATA GCGGGGCAGG CCGTGCCGGG GCACAGCACG AGGAAAGTCA CGTACGTTTC CGTTCAGGGA AATAACACGC GCATTGAGAA ATACGCCTTG CTGACGGATG GAACCTGGAC GCTGACGGAT ACGGCGGATT ACGAATACGA CAGAGAAAAC CGGTGGATTA AGCGTACGCG CGGCAACGGC AGAGTGACGG AACGGGAGAT GATGTGCTGC GGCCCCTTGT GGGAAAAGGA TGAAGACGGC ATCATGACCA CCTACTCCTA TAATACGGCG CGCCAGCTGG TGGAAGTCAG CCGGTCGGAA GTCGCGGACG GAGAAACGGT CGTCACTCCT GAAACCATCG TCAGCTACAA GCGGGACGCC TTCGGCAGAA TCCTGCAAAC GCGCCGGGAC GTCGGCCCCA TGACGACGAC GGAAAGCAAG GTTTACGATC TGCTGGGGCA GCTGGTGCAG GAAACGGATG TCTTAGGAAG GAGCAGTACG CGTGCCTACA GTGCGGACGG CCTGACGGAA ACCGTCACCA CGCCGACGGG AGCGACGCTC GTCACTATCC GCCATGCGGA CGGAACCGTG CTGGAACAAA GCGGCACGGG GCAGAGGCAT CTCCTCTGCC GTACGGAATA CTCCGCGGAA GGCGTGGTTC GTTCCACGCT TCTTCCCCGG GCGGAGGGAG AACCGGAGCT GGTGGAACAA ACCGTCACGG ACGGAAGGGG CAACATGGTG CGTGTCTCCC GGGCGAACGC CAACGGAGGC CTTGTTCATG ACCGGCGCGT TTTTGATCTG AACAATAGGC TTTTGCGGCA GCAGGTGGAT GGAATGGCTC CTTTGCTTTA TGACTATGAC CCGTTTGGCA ATATCGTCAA AACCACGCTC AAGCTGGCCG AGAACCCCAC TCCCGCCAAC TCCCTCGTCA CGGAGTACGC CTATGCCCGC CGGCAACGGG AAGACGGCGT GTACCGGGTA ACCACAGTCA CGCGCTGCAA CAGCCAGGGA ACAACATATG CGGAAAGCAC GGCCGAGCTG GTTTCTTTCC TGTCCTCCTC GCTGGCCGGA AAAAACATCT CCACCGATCC GCGGGGCAAT GAAACCCTGC AATGGACGGA ATACACGGCT CCGGCCAGAC GGACGGTAAA AACGCAGTCC CCGGCTTCTT CCGTCATTGC GGAAACTGTC GTCATAGACG GGTACACCGT ATCCCGGAAA GACCATGCGG GTGTTCTGAC CGCCTCTTCC CGCGCTTATA CGGCCAGCGG CAGCACGGAG ACTTATACGG ACGCCCGTGG CAATGCCGCC GTTACCGTCT TTGACATCGC CGGCCGTGAA ACAGCCAGGA CGGATGCAGC CGGCAATACG ACCACCATCC AGTATGACCC GTCCACAGCT TCTCCCTCCT GCGTCACGGA TGCTTTGGGC AACACGGCCT GCTACGCTTA TGACCCGCGG GGACGCAAAA CCGCCGAATA CGGTACGGCC CTCCAGCCCT CCGTCTTTGC GTACGATGAT GCGGACAGGC TGGTATCCCT CATGACATTC CGCGTTCCGG GGGAAACCAT CGCTGCCGAT CCGCGGGAAC GGACGGACGG GGACATGACG ACGTGGGGCT ACGACGACGC CTCCGGCCTG ATGACGGCTA AAACCTATGC CGACGGCCAT GGGGAAAGCT ACTCTTACGA TGACTGGAAC AGGCTGGCGG TTAAACGACA GGCCCGGACG GTGGACGGGC AGGGAACGCC TCTGGCAACT TCTTATGCCT ATGATCCGCA GACGGGCAAC CTGGTCTCCG TCATTCACAA TGATGCGACT CCTTCGCTCA ATTATGTCTA CAACCACCTG AACCTGCTCA CTCAGGTTGC GGATGATTCC GGAACAAGGA TGTTGGCTTA CAACCAATAT AATGAAGCGG AATCGGAAAC TACGGCAGGA CTGGCGGCAA GCGCGCTCAA CTATTTGCGC GACGGTTTGG GGCGGCCTTC GGGCTACAGT CTGCATTATG GAGAGGGTAT TGTCCAGCAG ACGGCCTGGG AATATGACGG CTGCGGACGT CTTTCTACGG TCTCGCTCAA TAACGGCGCC GATCCCTTCG TCTATGGCTA CCACGCCGTC AACGGACTGC TGGAAACGCT CGACTACCCC AATACCCTCC GGAGATGGTA CACCCGGGAA GAAAAACGGA ATCTGCTGAC CGGAATCGAC TATCTGCGTC CCGGCAGCGC CAATTATCCG GCCAAAAACG ACTATGCCTA CGACGCGCTG GGAAGGCCCA CGGAAAAGAA GGACTACTTC AATACCCCCG CTCCCGACCT GACGCACAGC TACAGTTACA ACGGCCGCGG CGAACTGGCC GCCGACGCGA TGAGCCGGGG AGGAACGTAT TCCTATGCGT ACGACAACAT CGGCAACCGC GTCACCTCTC GGGAAGGTTC GGGCGCGTCA GCGGAGGCGT ACACGGCCAA TAATCTGAAC CAGTACACGG CCATCACCCG GGAGGAAGGA GCGTCTTTTG CACCTGCCTA TGATGCCGAC GGCAACCAGA CGAAGATTCA AACGTCTACG GGAGAATGGG AAGTCTCTTA TAATGCCCTG AACCAGGCGG CAAGGTTCAT TCAGGGGAAC AGGCGGGTGG AGTGCCGCTA CGACTATCTG AACAGACGGA TTGAGAAAGC CGTCTATGAA GGAGAGATCC TGATGTCGAA GAAACGGTTC ATCTATCACG GCTACCTGCA AATCGCGGAA CTGGATGCCG CCGCGACGGA ATCAGCGATG CCCGTACTGC GAAAAACCTA TCTGTGGGAT CCGCTGGAAC CGGCAGCCAC GCGCATCCTG GCCATGAGCC TCTTTGATGA GACGGGAACC TGGGTGGAAA ACCTGTACTA CACGCACGAC CTGTTGAAAA ACACCACGGC GCTTTTCGGC ATCAGAGCGG GACGCCGCGC CTTGTACGAA TACGGCCCGT ATGGGAATAT TCTCAGGATG GAAGGGAATG CCGCAGAGGA CAATCCGTTC CGGTTTTCCA GCGAATACGC TGATGACGAA CTGGGGCTGG TTTACTACAA TTACCGCTAT TATAATCCCC AAAATGGCAG GTGGATTAGT AGAGATCCTA TTATAGAGAA ACAGAAAGAT AATGTTTATT CATATGCGTA TAACACACCT TCTATTTTGA TTGATGTGCA GGGGCAATTC GCGTTTGCGA TTGCTCTATT TAATCCTATA GGAGCGGCGG TGGTAGCGGC GGCGGCAGTA GGTGTAGCTG TTGCGGTAGT GGTGGTAGTT GCCGAAAAAG TAATAGATGA AATAAGCTCG GATACTACAA AGAAAACAGT TCCAGAAACA GTTCCAATAG CAATTCCTCA AAAACCTAGA TATGGAAATT GTTCAAAACA AAGACATAGT GAATTAAATA AAGAGGTTGG TCGTAAGTGC AAAGGTTCAT CAATGCATTG TAAAAACAAA AATATGTGTA AAAATGAAAT AGAAAGAAAT ATAAAAAGAT TTCAAGACTG TATTGATGCA AGAACAAAAA TAAATAATGA ATGCTTCAAT GGTGGAGATA ATGCACATAA TGATGAAATT GAGCGCGCTT TAGCGGGCAA AAAACGTTGC CAAGATAAAC TCAATCAATT ATTATGA
|
Protein sequence | MKEQSFDNYV NSLTGMSNSN SGAPSAIDEQ APTDAFERSW DWDFEVRELG PDETAECKVT MGADDLANLT VDGEERLDIG PRGQYGGGSY EPQTASFSIE PGMHRAHVDY SNISIPNANN NIAKFTFDLK VEITNRQTGS SSSYVPPETE TEPVDNNDEG DDDPCGGSSS GSSSSPNSSS SNPCPNGDNG GDEDEIDPDN PFSPDDCMDN RGGSPGASAV RSLVSSASAY GKFSSAGKRV TAQTRKTSMV WRTSFGSFRG MEGVPYGMLE IVAYNFSSRL WTPAALQYLH PMASCILPPS GRELGADMAF QIRNGGTRAN YYCYAGAASA GSIGGSQKRT GSVSMAYAAA EGRAVSASAS AAEMRVSNAR GNTVIYGGSS VSALGAASGY RTKLGSSWTA QDFANYLDIV RSADDVIRQV WNLWDGLANI ENVTDTGYVI AFYLPEQVGA KNVSTGLYAV TGTPFKIFTI EGNTETGKLT VTEQAEGRAP YVTRYWQGTG GAWCMSQGEG EDSIFTIRER QEVSSGIWKL FTTVQRGENG TPISRVCETY EQGRSGNLCT SRIEAYGTDY ARETTYAYNA VGKLIRETAP DGSEKTWSYD AFGRETVRME PWAGGGRKGT YTYYRCSDHA DPDIAHQYVV LTMNAARLAD THYTYTEANH VRRVEKRTTA LGAEGEQLEV TETWLPAAPN EYARGRLKMK QSASGVQTVY GYEAASQYGA LYRETRETQI AGQAVPGHST RKVTYVSVQG NNTRIEKYAL LTDGTWTLTD TADYEYDREN RWIKRTRGNG RVTEREMMCC GPLWEKDEDG IMTTYSYNTA RQLVEVSRSE VADGETVVTP ETIVSYKRDA FGRILQTRRD VGPMTTTESK VYDLLGQLVQ ETDVLGRSST RAYSADGLTE TVTTPTGATL VTIRHADGTV LEQSGTGQRH LLCRTEYSAE GVVRSTLLPR AEGEPELVEQ TVTDGRGNMV RVSRANANGG LVHDRRVFDL NNRLLRQQVD GMAPLLYDYD PFGNIVKTTL KLAENPTPAN SLVTEYAYAR RQREDGVYRV TTVTRCNSQG TTYAESTAEL VSFLSSSLAG KNISTDPRGN ETLQWTEYTA PARRTVKTQS PASSVIAETV VIDGYTVSRK DHAGVLTASS RAYTASGSTE TYTDARGNAA VTVFDIAGRE TARTDAAGNT TTIQYDPSTA SPSCVTDALG NTACYAYDPR GRKTAEYGTA LQPSVFAYDD ADRLVSLMTF RVPGETIAAD PRERTDGDMT TWGYDDASGL MTAKTYADGH GESYSYDDWN RLAVKRQART VDGQGTPLAT SYAYDPQTGN LVSVIHNDAT PSLNYVYNHL NLLTQVADDS GTRMLAYNQY NEAESETTAG LAASALNYLR DGLGRPSGYS LHYGEGIVQQ TAWEYDGCGR LSTVSLNNGA DPFVYGYHAV NGLLETLDYP NTLRRWYTRE EKRNLLTGID YLRPGSANYP AKNDYAYDAL GRPTEKKDYF NTPAPDLTHS YSYNGRGELA ADAMSRGGTY SYAYDNIGNR VTSREGSGAS AEAYTANNLN QYTAITREEG ASFAPAYDAD GNQTKIQTST GEWEVSYNAL NQAARFIQGN RRVECRYDYL NRRIEKAVYE GEILMSKKRF IYHGYLQIAE LDAAATESAM PVLRKTYLWD PLEPAATRIL AMSLFDETGT WVENLYYTHD LLKNTTALFG IRAGRRALYE YGPYGNILRM EGNAAEDNPF RFSSEYADDE LGLVYYNYRY YNPQNGRWIS RDPIIEKQKD NVYSYAYNTP SILIDVQGQF AFAIALFNPI GAAVVAAAAV GVAVAVVVVV AEKVIDEISS DTTKKTVPET VPIAIPQKPR YGNCSKQRHS ELNKEVGRKC KGSSMHCKNK NMCKNEIERN IKRFQDCIDA RTKINNECFN GGDNAHNDEI ERALAGKKRC QDKLNQLL
|
| |