Gene Amuc_0735 details

Gene Information

Locus tag: Amuc_0735
Symbol:
ID: 6273783
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 865444
End bp: 871260
Gene Length: 5817 bp
Protein Length: 1938 aa
Translation table: 11
GC content: 57%
IMG OID: 642612786
Product: YD repeat protein
Protein accession: YP_001877352
Protein GI: 187735240
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
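
The length fields above appear to follow directly from the start and end coordinates. The short Python sketch below checks that arithmetic. It assumes, without confirmation from this record, that the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative only.

```python
# Coordinates copied from the Gene Information fields above.
start_bp = 865444
end_bp = 871260

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under those assumptions the computed values match the listed Gene Length (5817 bp) and Protein Length (1938 aa).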


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.206584
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0550996
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGAAC AATCCTTTGA TAACTATGTC AATTCCCTGA CGGGCATGAG CAATTCCAAT 
AGCGGAGCTC CCTCCGCGAT CGACGAACAA GCGCCAACGG ACGCTTTTGA AAGAAGCTGG
GACTGGGATT TTGAAGTCAG GGAGCTGGGG CCTGATGAAA CGGCCGAATG CAAGGTGACC
ATGGGCGCCG ACGATCTGGC TAACTTGACC GTGGATGGAG AAGAACGGCT GGACATCGGC
CCGCGCGGAC AGTACGGAGG CGGCAGCTAC GAGCCGCAGA CGGCTTCTTT CAGCATTGAG
CCCGGAATGC ATCGGGCACA TGTGGACTAT AGCAATATTT CCATTCCCAA TGCCAATAAC
AACATTGCCA AATTCACCTT TGACCTGAAG GTGGAAATTA CCAACCGGCA AACGGGTTCT
TCTTCTTCCT ATGTGCCCCC GGAGACGGAA ACGGAGCCAG TGGACAACAA CGACGAGGGC
GATGATGATC CATGCGGAGG CTCCAGCAGT GGAAGCAGCA GTTCTCCCAA CAGCAGTTCC
AGCAATCCGT GCCCGAATGG CGACAACGGC GGAGATGAGG ATGAGATTGA TCCGGACAAT
CCATTTTCCC CGGATGACTG CATGGACAAC CGGGGCGGTT CCCCCGGCGC GTCTGCCGTG
CGCAGCCTGG TTTCCTCCGC CTCCGCCTAT GGAAAGTTCT CTTCCGCAGG CAAACGGGTG
ACTGCCCAGA CGCGGAAAAC CAGCATGGTA TGGCGCACCA GCTTCGGTTC CTTCCGCGGT
ATGGAAGGCG TGCCGTACGG GATGCTGGAG ATCGTGGCTT ATAACTTCTC TTCCAGGTTG
TGGACGCCCG CAGCCCTGCA ATACCTGCAT CCCATGGCCA GCTGTATTTT GCCCCCTTCC
GGTCGGGAGC TGGGGGCTGA CATGGCATTC CAGATCCGGA ACGGAGGCAC CCGCGCCAAC
TATTACTGCT ATGCCGGCGC GGCCAGTGCA GGCTCCATCG GCGGCTCGCA GAAAAGGACG
GGTTCCGTTT CCATGGCGTA TGCCGCGGCA GAGGGGCGTG CCGTCTCGGC TTCCGCCAGC
GCGGCGGAAA TGAGGGTCAG CAACGCCAGG GGCAATACAG TCATCTACGG CGGTTCTTCC
GTTTCCGCTT TGGGTGCGGC CTCCGGCTAC CGGACCAAGC TGGGTTCTTC TTGGACAGCT
CAGGATTTTG CCAATTACCT GGACATCGTC CGAAGCGCGG ATGATGTCAT CCGCCAGGTC
TGGAACCTGT GGGACGGTCT GGCCAATATT GAAAACGTGA CGGATACGGG CTATGTGATC
GCCTTTTATC TGCCCGAGCA GGTGGGGGCC AAAAATGTCT CCACCGGTCT TTACGCCGTT
ACGGGGACGC CTTTTAAAAT CTTCACCATT GAGGGCAATA CGGAAACCGG CAAGCTCACC
GTCACGGAGC AGGCGGAAGG GCGCGCGCCT TACGTCACCC GCTACTGGCA GGGGACGGGA
GGGGCCTGGT GCATGTCCCA GGGGGAAGGG GAAGACTCTA TTTTCACGAT CCGTGAAAGG
CAGGAGGTTT CTTCGGGGAT TTGGAAACTC TTCACTACCG TGCAGCGCGG GGAAAACGGC
ACTCCTATTT CCCGGGTGTG CGAAACCTAT GAACAGGGCC GCAGCGGCAA CCTGTGCACC
AGCCGTATTG AGGCTTATGG AACCGACTAT GCCCGTGAAA CGACTTACGC TTATAACGCC
GTGGGCAAAC TGATCCGGGA GACGGCTCCC GACGGAAGCG AGAAGACCTG GTCCTACGAC
GCCTTCGGGC GTGAAACCGT CCGGATGGAA CCCTGGGCGG GCGGGGGAAG GAAGGGCACC
TACACCTATT ACCGCTGCTC CGACCATGCC GATCCGGATA TTGCGCACCA GTACGTGGTG
CTCACTATGA ACGCCGCACG GCTGGCGGAT ACGCATTACA CCTATACGGA AGCCAACCAT
GTGCGCCGTG TGGAAAAACG CACCACGGCG TTGGGTGCGG AGGGAGAACA GCTGGAAGTG
ACGGAAACGT GGCTGCCGGC GGCGCCCAAC GAATACGCCC GGGGGAGGCT GAAGATGAAG
CAGTCCGCCA GCGGCGTGCA GACGGTCTAT GGCTATGAAG CGGCCAGCCA GTACGGCGCC
CTCTACAGGG AGACCAGGGA AACGCAGATA GCGGGGCAGG CCGTGCCGGG GCACAGCACG
AGGAAAGTCA CGTACGTTTC CGTTCAGGGA AATAACACGC GCATTGAGAA ATACGCCTTG
CTGACGGATG GAACCTGGAC GCTGACGGAT ACGGCGGATT ACGAATACGA CAGAGAAAAC
CGGTGGATTA AGCGTACGCG CGGCAACGGC AGAGTGACGG AACGGGAGAT GATGTGCTGC
GGCCCCTTGT GGGAAAAGGA TGAAGACGGC ATCATGACCA CCTACTCCTA TAATACGGCG
CGCCAGCTGG TGGAAGTCAG CCGGTCGGAA GTCGCGGACG GAGAAACGGT CGTCACTCCT
GAAACCATCG TCAGCTACAA GCGGGACGCC TTCGGCAGAA TCCTGCAAAC GCGCCGGGAC
GTCGGCCCCA TGACGACGAC GGAAAGCAAG GTTTACGATC TGCTGGGGCA GCTGGTGCAG
GAAACGGATG TCTTAGGAAG GAGCAGTACG CGTGCCTACA GTGCGGACGG CCTGACGGAA
ACCGTCACCA CGCCGACGGG AGCGACGCTC GTCACTATCC GCCATGCGGA CGGAACCGTG
CTGGAACAAA GCGGCACGGG GCAGAGGCAT CTCCTCTGCC GTACGGAATA CTCCGCGGAA
GGCGTGGTTC GTTCCACGCT TCTTCCCCGG GCGGAGGGAG AACCGGAGCT GGTGGAACAA
ACCGTCACGG ACGGAAGGGG CAACATGGTG CGTGTCTCCC GGGCGAACGC CAACGGAGGC
CTTGTTCATG ACCGGCGCGT TTTTGATCTG AACAATAGGC TTTTGCGGCA GCAGGTGGAT
GGAATGGCTC CTTTGCTTTA TGACTATGAC CCGTTTGGCA ATATCGTCAA AACCACGCTC
AAGCTGGCCG AGAACCCCAC TCCCGCCAAC TCCCTCGTCA CGGAGTACGC CTATGCCCGC
CGGCAACGGG AAGACGGCGT GTACCGGGTA ACCACAGTCA CGCGCTGCAA CAGCCAGGGA
ACAACATATG CGGAAAGCAC GGCCGAGCTG GTTTCTTTCC TGTCCTCCTC GCTGGCCGGA
AAAAACATCT CCACCGATCC GCGGGGCAAT GAAACCCTGC AATGGACGGA ATACACGGCT
CCGGCCAGAC GGACGGTAAA AACGCAGTCC CCGGCTTCTT CCGTCATTGC GGAAACTGTC
GTCATAGACG GGTACACCGT ATCCCGGAAA GACCATGCGG GTGTTCTGAC CGCCTCTTCC
CGCGCTTATA CGGCCAGCGG CAGCACGGAG ACTTATACGG ACGCCCGTGG CAATGCCGCC
GTTACCGTCT TTGACATCGC CGGCCGTGAA ACAGCCAGGA CGGATGCAGC CGGCAATACG
ACCACCATCC AGTATGACCC GTCCACAGCT TCTCCCTCCT GCGTCACGGA TGCTTTGGGC
AACACGGCCT GCTACGCTTA TGACCCGCGG GGACGCAAAA CCGCCGAATA CGGTACGGCC
CTCCAGCCCT CCGTCTTTGC GTACGATGAT GCGGACAGGC TGGTATCCCT CATGACATTC
CGCGTTCCGG GGGAAACCAT CGCTGCCGAT CCGCGGGAAC GGACGGACGG GGACATGACG
ACGTGGGGCT ACGACGACGC CTCCGGCCTG ATGACGGCTA AAACCTATGC CGACGGCCAT
GGGGAAAGCT ACTCTTACGA TGACTGGAAC AGGCTGGCGG TTAAACGACA GGCCCGGACG
GTGGACGGGC AGGGAACGCC TCTGGCAACT TCTTATGCCT ATGATCCGCA GACGGGCAAC
CTGGTCTCCG TCATTCACAA TGATGCGACT CCTTCGCTCA ATTATGTCTA CAACCACCTG
AACCTGCTCA CTCAGGTTGC GGATGATTCC GGAACAAGGA TGTTGGCTTA CAACCAATAT
AATGAAGCGG AATCGGAAAC TACGGCAGGA CTGGCGGCAA GCGCGCTCAA CTATTTGCGC
GACGGTTTGG GGCGGCCTTC GGGCTACAGT CTGCATTATG GAGAGGGTAT TGTCCAGCAG
ACGGCCTGGG AATATGACGG CTGCGGACGT CTTTCTACGG TCTCGCTCAA TAACGGCGCC
GATCCCTTCG TCTATGGCTA CCACGCCGTC AACGGACTGC TGGAAACGCT CGACTACCCC
AATACCCTCC GGAGATGGTA CACCCGGGAA GAAAAACGGA ATCTGCTGAC CGGAATCGAC
TATCTGCGTC CCGGCAGCGC CAATTATCCG GCCAAAAACG ACTATGCCTA CGACGCGCTG
GGAAGGCCCA CGGAAAAGAA GGACTACTTC AATACCCCCG CTCCCGACCT GACGCACAGC
TACAGTTACA ACGGCCGCGG CGAACTGGCC GCCGACGCGA TGAGCCGGGG AGGAACGTAT
TCCTATGCGT ACGACAACAT CGGCAACCGC GTCACCTCTC GGGAAGGTTC GGGCGCGTCA
GCGGAGGCGT ACACGGCCAA TAATCTGAAC CAGTACACGG CCATCACCCG GGAGGAAGGA
GCGTCTTTTG CACCTGCCTA TGATGCCGAC GGCAACCAGA CGAAGATTCA AACGTCTACG
GGAGAATGGG AAGTCTCTTA TAATGCCCTG AACCAGGCGG CAAGGTTCAT TCAGGGGAAC
AGGCGGGTGG AGTGCCGCTA CGACTATCTG AACAGACGGA TTGAGAAAGC CGTCTATGAA
GGAGAGATCC TGATGTCGAA GAAACGGTTC ATCTATCACG GCTACCTGCA AATCGCGGAA
CTGGATGCCG CCGCGACGGA ATCAGCGATG CCCGTACTGC GAAAAACCTA TCTGTGGGAT
CCGCTGGAAC CGGCAGCCAC GCGCATCCTG GCCATGAGCC TCTTTGATGA GACGGGAACC
TGGGTGGAAA ACCTGTACTA CACGCACGAC CTGTTGAAAA ACACCACGGC GCTTTTCGGC
ATCAGAGCGG GACGCCGCGC CTTGTACGAA TACGGCCCGT ATGGGAATAT TCTCAGGATG
GAAGGGAATG CCGCAGAGGA CAATCCGTTC CGGTTTTCCA GCGAATACGC TGATGACGAA
CTGGGGCTGG TTTACTACAA TTACCGCTAT TATAATCCCC AAAATGGCAG GTGGATTAGT
AGAGATCCTA TTATAGAGAA ACAGAAAGAT AATGTTTATT CATATGCGTA TAACACACCT
TCTATTTTGA TTGATGTGCA GGGGCAATTC GCGTTTGCGA TTGCTCTATT TAATCCTATA
GGAGCGGCGG TGGTAGCGGC GGCGGCAGTA GGTGTAGCTG TTGCGGTAGT GGTGGTAGTT
GCCGAAAAAG TAATAGATGA AATAAGCTCG GATACTACAA AGAAAACAGT TCCAGAAACA
GTTCCAATAG CAATTCCTCA AAAACCTAGA TATGGAAATT GTTCAAAACA AAGACATAGT
GAATTAAATA AAGAGGTTGG TCGTAAGTGC AAAGGTTCAT CAATGCATTG TAAAAACAAA
AATATGTGTA AAAATGAAAT AGAAAGAAAT ATAAAAAGAT TTCAAGACTG TATTGATGCA
AGAACAAAAA TAAATAATGA ATGCTTCAAT GGTGGAGATA ATGCACATAA TGATGAAATT
GAGCGCGCTT TAGCGGGCAA AAAACGTTGC CAAGATAAAC TCAATCAATT ATTATGA
 
Protein sequence
MKEQSFDNYV NSLTGMSNSN SGAPSAIDEQ APTDAFERSW DWDFEVRELG PDETAECKVT 
MGADDLANLT VDGEERLDIG PRGQYGGGSY EPQTASFSIE PGMHRAHVDY SNISIPNANN
NIAKFTFDLK VEITNRQTGS SSSYVPPETE TEPVDNNDEG DDDPCGGSSS GSSSSPNSSS
SNPCPNGDNG GDEDEIDPDN PFSPDDCMDN RGGSPGASAV RSLVSSASAY GKFSSAGKRV
TAQTRKTSMV WRTSFGSFRG MEGVPYGMLE IVAYNFSSRL WTPAALQYLH PMASCILPPS
GRELGADMAF QIRNGGTRAN YYCYAGAASA GSIGGSQKRT GSVSMAYAAA EGRAVSASAS
AAEMRVSNAR GNTVIYGGSS VSALGAASGY RTKLGSSWTA QDFANYLDIV RSADDVIRQV
WNLWDGLANI ENVTDTGYVI AFYLPEQVGA KNVSTGLYAV TGTPFKIFTI EGNTETGKLT
VTEQAEGRAP YVTRYWQGTG GAWCMSQGEG EDSIFTIRER QEVSSGIWKL FTTVQRGENG
TPISRVCETY EQGRSGNLCT SRIEAYGTDY ARETTYAYNA VGKLIRETAP DGSEKTWSYD
AFGRETVRME PWAGGGRKGT YTYYRCSDHA DPDIAHQYVV LTMNAARLAD THYTYTEANH
VRRVEKRTTA LGAEGEQLEV TETWLPAAPN EYARGRLKMK QSASGVQTVY GYEAASQYGA
LYRETRETQI AGQAVPGHST RKVTYVSVQG NNTRIEKYAL LTDGTWTLTD TADYEYDREN
RWIKRTRGNG RVTEREMMCC GPLWEKDEDG IMTTYSYNTA RQLVEVSRSE VADGETVVTP
ETIVSYKRDA FGRILQTRRD VGPMTTTESK VYDLLGQLVQ ETDVLGRSST RAYSADGLTE
TVTTPTGATL VTIRHADGTV LEQSGTGQRH LLCRTEYSAE GVVRSTLLPR AEGEPELVEQ
TVTDGRGNMV RVSRANANGG LVHDRRVFDL NNRLLRQQVD GMAPLLYDYD PFGNIVKTTL
KLAENPTPAN SLVTEYAYAR RQREDGVYRV TTVTRCNSQG TTYAESTAEL VSFLSSSLAG
KNISTDPRGN ETLQWTEYTA PARRTVKTQS PASSVIAETV VIDGYTVSRK DHAGVLTASS
RAYTASGSTE TYTDARGNAA VTVFDIAGRE TARTDAAGNT TTIQYDPSTA SPSCVTDALG
NTACYAYDPR GRKTAEYGTA LQPSVFAYDD ADRLVSLMTF RVPGETIAAD PRERTDGDMT
TWGYDDASGL MTAKTYADGH GESYSYDDWN RLAVKRQART VDGQGTPLAT SYAYDPQTGN
LVSVIHNDAT PSLNYVYNHL NLLTQVADDS GTRMLAYNQY NEAESETTAG LAASALNYLR
DGLGRPSGYS LHYGEGIVQQ TAWEYDGCGR LSTVSLNNGA DPFVYGYHAV NGLLETLDYP
NTLRRWYTRE EKRNLLTGID YLRPGSANYP AKNDYAYDAL GRPTEKKDYF NTPAPDLTHS
YSYNGRGELA ADAMSRGGTY SYAYDNIGNR VTSREGSGAS AEAYTANNLN QYTAITREEG
ASFAPAYDAD GNQTKIQTST GEWEVSYNAL NQAARFIQGN RRVECRYDYL NRRIEKAVYE
GEILMSKKRF IYHGYLQIAE LDAAATESAM PVLRKTYLWD PLEPAATRIL AMSLFDETGT
WVENLYYTHD LLKNTTALFG IRAGRRALYE YGPYGNILRM EGNAAEDNPF RFSSEYADDE
LGLVYYNYRY YNPQNGRWIS RDPIIEKQKD NVYSYAYNTP SILIDVQGQF AFAIALFNPI
GAAVVAAAAV GVAVAVVVVV AEKVIDEISS DTTKKTVPET VPIAIPQKPR YGNCSKQRHS
ELNKEVGRKC KGSSMHCKNK NMCKNEIERN IKRFQDCIDA RTKINNECFN GGDNAHNDEI
ERALAGKKRC QDKLNQLL