Gene Amuc_0036 details


Gene Information

Locus tag: Amuc_0036
Symbol:
ID: 6275170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 45594
End bp: 51467
Gene Length: 5874 bp
Protein Length: 1957 aa
Translation table: 11
GC content: 54%
IMG OID: 642612077
Product: YD repeat protein
Protein accession: YP_001876664
Protein GI: 187734552
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
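
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 45594
end_bp = 51467
gene_length_bp = 5874
protein_length_aa = 1957


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under those assumptions the reported gene length matches the coordinate span, and the reported protein length matches the number of coding codons.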


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.13873
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC GGCAGTTTCA GCATCATGTG GATTCTCTGC CTGATAATGC TCTCACAGTG 
TCCGGAGGTG GGCCGGCTCC GTTTGACAAG GAGGCCGGGG CGTTTAGCGG CAGCTGGGAC
TGGGGGATTA TTGTTGATCC GCTCGGTCCG GGGGAAACGG CAGAGTGCCG TTTAATGCTG
GGGGTAGATG ATAACGGTAC TCTGACCGTT GACGGTGAAG GAATTTTCAT TCCAGGGGAG
GGTAAATACC ATGGAGGCAC TTATAGGGAA AAAAGCTTGA GCTTTCCCAT TGAACCGGGA
CCGCACCGTG TACACATGAC TTATGAGAAC GTTGCTGTTC CTCCTGAATG GACTAATCTG
GCAATCCTTA ATTATTCCAT TGAAGTCATG GTCTCGGATG GCACCTCTTC ATCCTCCTAT
CAGCCGGATG AAGGTTATCC GGAACCTGTG AGTACGGAAG ATGAGGGAGA AGACGTGCCG
TGTGAATGCG GGGGGGACGG CAATGGCTCC CAAAGCAGTT CCAGTTCTCC CTGTCCGGAG
GAAGACAACG GAGAAGGGGA TAACGGTGAA GATGAGGCCC CCTGCACCTG TGAAAACAAT
GAGGGCGGCA GCAGCAGCTC TTCGGCGCAC GGCGTGCGCG GTTCCTCCAC CCTTTATGGT
TCGTCGGCTG CCGGCAGGCG CGTGGTCGCC CGGACGCTCA AGACGGAAAT GGTCTGGCGT
ACCAATTTTG GCTCTTTCCG GGGCATGACG GGGTTGCCGC AGGGGTTGCT GGACATAGTA
GGGTACACAT TCTCGTCTGA ACTCTGGTCC CCTCGTGCGC TCCATTACTG GCATCCGATG
ACTCATGAAA TCATTTCGGC GCCGCTATCG GGAATTGGAG CGAATACCGC TTTCCAAATC
AGGAGCGGCG GAACTGACAT TAATTACTAC TGCTACGCGG ATGGTGGCGG CAGCGGGGTT
GACAGTGTTG CCCCCATTGC GGGATCCGCC AAGCGCGGAG GGTCTGCGTG GTTCGGGAGG
AGGTCCGCTT CCTCAGCAAA GGCGGGAGAA TTCTGCCTCA GTATCCGTTC AACCGCGGGC
AATACTGTTA ACTATACCGC AGGTGTGGCC TCTTCCTTGA AATACTCCAC CGGCTACACC
GCTAAAAATG GCGCGTCCTA CACAAGAGAA GAATTTGACG GAAAGCTGGA TATCGTGCGG
GCTTCGGATG GAAGCATCCG GCAGATATGG AACCTGTGGG ATGGGCTTGC CAACATCGAA
AATGTTACGG AATCCGGATA TCGCATCGCC CTCTACCTCC CCGAACAGGT GGATGGGAAA
AACAGCTCCA CCGGCCTTTA TCCCGTTACG GGCGACCCAT TTAAAACATT CACGATCACC
GGGGACGCGG CAGCGGGCAG GCTTGCCGTC ACCGAACAAA CTGCCGGGCG CGCGCCCTTC
ACCACCCGCT ACTGGCAGGG AACTGACGGA GCATGGAACA TGTCTCAGGG GGAAGGAGAA
GATTCCATCT TCACTCTGAA GGAAAAGCAG ATTGTCTCTC CCGGAACGTG GAAACTCATC
ACTACCATCC AGAGAGGGGA AAACGGTATC CCTATCTCCC GTGTCTGTGA GACTTATGCC
GTCACCCGCA ATGGCAATTT GTGCACCAGC CGTATTGAAG GCTACGGAAC GGATTACGCC
CGGGAGACGA CCTACGAGTA CACAGGCATG GGAAAAATCG CGAAGGAAAC TGCTCCTGAC
GGAAGTGTCA AAACATGGGC GTACGACCGT TTTGGGCGTG AAATCGTGTC CAGTGTCCCC
TGGGCGGAAG CTGGAGATAA AGTCACTTAC ACCACTTATC GGGATCAGAC GCAGGCGGAT
CCGGATATTC TCACCCAATG GGTGACGTTG ACGGCTACTG CGGCGGAACT CTGGCGAACG
GATTATACTT ACATTGAGGA AAATCATGTG CGGCGCGTGG AAAAACGCAC CACCGCATTG
GGGGAAGAAA ATGTACGGCT GGAAGTGAAG GAATCCTGGC TCGGCACGGC TCCCAATGTT
TATGCCCGCG GACGCTTGAA AATGAAACAG GACATGGGCG GAATTCAGAC GCATTACGCT
TATGAAGAGA CGGACCAGTA CGGAGCCCTT TACAAGGTGA CTGCGGAAAC GCGGATAGCG
GGTGCGTGCG TGCCCGGCCT GAGTTCACGG AAGGTTACTT ACGTTTCCGG CCAGGGCAAT
AACATGCGTT ATGAGCAATA CGTACAGCTT GCCGATGGTG TTTGGTCAAT GACGGATGCG
TCTTCTTATG AATATGATGT TGAGAACCGT TGGGTAAAGC GCACCCGCGC CAATGGACGC
GTTTACGAGC GCATGATGAC CTGTTGCGGC CCTTTATGGG AAAAGGATGA GAATGGCGTG
ACGACTTCCT ATTCCTACAA TACGGCCCGC CAGCTGACAG AAACCATCCG TTCAGCCATT
GCAGACGGGG AAACCATGAT CACTCCGGAA ACCATCACGA GTTACACTCG TGATGCCCTC
GGTAAAATTA CCTCCCTGCG CAGGGATACA GGCCCAATGA CGACGGTGGA AAGCCGGGAA
TACGATTTGC TGGGGCGTCT GGTAAAAGAA ACCGATATTC TGGGGCGCTC CACCGTTCGC
TCCTATAGCG GGGATGGCTT GGTAGAAACT GTTACCACCC CCGCCGGCGC TACCCTCATC
ACCCGGAAAA GTGCTTCCGG AACAATATTG CGGCGTTACG GCACCGGGCA GCAGGATATC
CTCTATACTG TTGAGGTCAC GGAAGAAGGG ATCCGCACGA CGGAGGCTGT GCCGGGCGGG
GAGGGGGGCG ATCCCCACGT CGTGACGGGA AGCTCCACTG TCAACGGCTT TGGCGACCTT
GTACGGGTCG CTGCCGCCAA TACCTTGAAT GGGGAAAACG TGCGCACGCT TGCATACGAT
AACAAGGGGC GCCTCATCAG GGAACAGCTG GCGGATATGG CGCCCGCCCT CTACGCGTAT
GATGCTTTTG GGAACAGGAC CAGGGCAATC GTTGCCCTGG AGGATGACCC GACCATCCTG
AACTCCCGCA TTACTGACTA TGCTTATGTG ACCGAGAGCC GTGAAGACGG AGTCTATAGT
CTTGTTTCCA TAACAAGCTA TACTTCCTCA GGTACTCCTG TTCTTCAAAA GCGGGCGGCC
CTGCTCTCTG CCCTGAGCCC TGTCCTTGCC GGCAAAACCG TCATGACGGA TTCCCGCGGC
CATGATGCCG TGGAATGGAT GGAATATGGC GGCGGTTCTG TTCGCCTGCG GAAGAAAACC
GTGCCCGGCG TCGAGAGCGT CTTCCTCACG CGCATCGTGG ACGGATTCAC GACGGCGGTT
ACGGGATTTG ACGGTGCAAC TGTCCTTCGG CGGCGCACCT ATACGGAAAC GGGCATCACC
TACGCGGACA CAGATGTGCG CGGGAATGTC TTTACCTCCG TCTGCGACAT CGCCGGCCGC
ATCATCTCCG GCACGGATGC CGCGGGCAAC ACGACCACTT ACGCTTATGG CCAGCCTTTT
GATCTGCCTA CCTGCGTTAC TAATCCTCTT GGCAAAACGG CCTGTTCTTT CTATGACATC
CGGGGCCGCA AGGAAGCCGA ATGGGGGACA GCCGTACAGC CCGCCGTATA TGCTTATGAC
GCCGCCGGCC ATATGGTTAG CCTTTCCACT TTCCGTGTTC CCGGAGATGT CATTACCACT
GATCCCCGCC TCCGGACGGA TGGGGACATC ACCACATGGA CGTATGATAT CGCTACGGGG
CTGGTTATCC GCAAAACCTA TGCCGACGCC ACCCATGTGG ATACCGTTTA CGATATGCTC
AACCGCGTTG CCGCTACGAC GGACGCGCGA GGAACCGTTG CTTCGCGCTC CTACGCCCCC
CTTACTGGGG AGCTCGTTTC CATTACCTTT AATGACGACG GTTTTACCCC CTCCATCAGC
AGCCTCTACA ATCATCTGGG GCAGCTGACG CAGATTGACG ACGCTTCCGG AACGCGCATG
TTCACGTACA ACCAGTATAA CGAACAGGAA ACGGAAACGA CGGCTGGCCT TGCGGCAAGC
GTCTTAACCC TCCGCCGTGA CGGAGTGGGG AGGCCTGCGG GCTACTGTCT GGATTATGCG
GGCTCCCCCG CCTTGCAGAC GGCATGGGCC TATGACGCCT ACGGAAGGCT ATCCTCCGTT
TCGCTGAACG CCGTCGGGAA GCCCTTCACC TATGGCTACA ATGAGGAAAC CGGCCTGCTG
GACACTCTTG ATTACCCCAA CACCCTGAAA CGGTGGCGCA CTTTGGAAGA AAAGCGCGAC
CTCCCGGTGA AGATTGACTA CCTGCGCCCC GGCAGCGCCA ACTACCCGGC TAAAACCGAC
TACTCTTACG ACATACTGGG GCGCCCGGTC ACGAAGAAAG ACTACTTTAA CGCGCCTGCA
CCCGACCTGA CGCACACCTG CGCCTACGAT GACAGGAATG AACTTGTGAG CGATGCGATG
AGCCGGGGCG GCACATACAG CTACTCCTAC GATAACATAG GCAACCGCAA AACGTCCCTG
GAGGGAACGG ATTCCCTTCC CACTACATAC GTTGCCAACC GGGTCAATCA ATATACGGAT
ATCACCGAAG GTGAGGAGGC TCCTTTTGTG CCGAATTATG ACGCCGACGG CAACCAGACG
AAGCTCCGGA CCGCCACCGG GGAATGGGAA GCCTCCTACA ACGCGTTGAA TCAGGCGGTC
AGCTTCATAC AGGGGGACAG GCGTATTGAA TGCGTATACG ATTATCTGAA CAGGCGGGTT
GAAAAATCCG TCTATGAGGG AGAATCGCTT ATGTCAAGGA AACGGTTCAT CTATCACGGG
TACCTGCAAA TCGCGGAACT GGATGCCACG GAGGTTTTGG AGTCTGTGGC GCCCGTCCTG
CGTAAAACGT ACCTGTGGGA TCCGCAGGAA CCGGTGGCCA CGCGCATTCT GGCCATGGGC
GTCTTTGATG AAACGGGAGC CTACGTGGAA GATCTTTACT ACACGCATGA CGCATTGAAG
AACACGACGG CGCTCTTCGG CATCAAGGCG GGGCGCCGAG CCTTGTACGA ATACGGCCCG
TACGGCTCTG CCGTGAAGAT GGAAGGAAAT GCGGCGGAGT TGAATCCGTT CCGGTTCTCC
AGCGAGTATG CTGATGACGA GCTGGGGTTG GTTTACTACA ATTACCGCTA TTATAATCCC
CAAAATGGTA GGTGGATCAG CAGAGATCCT ATGACAGAAA AAGAGAGCTA TCTTTTGTAT
GGATATGTTA ATAATATGCC TACCTTATAT TCTGACGAAT TAGGATTAGC GCGTACAATA
ACTACAAATA AGGATGATTG TTCGATCAAT GTGAGCCTGA ATATCGTGAT ATATCCGAAA
GGTGGAGATT CAATTAATAA TATAGAAATG CGTACTACAG CACAGCGAAT AAAACAATCG
ATTGAAAGCA ATTGGAATGG ATATGAGAAA GGATGCTGTG TAGTTAATGT GACAGCGGAT
GTGTCTGTTC AGAGCAGGAA AAGCAGATGG CTTTATAGAT TTCTCAATAG TGATGAGAAT
AATATAGAAA TAACATCAGA TTCATCTCAT CGTTCATATG TTAACGGAGT TGGAGGCAGA
TATGGAGTAT GGGGTTCTCA GGCCGTCCCA TGGGTTTATG CGCATGAAGC CGGCCATCTT
ATGGGGCTTT CTGACGATTA TCAAGATGTC GCCAATTCAG ATGGCTCCTT AACTTCCGTT
CCTAACGCGG GACATGAAGG ACATATAATG GGGGAATATG GAGGAAAAGC CAATCAACAT
GAAATAGATG CTATTTTAAA AAATATCGAA TGTCCATGCG ATGAAAATCA ATAA
 
Protein sequence
MKKRQFQHHV DSLPDNALTV SGGGPAPFDK EAGAFSGSWD WGIIVDPLGP GETAECRLML 
GVDDNGTLTV DGEGIFIPGE GKYHGGTYRE KSLSFPIEPG PHRVHMTYEN VAVPPEWTNL
AILNYSIEVM VSDGTSSSSY QPDEGYPEPV STEDEGEDVP CECGGDGNGS QSSSSSPCPE
EDNGEGDNGE DEAPCTCENN EGGSSSSSAH GVRGSSTLYG SSAAGRRVVA RTLKTEMVWR
TNFGSFRGMT GLPQGLLDIV GYTFSSELWS PRALHYWHPM THEIISAPLS GIGANTAFQI
RSGGTDINYY CYADGGGSGV DSVAPIAGSA KRGGSAWFGR RSASSAKAGE FCLSIRSTAG
NTVNYTAGVA SSLKYSTGYT AKNGASYTRE EFDGKLDIVR ASDGSIRQIW NLWDGLANIE
NVTESGYRIA LYLPEQVDGK NSSTGLYPVT GDPFKTFTIT GDAAAGRLAV TEQTAGRAPF
TTRYWQGTDG AWNMSQGEGE DSIFTLKEKQ IVSPGTWKLI TTIQRGENGI PISRVCETYA
VTRNGNLCTS RIEGYGTDYA RETTYEYTGM GKIAKETAPD GSVKTWAYDR FGREIVSSVP
WAEAGDKVTY TTYRDQTQAD PDILTQWVTL TATAAELWRT DYTYIEENHV RRVEKRTTAL
GEENVRLEVK ESWLGTAPNV YARGRLKMKQ DMGGIQTHYA YEETDQYGAL YKVTAETRIA
GACVPGLSSR KVTYVSGQGN NMRYEQYVQL ADGVWSMTDA SSYEYDVENR WVKRTRANGR
VYERMMTCCG PLWEKDENGV TTSYSYNTAR QLTETIRSAI ADGETMITPE TITSYTRDAL
GKITSLRRDT GPMTTVESRE YDLLGRLVKE TDILGRSTVR SYSGDGLVET VTTPAGATLI
TRKSASGTIL RRYGTGQQDI LYTVEVTEEG IRTTEAVPGG EGGDPHVVTG SSTVNGFGDL
VRVAAANTLN GENVRTLAYD NKGRLIREQL ADMAPALYAY DAFGNRTRAI VALEDDPTIL
NSRITDYAYV TESREDGVYS LVSITSYTSS GTPVLQKRAA LLSALSPVLA GKTVMTDSRG
HDAVEWMEYG GGSVRLRKKT VPGVESVFLT RIVDGFTTAV TGFDGATVLR RRTYTETGIT
YADTDVRGNV FTSVCDIAGR IISGTDAAGN TTTYAYGQPF DLPTCVTNPL GKTACSFYDI
RGRKEAEWGT AVQPAVYAYD AAGHMVSLST FRVPGDVITT DPRLRTDGDI TTWTYDIATG
LVIRKTYADA THVDTVYDML NRVAATTDAR GTVASRSYAP LTGELVSITF NDDGFTPSIS
SLYNHLGQLT QIDDASGTRM FTYNQYNEQE TETTAGLAAS VLTLRRDGVG RPAGYCLDYA
GSPALQTAWA YDAYGRLSSV SLNAVGKPFT YGYNEETGLL DTLDYPNTLK RWRTLEEKRD
LPVKIDYLRP GSANYPAKTD YSYDILGRPV TKKDYFNAPA PDLTHTCAYD DRNELVSDAM
SRGGTYSYSY DNIGNRKTSL EGTDSLPTTY VANRVNQYTD ITEGEEAPFV PNYDADGNQT
KLRTATGEWE ASYNALNQAV SFIQGDRRIE CVYDYLNRRV EKSVYEGESL MSRKRFIYHG
YLQIAELDAT EVLESVAPVL RKTYLWDPQE PVATRILAMG VFDETGAYVE DLYYTHDALK
NTTALFGIKA GRRALYEYGP YGSAVKMEGN AAELNPFRFS SEYADDELGL VYYNYRYYNP
QNGRWISRDP MTEKESYLLY GYVNNMPTLY SDELGLARTI TTNKDDCSIN VSLNIVIYPK
GGDSINNIEM RTTAQRIKQS IESNWNGYEK GCCVVNVTAD VSVQSRKSRW LYRFLNSDEN
NIEITSDSSH RSYVNGVGGR YGVWGSQAVP WVYAHEAGHL MGLSDDYQDV ANSDGSLTSV
PNAGHEGHIM GEYGGKANQH EIDAILKNIE CPCDENQ
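
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the blocks might be cleaned and how a GC percentage could be computed for comparison with the GC content field. How the database rounds that field is not stated here, so an exact match should not be assumed.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGAAAAAAC GGCAGTTTCA GCATCATGTG GATTCTCTGC CTGATAATGC TCTCACAGTG"
seq = clean_sequence(first_line)
print(len(seq), seq[:3], round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of `first_line` would give a whole-gene figure to compare against the reported value.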