Gene Information |
Locus tag | Amuc_1722 |
Symbol | |
ID | 6273719 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2094322 |
End bp | 2099850 |
Gene Length | 5529 bp |
Protein Length | 1842 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642613785 |
Product | outer membrane autotransporter barrel domain protein |
Protein accession | YP_001878321 |
Protein GI | 187736209 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.0890877 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTAC GTCTACCGCT CAAGTTGTTG TCCGCCTTAT TGTCGTTTTA TGCCATATCC GGTTTTCATG TGTTCGCTGC TACGGAGTGG GACGGCAGCA GATATTCCGG TAATATCTAT ACGTGGGTAG GGAGTTCGTC CAATGATTTT CATAACGGCG GCATTGTCCT GACCAATCCG GATGGAACAT ACGGTCCTGT CAATACGAAT TGGGGTTCCG TTTCAGAGGA GGGGAGTATT TGGAACCTTT TTGCCAATGA AACGGGACTG TCGGAATATA ACACATTGCG CTTTGTCGCA TCAGGCACGG AAGTGGGCGA TGTCACCGTA TCGGCGACCA ATAAAAATCC ATCCTGCTCC TTTACGCCAT TTACCATGGG GGGATTGATT GTGGAGGAAG GGGCATCAGG GTTCTCCCTT GTCCCGAACA ATAATGCCCA GAGGACTCTT ATACTGGGGA GAGCGGGTAC GGAAGAGCCG GTATTATTTA CGGTTCGTGA AGATTTTTCC CTGGGTTCTT CCTCCAATGC ATGGGGCCCG GTTAATGTGA ATGCAGACTG GAATGTCCTT GTGGCAAATG GCAAGACGCT GAATGTTTAC GGGGCACTGA ATTCTTCCGG CCGGACCATT ACGGTGGGCG GGGAAGGATT CAGCGGTACG TTGGTTTTAA ACAATGCGGC CAATGCATCC ATGACTGCTG AATGGGTCAT CACCTCCGGC TCTACTGTAA GGTGCATGAA TAATTCCGCT TTGGGAAGCG GCCTTGTTAC GCTGGATGGG GGGACGCTTG ATTTTAACTC GCAAACGATC GGATCAACCA TTACGTTGAA CATGAGCGGG ACCGGGACAT TGCAGAATGT GACAGTTGAT GGAGCAATTT TATCTTATAC GTCTTCATCG GGAAATGATG GGATTACTGC TTCAAATACC ACGTGGACGT CCGGGAAGAT AGATATTGGG CGTTTGTTGC AGGGATTTAG TGCCGAAGGT TCCAATACCG TTGATTTGGG AGCGGAGCTT GGCGAGGGCT TTAGCGTACT GGGTTTAGAT ACAGGGCAAT ACAGAATAGA AGGCAGCGTG CTGACGATAG AAGACGGGGC GATATCCAGC GTGACGTGGG TATCTGCGGG AGAAGGAGGG GAATTGGAAG AAACGGTCAA GAACGCCTTT ACGCTGGCTC TTGGGGAGGG GAGTGCGGCC AACGTCAGCC TGGGATATCT GAACGGGACC CTGACGACCA GCGGGGACAA AGTCTATCAA ATCACCAACA CCGGCGGAAC CAAAATCAAC CTGTCCGGAG TATACAACAG CGGGGCGACC CTGCCGTCCG GCAACCTCAA CTACCAGGGG GACATCTGGA TGGACATCAA CGGAGGAGCA TTCGGCATCA TCGCCGGAGG TGTCACCAAT GAATGGGGCA CCAACCTCCA GACCAGCACG CTGACTGGGG ACACCCATGT GCAGCTCTCT GGAAACGCCA CGGCGGAACA CGTCATCGGC GGGAACAACA AGGGAGCAAG CACTACCCTG ACGGGAAACA CCAACGTTAC CGTCAAAGAT AACGCCATCG TGGCGGGAGC CATCATCGGA GGAAGCACCT CCTCCCACAA TGCCGTCACC ACCATCACAG GAAACACGAG CGTCCTGGTC ACCAACATCC AGCACAGCAA CAGCGCGACC GTCAACCTGG GAGACTTCGG CAACGTCACA GCCCAAAACT TCATCACGGG CGGAAGCGCG TGGACGGCCA ACCAGACTTC AGGAACGACC ATCCAGGGCA ACACCTCGGT AACGATTAAT 
GTAGGCGATG CGGAACTCTC CGGCACGGAA GGGCACAACA ACTTCGTCAA AAACATCTAT GGAGGCAGTT ACGCAAACAC GAAATCCGAG GGCAACGGAG CCGTACAAAA AGTGGAAGGA AACTCAAGCG TCTCCATCAG CGGAAAAGAA GGAATCACCT TCACGGGAGA CATCATGGGC GGCTCCTTCT GGAACTGGGG CAACGGCACG ACGCTGACCA CCAACGGAAA CACCAGCGTC TCCATTGACG GAGGTTCCAC CTTCACGGGC AAAATCGTGG GCGGCTCCTG GAGAGGAAGC ACGTGGACGG CGGAAGATCC CACAGCCCTG CCCGTCAGCA TTGGCGGCAA CATCACCGTC ACCCTTGGAC AGGGCACCTA CCTGGGAGAC ATCTACGGGG CGGGCAACTG TGGCACGGTA GGAGGCGAAG TCCACGTCTC GCTGACGGGA GGAAGCATCT TTGGGGAGGA AGGAGAAGAA AGCGGCATCA CCATCGGCGG AAGCGCCGGA GCGGCGGTAG AAGGAAACAG GACGCTGGAA CTCAAGGGAA CCTTTGGAAC CGGGGACTTC CAAAACGTCA CCTTCACCCG GTTTGATGAA ATCAATATCG CGCAGGAAGA AACCTCGGCC ACCATCTACG CGCTGACCGA CAGCCCGGCC CTGACCAAAA CGGGAGCGGG AACCCTGACA CTCGGGGCGG ACGCGGCAGG AGCGGAAACC ATCCTTGACG GAACTACGGA AGGAATCACC ATCTCGGAAG GCAGCCTGAA CCTCTCCGGC GCCGGCGGCA GCCATATGAA AGGAACGTGG AACATCGCCT CCGGCTCCCG TCTCACCGGA GTCAGCGGAA CCGTAACCGT AGGAGAAGGA GGCCTGGACG GACTGACCAT CGCCCTGGGA ACGGAAAACA TCGGCCAGGA CACGCAGGCG TCTGGCGCCG TCATCATCAA CGGCAGCGGA ACGGGAAGCG ACCCGAACCT GTCCATAGAG GGGGAAGGGC TGACGCTGGA CCTGAGCAAC GACGCCGTCG TCAACCTTCT GCTGGACCAC AAAACCGACG ACTCCTCCAG CTACCTGACC CTGACCAGCG GAACGCTGAC CGTCGGCGAC CTGGGCGACA TCGCCTTCAC TACCGACCTT CTGTCCAACT ACGGCATCCG GGTCACCGGA ACGGACGGAG GCAGCCTGGT GCTCAGCGGA GCGGCCACCG GCCTCTACCG CGTGCAGGAG GAAGGAGGAA ACGCCCATGA AGTCAACTCC TATCAAACGC TCTCCGGCTA CGCCGGCGTC GTCATCGGCG GCGGCCAGAC CCTGACGGTC AACCTGGCCG GAGCCCCGGG CGAATCCGAC GGACAGGGAG CTAAAATCAA CAACCTGATG GGAGCCACGG GCAGCAGCCT GGTCGTCAAC AACACCGGAG ACGGCACGGC CGTCGTCATC CTCAACAACA AACAAATGAC GACGGGAGAA GACGACATCG ACCCGGCCGG CCAGGACACC GTCATGGGCG GCAGCATCAC GGGAGGCAAC AACGTCGCCT TCATCAAGGA AGGAACCGGG ACGCTGACGG TCGGAGGAAC GATGGACGTG GAAACCCTCG CCCTGCGTGA AGGAAACATC GTCCTCAACG GCGCGTCAAA CACCCTGGAC ACCCTCACTC TGGAAGGAGG CGGCCTGACC ATTAACGGCA ACGCCGAAGT CGGAACCATC ACCGGCACGG AAGCCGGAGG CTCCCTGACC ATCCAGGGAA CTCTCAACCT GACCGGAACA GGCGAAATAA ACGACGGCTC CATCACCGGA ACCGGCACCC 
TCCGCATCCA GGAAGGAGCG GAACTGGCCC TGGGCGGAGA AGCCCGGCTG GACGGAACCT CAGTCACCGC CGACGGCACG CTGGCCCTCT CCGGAACGGA ATCCGGCGCC ATCATCAGCC TCTCCGGCAG CGGCACCCTG TCCATGAACG GAGGCAGCCT CTCCATCTCC TCGGCAACAA CCTCCTCCGG AACCTTCTCC GGCACGCTGG CAGGCAGCGG CACCCTGGAC ATCTCCGGGC AAGCCACCCA ATACCTGCAA ACCGGCAATA AGGACTACGA CCTCGCCGTC CGTGACGGAG GCGTCCTGGT CCTGAAAGGC ACCGCGGACG CCCCCACGCT GAACTACAAC AGCATCACCG CCGGAAACAA CGGCACCCTG CGTATTGAAG CCACCGGAGA CGCCCAGGGC AGCGCCAACA CCACGCTCAA CGTGGAAAGC ATCACCTTCC AGAACGGCTC CACGACGGAA CTGATCTACA ACTTCAACCA GGACGCCCCC TTTGGCGCCC CCATGCTGAC GGCGGATACC ATCACCGTGC AGGACGGAGC CGGCTTCCTC CTCTCCAACA TGGAAGGAAA CGCCGCCATG AACGCGGGGA GCGACCTCCA TGACGTCATC CTGATGAGCG CTACCGGCAG CATCAGCGGC CTGGAAGACG GGCAAAGCCT CGCCGCCCGG ATATCCGGCC TCTTCGCCGT CTACTACCAG GACGCCACGC TGAACCGCGA CGGAAACGAC ATCCTGCTCA ATGCCACACT CCGGCAGGAA AACCTCTTCG CCTCCGCGGC CGACACGTGG AACTCCGCCG CCGGAGCCAG CCTGCTCTGG GAAGCCCGCA AAAACCTGGA CCCCGACTCC CAGCTGGCCC AATTCATGAA CGGCGTCAGC ACCATGATCA ATGACGGCAA CCTCTCCGGA GCCACCCGCG CTATGGCGGC GGCAGCTGGG AGCACGGTCA ACGCGCTGGG GACGGCGCAG AGGGACGCCC TGCGCGACCA GATGGGCTGG ATCAGGAACC GGACCACCCT CATGGGCGTC AACCCGGCCT ACGTCAACGA CGACCTCCCC CGCTTCCACA TGTGGATGGA AGGCACGGGC TCCTACGCCA AACTGGACAC CCGCGGGGAT GAAAGCGGCT ACCAGCTCAC CACCTGGGGA GGTACGGTAG GCGTGGACGC GGACCTCAGC GACCGCCTCA CAGTGGGAGC GGCCTTCACG GCCAGCTACG GCGACCTGAC GGCCGGCGCG GCGGACAGCG CCGACGGGCA CCTGGACAGC TACTACGCCA GCCTCTTCGG CCGCTACCAG GACAGGCGCT GGGCGCACAC GCTCATCCTG ACGGGAGGGT GGAACGACGC GAAACTCAAC CGTACGGTCA ACTACGGGGA AGGAAGCTAC GGGACGCAGG GAAGCACCAG CGGGTGGGGC TTTGGAGCGA TGTATGAACT CACCTACGAC GTATACCTCA ACGAAAACCG CAGCAGCGTG CTGCAGCCGC TGTTCAACGC CTCGGTGGTG ACGACGCGGA TGGACGGCTA TGAGGAAACG GGTGCGGGCA ACGCGGGCCT GAACGTCGGC AGGCAGGACT GGACGACGGG GACGCTGGCG CTGGGCGGCC GGTGGATGGG CCTGGTGGGC AGCAACATCT TCGGACGAGA AGCGCTGGCG GAAATCCGAG TAAACGCGGC GCAGGACCTG GGAGACCGGA GAGGGGAAAC GAACGTCTCT CTGCTGGGCA ACCCCGGCTT CGCGCAAAGC GTGAGGGGGG CGAAAGTGGG AACGACGGCG CTGCAGCTGG GAGCCGGACT 
GAGCGTGCCG GTGGGAACGA AGGGAACCAT CTACGTGAAC GGGAACGCGG ACATCCGTGA CGGGTCCAGC GCGCTGAACG GAAGCATCGG CTACCGCCAC GACTTCTAA
|
Protein sequence | MKLRLPLKLL SALLSFYAIS GFHVFAATEW DGSRYSGNIY TWVGSSSNDF HNGGIVLTNP DGTYGPVNTN WGSVSEEGSI WNLFANETGL SEYNTLRFVA SGTEVGDVTV SATNKNPSCS FTPFTMGGLI VEEGASGFSL VPNNNAQRTL ILGRAGTEEP VLFTVREDFS LGSSSNAWGP VNVNADWNVL VANGKTLNVY GALNSSGRTI TVGGEGFSGT LVLNNAANAS MTAEWVITSG STVRCMNNSA LGSGLVTLDG GTLDFNSQTI GSTITLNMSG TGTLQNVTVD GAILSYTSSS GNDGITASNT TWTSGKIDIG RLLQGFSAEG SNTVDLGAEL GEGFSVLGLD TGQYRIEGSV LTIEDGAISS VTWVSAGEGG ELEETVKNAF TLALGEGSAA NVSLGYLNGT LTTSGDKVYQ ITNTGGTKIN LSGVYNSGAT LPSGNLNYQG DIWMDINGGA FGIIAGGVTN EWGTNLQTST LTGDTHVQLS GNATAEHVIG GNNKGASTTL TGNTNVTVKD NAIVAGAIIG GSTSSHNAVT TITGNTSVLV TNIQHSNSAT VNLGDFGNVT AQNFITGGSA WTANQTSGTT IQGNTSVTIN VGDAELSGTE GHNNFVKNIY GGSYANTKSE GNGAVQKVEG NSSVSISGKE GITFTGDIMG GSFWNWGNGT TLTTNGNTSV SIDGGSTFTG KIVGGSWRGS TWTAEDPTAL PVSIGGNITV TLGQGTYLGD IYGAGNCGTV GGEVHVSLTG GSIFGEEGEE SGITIGGSAG AAVEGNRTLE LKGTFGTGDF QNVTFTRFDE INIAQEETSA TIYALTDSPA LTKTGAGTLT LGADAAGAET ILDGTTEGIT ISEGSLNLSG AGGSHMKGTW NIASGSRLTG VSGTVTVGEG GLDGLTIALG TENIGQDTQA SGAVIINGSG TGSDPNLSIE GEGLTLDLSN DAVVNLLLDH KTDDSSSYLT LTSGTLTVGD LGDIAFTTDL LSNYGIRVTG TDGGSLVLSG AATGLYRVQE EGGNAHEVNS YQTLSGYAGV VIGGGQTLTV NLAGAPGESD GQGAKINNLM GATGSSLVVN NTGDGTAVVI LNNKQMTTGE DDIDPAGQDT VMGGSITGGN NVAFIKEGTG TLTVGGTMDV ETLALREGNI VLNGASNTLD TLTLEGGGLT INGNAEVGTI TGTEAGGSLT IQGTLNLTGT GEINDGSITG TGTLRIQEGA ELALGGEARL DGTSVTADGT LALSGTESGA IISLSGSGTL SMNGGSLSIS SATTSSGTFS GTLAGSGTLD ISGQATQYLQ TGNKDYDLAV RDGGVLVLKG TADAPTLNYN SITAGNNGTL RIEATGDAQG SANTTLNVES ITFQNGSTTE LIYNFNQDAP FGAPMLTADT ITVQDGAGFL LSNMEGNAAM NAGSDLHDVI LMSATGSISG LEDGQSLAAR ISGLFAVYYQ DATLNRDGND ILLNATLRQE NLFASAADTW NSAAGASLLW EARKNLDPDS QLAQFMNGVS TMINDGNLSG ATRAMAAAAG STVNALGTAQ RDALRDQMGW IRNRTTLMGV NPAYVNDDLP RFHMWMEGTG SYAKLDTRGD ESGYQLTTWG GTVGVDADLS DRLTVGAAFT ASYGDLTAGA ADSADGHLDS YYASLFGRYQ DRRWAHTLIL TGGWNDAKLN RTVNYGEGSY GTQGSTSGWG FGAMYELTYD VYLNENRSSV LQPLFNASVV TTRMDGYEET GAGNAGLNVG RQDWTTGTLA LGGRWMGLVG SNIFGREALA EIRVNAAQDL GDRRGETNVS LLGNPGFAQS VRGAKVGTTA 
LQLGAGLSVP VGTKGTIYVN GNADIRDGSS ALNGSIGYRH DF
|
| |
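The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a hedged illustration, not part of the source database. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon (the gene sequence above ends in TAA). The function names span_length and expected_protein_length are hypothetical helpers introduced only for this example.

```python
# Values copied from the record above.
start_bp = 2094322
end_bp = 2099850
gene_length_bp = 5529
protein_length_aa = 1842


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon contributes one residue.
    return cds_length_bp // 3 - 1


# Check that the reported fields agree with each other.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print("coordinates match gene length:", coords_ok)
print("gene length matches protein length:", protein_ok)
```

Under these assumptions both checks pass for Amuc_1722: 2099850 - 2094322 + 1 = 5529 bp, and 5529 / 3 - 1 = 1842 aa. The same check would not apply directly to a spliced gene or a pseudogene, which is why the record's "Is gene spliced" and "Is pseudo gene" fields matter here.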