Gene Amuc_1722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1722 
Symbol 
ID6273719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2094322 
End bp2099850 
Gene Length5529 bp 
Protein Length1842 aa 
Translation table11 
GC content60% 
IMG OID642613785 
Productouter membrane autotransporter barrel domain protein 
Protein accessionYP_001878321 
Protein GI187736209 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.0890877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTTAC GTCTACCGCT CAAGTTGTTG TCCGCCTTAT TGTCGTTTTA TGCCATATCC 
GGTTTTCATG TGTTCGCTGC TACGGAGTGG GACGGCAGCA GATATTCCGG TAATATCTAT
ACGTGGGTAG GGAGTTCGTC CAATGATTTT CATAACGGCG GCATTGTCCT GACCAATCCG
GATGGAACAT ACGGTCCTGT CAATACGAAT TGGGGTTCCG TTTCAGAGGA GGGGAGTATT
TGGAACCTTT TTGCCAATGA AACGGGACTG TCGGAATATA ACACATTGCG CTTTGTCGCA
TCAGGCACGG AAGTGGGCGA TGTCACCGTA TCGGCGACCA ATAAAAATCC ATCCTGCTCC
TTTACGCCAT TTACCATGGG GGGATTGATT GTGGAGGAAG GGGCATCAGG GTTCTCCCTT
GTCCCGAACA ATAATGCCCA GAGGACTCTT ATACTGGGGA GAGCGGGTAC GGAAGAGCCG
GTATTATTTA CGGTTCGTGA AGATTTTTCC CTGGGTTCTT CCTCCAATGC ATGGGGCCCG
GTTAATGTGA ATGCAGACTG GAATGTCCTT GTGGCAAATG GCAAGACGCT GAATGTTTAC
GGGGCACTGA ATTCTTCCGG CCGGACCATT ACGGTGGGCG GGGAAGGATT CAGCGGTACG
TTGGTTTTAA ACAATGCGGC CAATGCATCC ATGACTGCTG AATGGGTCAT CACCTCCGGC
TCTACTGTAA GGTGCATGAA TAATTCCGCT TTGGGAAGCG GCCTTGTTAC GCTGGATGGG
GGGACGCTTG ATTTTAACTC GCAAACGATC GGATCAACCA TTACGTTGAA CATGAGCGGG
ACCGGGACAT TGCAGAATGT GACAGTTGAT GGAGCAATTT TATCTTATAC GTCTTCATCG
GGAAATGATG GGATTACTGC TTCAAATACC ACGTGGACGT CCGGGAAGAT AGATATTGGG
CGTTTGTTGC AGGGATTTAG TGCCGAAGGT TCCAATACCG TTGATTTGGG AGCGGAGCTT
GGCGAGGGCT TTAGCGTACT GGGTTTAGAT ACAGGGCAAT ACAGAATAGA AGGCAGCGTG
CTGACGATAG AAGACGGGGC GATATCCAGC GTGACGTGGG TATCTGCGGG AGAAGGAGGG
GAATTGGAAG AAACGGTCAA GAACGCCTTT ACGCTGGCTC TTGGGGAGGG GAGTGCGGCC
AACGTCAGCC TGGGATATCT GAACGGGACC CTGACGACCA GCGGGGACAA AGTCTATCAA
ATCACCAACA CCGGCGGAAC CAAAATCAAC CTGTCCGGAG TATACAACAG CGGGGCGACC
CTGCCGTCCG GCAACCTCAA CTACCAGGGG GACATCTGGA TGGACATCAA CGGAGGAGCA
TTCGGCATCA TCGCCGGAGG TGTCACCAAT GAATGGGGCA CCAACCTCCA GACCAGCACG
CTGACTGGGG ACACCCATGT GCAGCTCTCT GGAAACGCCA CGGCGGAACA CGTCATCGGC
GGGAACAACA AGGGAGCAAG CACTACCCTG ACGGGAAACA CCAACGTTAC CGTCAAAGAT
AACGCCATCG TGGCGGGAGC CATCATCGGA GGAAGCACCT CCTCCCACAA TGCCGTCACC
ACCATCACAG GAAACACGAG CGTCCTGGTC ACCAACATCC AGCACAGCAA CAGCGCGACC
GTCAACCTGG GAGACTTCGG CAACGTCACA GCCCAAAACT TCATCACGGG CGGAAGCGCG
TGGACGGCCA ACCAGACTTC AGGAACGACC ATCCAGGGCA ACACCTCGGT AACGATTAAT
GTAGGCGATG CGGAACTCTC CGGCACGGAA GGGCACAACA ACTTCGTCAA AAACATCTAT
GGAGGCAGTT ACGCAAACAC GAAATCCGAG GGCAACGGAG CCGTACAAAA AGTGGAAGGA
AACTCAAGCG TCTCCATCAG CGGAAAAGAA GGAATCACCT TCACGGGAGA CATCATGGGC
GGCTCCTTCT GGAACTGGGG CAACGGCACG ACGCTGACCA CCAACGGAAA CACCAGCGTC
TCCATTGACG GAGGTTCCAC CTTCACGGGC AAAATCGTGG GCGGCTCCTG GAGAGGAAGC
ACGTGGACGG CGGAAGATCC CACAGCCCTG CCCGTCAGCA TTGGCGGCAA CATCACCGTC
ACCCTTGGAC AGGGCACCTA CCTGGGAGAC ATCTACGGGG CGGGCAACTG TGGCACGGTA
GGAGGCGAAG TCCACGTCTC GCTGACGGGA GGAAGCATCT TTGGGGAGGA AGGAGAAGAA
AGCGGCATCA CCATCGGCGG AAGCGCCGGA GCGGCGGTAG AAGGAAACAG GACGCTGGAA
CTCAAGGGAA CCTTTGGAAC CGGGGACTTC CAAAACGTCA CCTTCACCCG GTTTGATGAA
ATCAATATCG CGCAGGAAGA AACCTCGGCC ACCATCTACG CGCTGACCGA CAGCCCGGCC
CTGACCAAAA CGGGAGCGGG AACCCTGACA CTCGGGGCGG ACGCGGCAGG AGCGGAAACC
ATCCTTGACG GAACTACGGA AGGAATCACC ATCTCGGAAG GCAGCCTGAA CCTCTCCGGC
GCCGGCGGCA GCCATATGAA AGGAACGTGG AACATCGCCT CCGGCTCCCG TCTCACCGGA
GTCAGCGGAA CCGTAACCGT AGGAGAAGGA GGCCTGGACG GACTGACCAT CGCCCTGGGA
ACGGAAAACA TCGGCCAGGA CACGCAGGCG TCTGGCGCCG TCATCATCAA CGGCAGCGGA
ACGGGAAGCG ACCCGAACCT GTCCATAGAG GGGGAAGGGC TGACGCTGGA CCTGAGCAAC
GACGCCGTCG TCAACCTTCT GCTGGACCAC AAAACCGACG ACTCCTCCAG CTACCTGACC
CTGACCAGCG GAACGCTGAC CGTCGGCGAC CTGGGCGACA TCGCCTTCAC TACCGACCTT
CTGTCCAACT ACGGCATCCG GGTCACCGGA ACGGACGGAG GCAGCCTGGT GCTCAGCGGA
GCGGCCACCG GCCTCTACCG CGTGCAGGAG GAAGGAGGAA ACGCCCATGA AGTCAACTCC
TATCAAACGC TCTCCGGCTA CGCCGGCGTC GTCATCGGCG GCGGCCAGAC CCTGACGGTC
AACCTGGCCG GAGCCCCGGG CGAATCCGAC GGACAGGGAG CTAAAATCAA CAACCTGATG
GGAGCCACGG GCAGCAGCCT GGTCGTCAAC AACACCGGAG ACGGCACGGC CGTCGTCATC
CTCAACAACA AACAAATGAC GACGGGAGAA GACGACATCG ACCCGGCCGG CCAGGACACC
GTCATGGGCG GCAGCATCAC GGGAGGCAAC AACGTCGCCT TCATCAAGGA AGGAACCGGG
ACGCTGACGG TCGGAGGAAC GATGGACGTG GAAACCCTCG CCCTGCGTGA AGGAAACATC
GTCCTCAACG GCGCGTCAAA CACCCTGGAC ACCCTCACTC TGGAAGGAGG CGGCCTGACC
ATTAACGGCA ACGCCGAAGT CGGAACCATC ACCGGCACGG AAGCCGGAGG CTCCCTGACC
ATCCAGGGAA CTCTCAACCT GACCGGAACA GGCGAAATAA ACGACGGCTC CATCACCGGA
ACCGGCACCC TCCGCATCCA GGAAGGAGCG GAACTGGCCC TGGGCGGAGA AGCCCGGCTG
GACGGAACCT CAGTCACCGC CGACGGCACG CTGGCCCTCT CCGGAACGGA ATCCGGCGCC
ATCATCAGCC TCTCCGGCAG CGGCACCCTG TCCATGAACG GAGGCAGCCT CTCCATCTCC
TCGGCAACAA CCTCCTCCGG AACCTTCTCC GGCACGCTGG CAGGCAGCGG CACCCTGGAC
ATCTCCGGGC AAGCCACCCA ATACCTGCAA ACCGGCAATA AGGACTACGA CCTCGCCGTC
CGTGACGGAG GCGTCCTGGT CCTGAAAGGC ACCGCGGACG CCCCCACGCT GAACTACAAC
AGCATCACCG CCGGAAACAA CGGCACCCTG CGTATTGAAG CCACCGGAGA CGCCCAGGGC
AGCGCCAACA CCACGCTCAA CGTGGAAAGC ATCACCTTCC AGAACGGCTC CACGACGGAA
CTGATCTACA ACTTCAACCA GGACGCCCCC TTTGGCGCCC CCATGCTGAC GGCGGATACC
ATCACCGTGC AGGACGGAGC CGGCTTCCTC CTCTCCAACA TGGAAGGAAA CGCCGCCATG
AACGCGGGGA GCGACCTCCA TGACGTCATC CTGATGAGCG CTACCGGCAG CATCAGCGGC
CTGGAAGACG GGCAAAGCCT CGCCGCCCGG ATATCCGGCC TCTTCGCCGT CTACTACCAG
GACGCCACGC TGAACCGCGA CGGAAACGAC ATCCTGCTCA ATGCCACACT CCGGCAGGAA
AACCTCTTCG CCTCCGCGGC CGACACGTGG AACTCCGCCG CCGGAGCCAG CCTGCTCTGG
GAAGCCCGCA AAAACCTGGA CCCCGACTCC CAGCTGGCCC AATTCATGAA CGGCGTCAGC
ACCATGATCA ATGACGGCAA CCTCTCCGGA GCCACCCGCG CTATGGCGGC GGCAGCTGGG
AGCACGGTCA ACGCGCTGGG GACGGCGCAG AGGGACGCCC TGCGCGACCA GATGGGCTGG
ATCAGGAACC GGACCACCCT CATGGGCGTC AACCCGGCCT ACGTCAACGA CGACCTCCCC
CGCTTCCACA TGTGGATGGA AGGCACGGGC TCCTACGCCA AACTGGACAC CCGCGGGGAT
GAAAGCGGCT ACCAGCTCAC CACCTGGGGA GGTACGGTAG GCGTGGACGC GGACCTCAGC
GACCGCCTCA CAGTGGGAGC GGCCTTCACG GCCAGCTACG GCGACCTGAC GGCCGGCGCG
GCGGACAGCG CCGACGGGCA CCTGGACAGC TACTACGCCA GCCTCTTCGG CCGCTACCAG
GACAGGCGCT GGGCGCACAC GCTCATCCTG ACGGGAGGGT GGAACGACGC GAAACTCAAC
CGTACGGTCA ACTACGGGGA AGGAAGCTAC GGGACGCAGG GAAGCACCAG CGGGTGGGGC
TTTGGAGCGA TGTATGAACT CACCTACGAC GTATACCTCA ACGAAAACCG CAGCAGCGTG
CTGCAGCCGC TGTTCAACGC CTCGGTGGTG ACGACGCGGA TGGACGGCTA TGAGGAAACG
GGTGCGGGCA ACGCGGGCCT GAACGTCGGC AGGCAGGACT GGACGACGGG GACGCTGGCG
CTGGGCGGCC GGTGGATGGG CCTGGTGGGC AGCAACATCT TCGGACGAGA AGCGCTGGCG
GAAATCCGAG TAAACGCGGC GCAGGACCTG GGAGACCGGA GAGGGGAAAC GAACGTCTCT
CTGCTGGGCA ACCCCGGCTT CGCGCAAAGC GTGAGGGGGG CGAAAGTGGG AACGACGGCG
CTGCAGCTGG GAGCCGGACT GAGCGTGCCG GTGGGAACGA AGGGAACCAT CTACGTGAAC
GGGAACGCGG ACATCCGTGA CGGGTCCAGC GCGCTGAACG GAAGCATCGG CTACCGCCAC
GACTTCTAA
 
Protein sequence
MKLRLPLKLL SALLSFYAIS GFHVFAATEW DGSRYSGNIY TWVGSSSNDF HNGGIVLTNP 
DGTYGPVNTN WGSVSEEGSI WNLFANETGL SEYNTLRFVA SGTEVGDVTV SATNKNPSCS
FTPFTMGGLI VEEGASGFSL VPNNNAQRTL ILGRAGTEEP VLFTVREDFS LGSSSNAWGP
VNVNADWNVL VANGKTLNVY GALNSSGRTI TVGGEGFSGT LVLNNAANAS MTAEWVITSG
STVRCMNNSA LGSGLVTLDG GTLDFNSQTI GSTITLNMSG TGTLQNVTVD GAILSYTSSS
GNDGITASNT TWTSGKIDIG RLLQGFSAEG SNTVDLGAEL GEGFSVLGLD TGQYRIEGSV
LTIEDGAISS VTWVSAGEGG ELEETVKNAF TLALGEGSAA NVSLGYLNGT LTTSGDKVYQ
ITNTGGTKIN LSGVYNSGAT LPSGNLNYQG DIWMDINGGA FGIIAGGVTN EWGTNLQTST
LTGDTHVQLS GNATAEHVIG GNNKGASTTL TGNTNVTVKD NAIVAGAIIG GSTSSHNAVT
TITGNTSVLV TNIQHSNSAT VNLGDFGNVT AQNFITGGSA WTANQTSGTT IQGNTSVTIN
VGDAELSGTE GHNNFVKNIY GGSYANTKSE GNGAVQKVEG NSSVSISGKE GITFTGDIMG
GSFWNWGNGT TLTTNGNTSV SIDGGSTFTG KIVGGSWRGS TWTAEDPTAL PVSIGGNITV
TLGQGTYLGD IYGAGNCGTV GGEVHVSLTG GSIFGEEGEE SGITIGGSAG AAVEGNRTLE
LKGTFGTGDF QNVTFTRFDE INIAQEETSA TIYALTDSPA LTKTGAGTLT LGADAAGAET
ILDGTTEGIT ISEGSLNLSG AGGSHMKGTW NIASGSRLTG VSGTVTVGEG GLDGLTIALG
TENIGQDTQA SGAVIINGSG TGSDPNLSIE GEGLTLDLSN DAVVNLLLDH KTDDSSSYLT
LTSGTLTVGD LGDIAFTTDL LSNYGIRVTG TDGGSLVLSG AATGLYRVQE EGGNAHEVNS
YQTLSGYAGV VIGGGQTLTV NLAGAPGESD GQGAKINNLM GATGSSLVVN NTGDGTAVVI
LNNKQMTTGE DDIDPAGQDT VMGGSITGGN NVAFIKEGTG TLTVGGTMDV ETLALREGNI
VLNGASNTLD TLTLEGGGLT INGNAEVGTI TGTEAGGSLT IQGTLNLTGT GEINDGSITG
TGTLRIQEGA ELALGGEARL DGTSVTADGT LALSGTESGA IISLSGSGTL SMNGGSLSIS
SATTSSGTFS GTLAGSGTLD ISGQATQYLQ TGNKDYDLAV RDGGVLVLKG TADAPTLNYN
SITAGNNGTL RIEATGDAQG SANTTLNVES ITFQNGSTTE LIYNFNQDAP FGAPMLTADT
ITVQDGAGFL LSNMEGNAAM NAGSDLHDVI LMSATGSISG LEDGQSLAAR ISGLFAVYYQ
DATLNRDGND ILLNATLRQE NLFASAADTW NSAAGASLLW EARKNLDPDS QLAQFMNGVS
TMINDGNLSG ATRAMAAAAG STVNALGTAQ RDALRDQMGW IRNRTTLMGV NPAYVNDDLP
RFHMWMEGTG SYAKLDTRGD ESGYQLTTWG GTVGVDADLS DRLTVGAAFT ASYGDLTAGA
ADSADGHLDS YYASLFGRYQ DRRWAHTLIL TGGWNDAKLN RTVNYGEGSY GTQGSTSGWG
FGAMYELTYD VYLNENRSSV LQPLFNASVV TTRMDGYEET GAGNAGLNVG RQDWTTGTLA
LGGRWMGLVG SNIFGREALA EIRVNAAQDL GDRRGETNVS LLGNPGFAQS VRGAKVGTTA
LQLGAGLSVP VGTKGTIYVN GNADIRDGSS ALNGSIGYRH DF