Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1039 |
Symbol | |
ID | 6274074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1232285 |
End bp | 1237882 |
Gene Length | 5598 bp |
Protein Length | 1865 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642613088 |
Product | outer membrane autotransporter barrel domain protein |
Protein accession | YP_001877646 |
Protein GI | 187735534 |
COG category | [S] Function unknown |
COG ID | [COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain |
TIGRFAM ID | [TIGR01414] outer membrane autotransporter barrel domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.429609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCTCC ATCTGCCTAT TACCCTGCTG GCGGCTGTTC TGGCTTGTTA CACCAGTGTT TCCCTGGCCG TCCCAACATC TGAATCCCCC GCATGGGGAG CGGACTCCAC CTTTAATAAC AATGAGCCCG CCAATGAGTA TTCAGTTACT GGGAGCCAAT CCGTCAACCT TGATGTAAAT TCAGGAAACA ACAATTATTC CACAGGACTT TATATCGGGG CAGGCTCCTC GTTTACCATT AATCAGAATG CGAACGGCGG CTGTACCATC AATTTGAACG GGGCATTTGC CGGAGAAGGA AATTTAACGC TGGTAGCGGC TAACGGCAAT GCCGGCTATG CCTCCAAGTT TGTGCTTGGT TCCCAAGAAA GCAGTTTTTC CGGGAACATT ATCCTTTCCC AGAAGGGGAC GCAGCCCGGC GGCGCCATTC TCCAGATTAC GGGAACGGCT CTCGCCAATG CAACGGTAGA TTTGTCAGGC TCAATCAATC AAAGTTCCAG CGCCCTGACG TTGCAAATTT CCAATGCCGC CTCTCTGGCC GGTTTAAATG ATGCCGACGG CTTCAGCGGA ACACATAAGG GGCGCGTCCA GTCGGCTAAT TCCTCCAGAG CGAATCTTAC CCTGACGGGT AACGGAAATT ACACGTATGG GGGAAGTATC GGGGCAACCA CGCAGCATTC CGGAGTAAAC GGCAACACTA CTCCCACGGG AGGAATAAAT CTGATTATGG CCGGAACAGG TACGCAAAAC CTGACGGGAA CCGTCATCAA CGCGAATATC ACAGCCCAAG GTGGCATCTT GAAAATCAAT AATTCTTCCT TGGCCTATTC CGGAATAATC ACCATGGCGG GAGGAACGCT GGATTTCACC TCCGCCACTC TGGGGGCAAA TGCCGTTCTG AACATGAACG GCACCGGCGT ATTGAAAAAC GCAGCCATTG ACGGAGCCAA GCTAACCTAT ACGGAATCCG GATCCTCCTT TACCAAGGAA AATGTAACTT TCACTTCAGG CACCATTGAC ATCGGCGGCG CGCTTGACAG CCTGGTGGAG GGAGAACAGG GATACACGTT TGATTTAGGA AGCAACCTCG GCGCGAATTT TACCGTTACG GGCTTAGATA CAGGGCAATA TAGAATAGAA GGCAGCGTGC TGACGATAGA AGACGGGGCG ATATCCAGCG TGACGTGGGT ATCTGCGGGA GAAGGAGGGG AATTGGAAGA AACGGTCAAG AACGCCTTTA CGCTGGCTCT TGGGGAGGGG AGTGCGGCCA ACGTCAGCCT GGGATATCTG AACGGGACCC TGACGACCAG CGGGGACAAA GTCTATCAAA TCACCAACAC CGGCGGAACC AAAATCAACC TGTCCGGAGT ATACAACAGC GGGGCGACCC TGCCGTCCGG CAACCTCAAC TACCAGGGGG ACATCTGGAT GGACATCAAC GGAGGAGCAT TCGGCATCAT CGCCGGAGGT GTCACCAATG AATGGGGCAC CAACCTCCAG ACCAGCACGC TGACTGGGGA CACCCATGTG CAGCTCTCTG GAAACGCCAC GGCGGAACAC GTCATCGGCG GGAACAACAA GGGAGCAAGC ACTACCCTGA CGGGAAACAC CAACGTTACC GTCAAAGATA ACGCCATCGT GGCGGGAGCC ATCATCGGAG GAAGCACCTC CTCCCACAAT GCCGTCACCA CCATCACAGG AAACACGAGC GTCCTGGTCA CCAACATCCA GCACAGCAAC AGCGCGACCG TCAACCTGGG AGACTTCGGC AACGTCACAG CCCAAAACTT CATCACGGGC GGAAGCGCGT GGACGGCCAA CCAGACTTCA GGAACGACCA TCCAGGGCAA CACCTCGGTA ACGATTAATG TAGGCGATGC GGAACTCTCC GGCACGGAAG GGCACAACAA CTTCGTCAAA AACATCTATG GAGGCAGTTA CGCAAACACG AAATCCGAGG GCAACGGAGC CGTACAAAAA GTGGAAGGAA ACTCAAGCGT CTCCATCAGC GGAAAAGAAG GAATCACCTT CACGGGAGAC ATCATGGGCG GCTCCTTCTG GAACTGGGGC AACGGCACGA CGCTGACCAC CAACGGAAAC ACCAGCGTCT CCATTGACGG AGGTTCCACC TTCACGGGCA AAATCGTGGG CGGCTCCTGG AGAGGAAGCA CGTGGACGGC GGAAGATCCC ACAGCCCTGC CCGTCAGCAT TGGCGGCAAC ATCACCGTCA CCCTTGGACA GGGCACCTAC CTGGGAGACA TCTACGGGGC GGGCAACTGT GGCACGGTAG GAGGCGAAGT CCACGTCTCG CTGACGGGAG GAAGCATCTT TGGGGAGGAA GGAGAAGAAA GCGGCATCAC CATCGGCGGA AGCGCCGGAG CGGCGGTAGA AGGAAACAGG ACGCTGGAAC TCAAGGGAAC CTTTGGAACC GGGGACTTCC AAAACGTCAC CTTCACCCGG TTTGATGAAA TCAATATCGC GCAGGAAGAA ACCTCGGCCA CCATCTACGC GCTGACCGAC AGCCCGGCCC TGACCAAAAC GGGAGCGGGA ACCCTGACAC TCGGGGCGGA CGCGGCAGGA GCGGAAACCA TCCTTGACGG AACTACGGAA GGAATCACCA TCTCGGAAGG CAGCCTGAAC CTCTCCGGCG CCGGCGGCAG CCATATGAAA GGAACGTGGA ACATCGCCTC CGGCTCCCGT CTCACCGGAG TCAGCGGAAC CGTAACCGTA GGAGAAGGAG GCCTGGACGG ACTGACCATC GCCCTGGGAA CGGAAAACAT CGGCCAGGAC ACGCAGGCGT CTGGCGCCGT CATCATCAAC GGCAGCGGAA CGGGAAGCGA CCCGAACCTG TCCATAGAGG GGGAAGGGCT GACGCTGGAC CTGAGCAACG ACGCCGTCGT CAACCTTCTG CTGGACCACA AAACCGACGA CTCCTCCAGC TACCTGACCC TGACCAGCGG AACGCTGACC GTCGGCGACC TGGGCGACAT CGCCTTCACT ACCGACCTTC TGTCCAACTA CGGCATCCGG GTCACCGGAA CGGACGGAGG CAGCCTGGTG CTCAGCGGAG CGGCCACCGG CCTCTACCGC GTGCAGGAGG AAGGAGGAAA CGCCCATGAA GTCAACTCCT ATCAAACGCT CTCCGGCTAC GCCGGCGTCG TCATCGGCGG CGGCCAGACC CTGACGGTCA ACCTGGCCGG AGCCCCGGGC GAATCCGACG GACAGGGAGC TAAAATCAAC AACCTGATGG GAGCCACGGG CAGCAGCCTG GTCGTCAACA ACACCGGAGA CGGCACGGCC GTCGTCATCC TCAACAACAA ACAAATGACG ACGGGAGAAG ACGACATCGA CCCGGCCGGC CAGGACACCG TCATGGGCGG CAGCATCACG GGAGGCAACA ACGTCGCCTT CATCAAGGAA GGAACCGGGA CGCTGACGGT CGGAGGAACG ATGGACGTGG AAACCCTCGC CCTGCGTGAA GGAAACATCG TCCTCAACGG CGCGTCAAAC ACCCTGGACA CCCTCACTCT GGAAGGAGGC GGCCTGACCA TTAACGGCAA CGCCGAAGTC GGAACCATCA CCGGCACGGA AGCCGGAGGC TCCCTGACCA TCCAGGGAAC TCTCAACCTG ACCGGAACAG GCGAAATAAA CGACGGCTCC ATCACCGGAA CCGGCACCCT CCGCATCCAG GAAGGAGCGG AACTGGCCCT GGGCGGAGAA GCCCGGCTGG ACGGAACCTC AGTCACCGCC GACGGCACGC TGGCCCTCTC CGGAACGGAA TCCGGCGCCA TCATCAGCCT CTCCGGCAGC GGCACCCTGT CCATGAACGG AGGCAGCCTC TCCATCTCCT CGGCAACAAC CTCCTCCGGA ACCTTCTCCG GCACGCTGGC AGGCAGCGGC ACCCTGGACA TCTCCGGGCA AGCCACCCAA TACCTGCAAA CCGGCAATAA GGACTACGAC CTCGCCGTCC GTGACGGAGG CGTCCTGGTC CTGAAAGGCA CCGCGGACGC CCCCACGCTG AACTACAACA GCATCACCGC CGGAAACAAC GGCACCCTGC GTATTGAAGC CACCGGAGAC GCCCAGGGCA GCGCCAACAC CACGCTCAAC GTGGAAAGCA TCACCTTCCA GAACGGCTCC ACGACGGAAC TGATCTACAA CTTCAACCAG GACGCCCCCT TTGGCGCCCC CATGCTGACG GCGGATACCA TCACCGTGCA GGACGGAGCC GGCTTCCTCC TCTCCAACAT GGAAGGAAAC GCCGCCATGA ACGCGGGGAG CGACCTCCAT GACGTCATCC TGATGAGCGC TACCGGCAGC ATCAGCGGCC TGGAAGACGG GCAAAGCCTC GCCGCCCGGA TATCCGGCCT CTTCGCCGTC TACTACCAGG ACGCCACGCT GAACCGCGAC GGAAACGACA TCCTGCTCAA TGCCACACTC CGGCAGGAAA ACCTCTTCGC CTCCGCGGCC GACACGTGGA ACTCCGCCGC CGGAGCCAGC CTGCTCTGGG AAGCCCGCAA AAACCTGGAC CCCGACTCCC AGCTGGCCCA ATTCATGAAC GGCGTCAGCA CCATGATCAA TGACGGCAAC CTCTCCGGAG CCACCCGCGC TATGGCGGCG GCAGCTGGGA GCACGGTCAA CGCGCTGGGG ACGGCGCAGA GGGACGCCCT GCGCGACCAG ATGGGCTGGA TCAGGAACCG GACCACCCTC ATGGGCGTCA ACCCGGCCTA CGTCAACGAC GACCTCCCCC GCTTCCACAT GTGGATGGAA GGCACGGGCT CCTACGCCAA ACTGGACACC CGCGGGGATG AAAGCGGCTA CCAGCTCACC ACCTGGGGAG GTACGGTAGG CGTGGACGCG GACCTCAGCG ACCGCCTCAC AGTGGGAGCG GCCTTCACGG CCAGCTACGG CGACCTGACG GCCGGCGCGG CGGACAGCGC CGACGGGCAC CTGGACAGCT ACTACGCCAG CCTCTTCGGC CGCTACCAGG ACAGGCGCTG GGCGCACACG CTCATCCTGA CGGGAGGGTG GAACGACGCG AAACTCAACC GTACGGTCAA CTACGGGGAA GGAAGCTACG GGACGCAGGG AAGCACCAGC GGGTGGGGCT TTGGAGCGAT GTATGAACTC ACCTACGACG TATACCTCAA CGAAAACCGC AGCAGCGTGC TGCAGCCGCT GTTCAACGCC TCGGTGGTGA CGACGCGGAT GGACGGCTAT GAGGAAACGG GTGCGGGCAA CGCGGGCCTG AACGTCGGCA GGCAGGACTG GACGACGGGG ACGCTGGCGC TGGGCGGCCG GTGGATGGGC CTGGTGGGCA GCAACATCTT CGGACGAGAA GCGCTGGCGG AAATCCGAGT AAACGCGGCG CAGGACCTGG GAGACCGGAG AGGGGAAACG AACGTCTCTC TGCTGGGCAA CCCCGGCTTC GCGCAAAGCG TGAGGGGGGC GAAAGTGGGA ACGACGGCGC TGCAGCTGGG AGCCGGACTG AGCGTGCCGG TGGGAACGAA GGGAACCATC TACGTGAACG GGAACGCGGA CATCCGTGAC GGGTCCAGCG CGCTGAACGG AAGCATCGGC TACCGCCACG ACTTCTAA
|
Protein sequence | MRLHLPITLL AAVLACYTSV SLAVPTSESP AWGADSTFNN NEPANEYSVT GSQSVNLDVN SGNNNYSTGL YIGAGSSFTI NQNANGGCTI NLNGAFAGEG NLTLVAANGN AGYASKFVLG SQESSFSGNI ILSQKGTQPG GAILQITGTA LANATVDLSG SINQSSSALT LQISNAASLA GLNDADGFSG THKGRVQSAN SSRANLTLTG NGNYTYGGSI GATTQHSGVN GNTTPTGGIN LIMAGTGTQN LTGTVINANI TAQGGILKIN NSSLAYSGII TMAGGTLDFT SATLGANAVL NMNGTGVLKN AAIDGAKLTY TESGSSFTKE NVTFTSGTID IGGALDSLVE GEQGYTFDLG SNLGANFTVT GLDTGQYRIE GSVLTIEDGA ISSVTWVSAG EGGELEETVK NAFTLALGEG SAANVSLGYL NGTLTTSGDK VYQITNTGGT KINLSGVYNS GATLPSGNLN YQGDIWMDIN GGAFGIIAGG VTNEWGTNLQ TSTLTGDTHV QLSGNATAEH VIGGNNKGAS TTLTGNTNVT VKDNAIVAGA IIGGSTSSHN AVTTITGNTS VLVTNIQHSN SATVNLGDFG NVTAQNFITG GSAWTANQTS GTTIQGNTSV TINVGDAELS GTEGHNNFVK NIYGGSYANT KSEGNGAVQK VEGNSSVSIS GKEGITFTGD IMGGSFWNWG NGTTLTTNGN TSVSIDGGST FTGKIVGGSW RGSTWTAEDP TALPVSIGGN ITVTLGQGTY LGDIYGAGNC GTVGGEVHVS LTGGSIFGEE GEESGITIGG SAGAAVEGNR TLELKGTFGT GDFQNVTFTR FDEINIAQEE TSATIYALTD SPALTKTGAG TLTLGADAAG AETILDGTTE GITISEGSLN LSGAGGSHMK GTWNIASGSR LTGVSGTVTV GEGGLDGLTI ALGTENIGQD TQASGAVIIN GSGTGSDPNL SIEGEGLTLD LSNDAVVNLL LDHKTDDSSS YLTLTSGTLT VGDLGDIAFT TDLLSNYGIR VTGTDGGSLV LSGAATGLYR VQEEGGNAHE VNSYQTLSGY AGVVIGGGQT LTVNLAGAPG ESDGQGAKIN NLMGATGSSL VVNNTGDGTA VVILNNKQMT TGEDDIDPAG QDTVMGGSIT GGNNVAFIKE GTGTLTVGGT MDVETLALRE GNIVLNGASN TLDTLTLEGG GLTINGNAEV GTITGTEAGG SLTIQGTLNL TGTGEINDGS ITGTGTLRIQ EGAELALGGE ARLDGTSVTA DGTLALSGTE SGAIISLSGS GTLSMNGGSL SISSATTSSG TFSGTLAGSG TLDISGQATQ YLQTGNKDYD LAVRDGGVLV LKGTADAPTL NYNSITAGNN GTLRIEATGD AQGSANTTLN VESITFQNGS TTELIYNFNQ DAPFGAPMLT ADTITVQDGA GFLLSNMEGN AAMNAGSDLH DVILMSATGS ISGLEDGQSL AARISGLFAV YYQDATLNRD GNDILLNATL RQENLFASAA DTWNSAAGAS LLWEARKNLD PDSQLAQFMN GVSTMINDGN LSGATRAMAA AAGSTVNALG TAQRDALRDQ MGWIRNRTTL MGVNPAYVND DLPRFHMWME GTGSYAKLDT RGDESGYQLT TWGGTVGVDA DLSDRLTVGA AFTASYGDLT AGAADSADGH LDSYYASLFG RYQDRRWAHT LILTGGWNDA KLNRTVNYGE GSYGTQGSTS GWGFGAMYEL TYDVYLNENR SSVLQPLFNA SVVTTRMDGY EETGAGNAGL NVGRQDWTTG TLALGGRWMG LVGSNIFGRE ALAEIRVNAA QDLGDRRGET NVSLLGNPGF AQSVRGAKVG TTALQLGAGL SVPVGTKGTI YVNGNADIRD GSSALNGSIG YRHDF
|
| |