Gene Amuc_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1039 
Symbol 
ID6274074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1232285 
End bp1237882 
Gene Length5598 bp 
Protein Length1865 aa 
Translation table11 
GC content60% 
IMG OID642613088 
Productouter membrane autotransporter barrel domain protein 
Protein accessionYP_001877646 
Protein GI187735534 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.429609 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCTCC ATCTGCCTAT TACCCTGCTG GCGGCTGTTC TGGCTTGTTA CACCAGTGTT 
TCCCTGGCCG TCCCAACATC TGAATCCCCC GCATGGGGAG CGGACTCCAC CTTTAATAAC
AATGAGCCCG CCAATGAGTA TTCAGTTACT GGGAGCCAAT CCGTCAACCT TGATGTAAAT
TCAGGAAACA ACAATTATTC CACAGGACTT TATATCGGGG CAGGCTCCTC GTTTACCATT
AATCAGAATG CGAACGGCGG CTGTACCATC AATTTGAACG GGGCATTTGC CGGAGAAGGA
AATTTAACGC TGGTAGCGGC TAACGGCAAT GCCGGCTATG CCTCCAAGTT TGTGCTTGGT
TCCCAAGAAA GCAGTTTTTC CGGGAACATT ATCCTTTCCC AGAAGGGGAC GCAGCCCGGC
GGCGCCATTC TCCAGATTAC GGGAACGGCT CTCGCCAATG CAACGGTAGA TTTGTCAGGC
TCAATCAATC AAAGTTCCAG CGCCCTGACG TTGCAAATTT CCAATGCCGC CTCTCTGGCC
GGTTTAAATG ATGCCGACGG CTTCAGCGGA ACACATAAGG GGCGCGTCCA GTCGGCTAAT
TCCTCCAGAG CGAATCTTAC CCTGACGGGT AACGGAAATT ACACGTATGG GGGAAGTATC
GGGGCAACCA CGCAGCATTC CGGAGTAAAC GGCAACACTA CTCCCACGGG AGGAATAAAT
CTGATTATGG CCGGAACAGG TACGCAAAAC CTGACGGGAA CCGTCATCAA CGCGAATATC
ACAGCCCAAG GTGGCATCTT GAAAATCAAT AATTCTTCCT TGGCCTATTC CGGAATAATC
ACCATGGCGG GAGGAACGCT GGATTTCACC TCCGCCACTC TGGGGGCAAA TGCCGTTCTG
AACATGAACG GCACCGGCGT ATTGAAAAAC GCAGCCATTG ACGGAGCCAA GCTAACCTAT
ACGGAATCCG GATCCTCCTT TACCAAGGAA AATGTAACTT TCACTTCAGG CACCATTGAC
ATCGGCGGCG CGCTTGACAG CCTGGTGGAG GGAGAACAGG GATACACGTT TGATTTAGGA
AGCAACCTCG GCGCGAATTT TACCGTTACG GGCTTAGATA CAGGGCAATA TAGAATAGAA
GGCAGCGTGC TGACGATAGA AGACGGGGCG ATATCCAGCG TGACGTGGGT ATCTGCGGGA
GAAGGAGGGG AATTGGAAGA AACGGTCAAG AACGCCTTTA CGCTGGCTCT TGGGGAGGGG
AGTGCGGCCA ACGTCAGCCT GGGATATCTG AACGGGACCC TGACGACCAG CGGGGACAAA
GTCTATCAAA TCACCAACAC CGGCGGAACC AAAATCAACC TGTCCGGAGT ATACAACAGC
GGGGCGACCC TGCCGTCCGG CAACCTCAAC TACCAGGGGG ACATCTGGAT GGACATCAAC
GGAGGAGCAT TCGGCATCAT CGCCGGAGGT GTCACCAATG AATGGGGCAC CAACCTCCAG
ACCAGCACGC TGACTGGGGA CACCCATGTG CAGCTCTCTG GAAACGCCAC GGCGGAACAC
GTCATCGGCG GGAACAACAA GGGAGCAAGC ACTACCCTGA CGGGAAACAC CAACGTTACC
GTCAAAGATA ACGCCATCGT GGCGGGAGCC ATCATCGGAG GAAGCACCTC CTCCCACAAT
GCCGTCACCA CCATCACAGG AAACACGAGC GTCCTGGTCA CCAACATCCA GCACAGCAAC
AGCGCGACCG TCAACCTGGG AGACTTCGGC AACGTCACAG CCCAAAACTT CATCACGGGC
GGAAGCGCGT GGACGGCCAA CCAGACTTCA GGAACGACCA TCCAGGGCAA CACCTCGGTA
ACGATTAATG TAGGCGATGC GGAACTCTCC GGCACGGAAG GGCACAACAA CTTCGTCAAA
AACATCTATG GAGGCAGTTA CGCAAACACG AAATCCGAGG GCAACGGAGC CGTACAAAAA
GTGGAAGGAA ACTCAAGCGT CTCCATCAGC GGAAAAGAAG GAATCACCTT CACGGGAGAC
ATCATGGGCG GCTCCTTCTG GAACTGGGGC AACGGCACGA CGCTGACCAC CAACGGAAAC
ACCAGCGTCT CCATTGACGG AGGTTCCACC TTCACGGGCA AAATCGTGGG CGGCTCCTGG
AGAGGAAGCA CGTGGACGGC GGAAGATCCC ACAGCCCTGC CCGTCAGCAT TGGCGGCAAC
ATCACCGTCA CCCTTGGACA GGGCACCTAC CTGGGAGACA TCTACGGGGC GGGCAACTGT
GGCACGGTAG GAGGCGAAGT CCACGTCTCG CTGACGGGAG GAAGCATCTT TGGGGAGGAA
GGAGAAGAAA GCGGCATCAC CATCGGCGGA AGCGCCGGAG CGGCGGTAGA AGGAAACAGG
ACGCTGGAAC TCAAGGGAAC CTTTGGAACC GGGGACTTCC AAAACGTCAC CTTCACCCGG
TTTGATGAAA TCAATATCGC GCAGGAAGAA ACCTCGGCCA CCATCTACGC GCTGACCGAC
AGCCCGGCCC TGACCAAAAC GGGAGCGGGA ACCCTGACAC TCGGGGCGGA CGCGGCAGGA
GCGGAAACCA TCCTTGACGG AACTACGGAA GGAATCACCA TCTCGGAAGG CAGCCTGAAC
CTCTCCGGCG CCGGCGGCAG CCATATGAAA GGAACGTGGA ACATCGCCTC CGGCTCCCGT
CTCACCGGAG TCAGCGGAAC CGTAACCGTA GGAGAAGGAG GCCTGGACGG ACTGACCATC
GCCCTGGGAA CGGAAAACAT CGGCCAGGAC ACGCAGGCGT CTGGCGCCGT CATCATCAAC
GGCAGCGGAA CGGGAAGCGA CCCGAACCTG TCCATAGAGG GGGAAGGGCT GACGCTGGAC
CTGAGCAACG ACGCCGTCGT CAACCTTCTG CTGGACCACA AAACCGACGA CTCCTCCAGC
TACCTGACCC TGACCAGCGG AACGCTGACC GTCGGCGACC TGGGCGACAT CGCCTTCACT
ACCGACCTTC TGTCCAACTA CGGCATCCGG GTCACCGGAA CGGACGGAGG CAGCCTGGTG
CTCAGCGGAG CGGCCACCGG CCTCTACCGC GTGCAGGAGG AAGGAGGAAA CGCCCATGAA
GTCAACTCCT ATCAAACGCT CTCCGGCTAC GCCGGCGTCG TCATCGGCGG CGGCCAGACC
CTGACGGTCA ACCTGGCCGG AGCCCCGGGC GAATCCGACG GACAGGGAGC TAAAATCAAC
AACCTGATGG GAGCCACGGG CAGCAGCCTG GTCGTCAACA ACACCGGAGA CGGCACGGCC
GTCGTCATCC TCAACAACAA ACAAATGACG ACGGGAGAAG ACGACATCGA CCCGGCCGGC
CAGGACACCG TCATGGGCGG CAGCATCACG GGAGGCAACA ACGTCGCCTT CATCAAGGAA
GGAACCGGGA CGCTGACGGT CGGAGGAACG ATGGACGTGG AAACCCTCGC CCTGCGTGAA
GGAAACATCG TCCTCAACGG CGCGTCAAAC ACCCTGGACA CCCTCACTCT GGAAGGAGGC
GGCCTGACCA TTAACGGCAA CGCCGAAGTC GGAACCATCA CCGGCACGGA AGCCGGAGGC
TCCCTGACCA TCCAGGGAAC TCTCAACCTG ACCGGAACAG GCGAAATAAA CGACGGCTCC
ATCACCGGAA CCGGCACCCT CCGCATCCAG GAAGGAGCGG AACTGGCCCT GGGCGGAGAA
GCCCGGCTGG ACGGAACCTC AGTCACCGCC GACGGCACGC TGGCCCTCTC CGGAACGGAA
TCCGGCGCCA TCATCAGCCT CTCCGGCAGC GGCACCCTGT CCATGAACGG AGGCAGCCTC
TCCATCTCCT CGGCAACAAC CTCCTCCGGA ACCTTCTCCG GCACGCTGGC AGGCAGCGGC
ACCCTGGACA TCTCCGGGCA AGCCACCCAA TACCTGCAAA CCGGCAATAA GGACTACGAC
CTCGCCGTCC GTGACGGAGG CGTCCTGGTC CTGAAAGGCA CCGCGGACGC CCCCACGCTG
AACTACAACA GCATCACCGC CGGAAACAAC GGCACCCTGC GTATTGAAGC CACCGGAGAC
GCCCAGGGCA GCGCCAACAC CACGCTCAAC GTGGAAAGCA TCACCTTCCA GAACGGCTCC
ACGACGGAAC TGATCTACAA CTTCAACCAG GACGCCCCCT TTGGCGCCCC CATGCTGACG
GCGGATACCA TCACCGTGCA GGACGGAGCC GGCTTCCTCC TCTCCAACAT GGAAGGAAAC
GCCGCCATGA ACGCGGGGAG CGACCTCCAT GACGTCATCC TGATGAGCGC TACCGGCAGC
ATCAGCGGCC TGGAAGACGG GCAAAGCCTC GCCGCCCGGA TATCCGGCCT CTTCGCCGTC
TACTACCAGG ACGCCACGCT GAACCGCGAC GGAAACGACA TCCTGCTCAA TGCCACACTC
CGGCAGGAAA ACCTCTTCGC CTCCGCGGCC GACACGTGGA ACTCCGCCGC CGGAGCCAGC
CTGCTCTGGG AAGCCCGCAA AAACCTGGAC CCCGACTCCC AGCTGGCCCA ATTCATGAAC
GGCGTCAGCA CCATGATCAA TGACGGCAAC CTCTCCGGAG CCACCCGCGC TATGGCGGCG
GCAGCTGGGA GCACGGTCAA CGCGCTGGGG ACGGCGCAGA GGGACGCCCT GCGCGACCAG
ATGGGCTGGA TCAGGAACCG GACCACCCTC ATGGGCGTCA ACCCGGCCTA CGTCAACGAC
GACCTCCCCC GCTTCCACAT GTGGATGGAA GGCACGGGCT CCTACGCCAA ACTGGACACC
CGCGGGGATG AAAGCGGCTA CCAGCTCACC ACCTGGGGAG GTACGGTAGG CGTGGACGCG
GACCTCAGCG ACCGCCTCAC AGTGGGAGCG GCCTTCACGG CCAGCTACGG CGACCTGACG
GCCGGCGCGG CGGACAGCGC CGACGGGCAC CTGGACAGCT ACTACGCCAG CCTCTTCGGC
CGCTACCAGG ACAGGCGCTG GGCGCACACG CTCATCCTGA CGGGAGGGTG GAACGACGCG
AAACTCAACC GTACGGTCAA CTACGGGGAA GGAAGCTACG GGACGCAGGG AAGCACCAGC
GGGTGGGGCT TTGGAGCGAT GTATGAACTC ACCTACGACG TATACCTCAA CGAAAACCGC
AGCAGCGTGC TGCAGCCGCT GTTCAACGCC TCGGTGGTGA CGACGCGGAT GGACGGCTAT
GAGGAAACGG GTGCGGGCAA CGCGGGCCTG AACGTCGGCA GGCAGGACTG GACGACGGGG
ACGCTGGCGC TGGGCGGCCG GTGGATGGGC CTGGTGGGCA GCAACATCTT CGGACGAGAA
GCGCTGGCGG AAATCCGAGT AAACGCGGCG CAGGACCTGG GAGACCGGAG AGGGGAAACG
AACGTCTCTC TGCTGGGCAA CCCCGGCTTC GCGCAAAGCG TGAGGGGGGC GAAAGTGGGA
ACGACGGCGC TGCAGCTGGG AGCCGGACTG AGCGTGCCGG TGGGAACGAA GGGAACCATC
TACGTGAACG GGAACGCGGA CATCCGTGAC GGGTCCAGCG CGCTGAACGG AAGCATCGGC
TACCGCCACG ACTTCTAA
 
Protein sequence
MRLHLPITLL AAVLACYTSV SLAVPTSESP AWGADSTFNN NEPANEYSVT GSQSVNLDVN 
SGNNNYSTGL YIGAGSSFTI NQNANGGCTI NLNGAFAGEG NLTLVAANGN AGYASKFVLG
SQESSFSGNI ILSQKGTQPG GAILQITGTA LANATVDLSG SINQSSSALT LQISNAASLA
GLNDADGFSG THKGRVQSAN SSRANLTLTG NGNYTYGGSI GATTQHSGVN GNTTPTGGIN
LIMAGTGTQN LTGTVINANI TAQGGILKIN NSSLAYSGII TMAGGTLDFT SATLGANAVL
NMNGTGVLKN AAIDGAKLTY TESGSSFTKE NVTFTSGTID IGGALDSLVE GEQGYTFDLG
SNLGANFTVT GLDTGQYRIE GSVLTIEDGA ISSVTWVSAG EGGELEETVK NAFTLALGEG
SAANVSLGYL NGTLTTSGDK VYQITNTGGT KINLSGVYNS GATLPSGNLN YQGDIWMDIN
GGAFGIIAGG VTNEWGTNLQ TSTLTGDTHV QLSGNATAEH VIGGNNKGAS TTLTGNTNVT
VKDNAIVAGA IIGGSTSSHN AVTTITGNTS VLVTNIQHSN SATVNLGDFG NVTAQNFITG
GSAWTANQTS GTTIQGNTSV TINVGDAELS GTEGHNNFVK NIYGGSYANT KSEGNGAVQK
VEGNSSVSIS GKEGITFTGD IMGGSFWNWG NGTTLTTNGN TSVSIDGGST FTGKIVGGSW
RGSTWTAEDP TALPVSIGGN ITVTLGQGTY LGDIYGAGNC GTVGGEVHVS LTGGSIFGEE
GEESGITIGG SAGAAVEGNR TLELKGTFGT GDFQNVTFTR FDEINIAQEE TSATIYALTD
SPALTKTGAG TLTLGADAAG AETILDGTTE GITISEGSLN LSGAGGSHMK GTWNIASGSR
LTGVSGTVTV GEGGLDGLTI ALGTENIGQD TQASGAVIIN GSGTGSDPNL SIEGEGLTLD
LSNDAVVNLL LDHKTDDSSS YLTLTSGTLT VGDLGDIAFT TDLLSNYGIR VTGTDGGSLV
LSGAATGLYR VQEEGGNAHE VNSYQTLSGY AGVVIGGGQT LTVNLAGAPG ESDGQGAKIN
NLMGATGSSL VVNNTGDGTA VVILNNKQMT TGEDDIDPAG QDTVMGGSIT GGNNVAFIKE
GTGTLTVGGT MDVETLALRE GNIVLNGASN TLDTLTLEGG GLTINGNAEV GTITGTEAGG
SLTIQGTLNL TGTGEINDGS ITGTGTLRIQ EGAELALGGE ARLDGTSVTA DGTLALSGTE
SGAIISLSGS GTLSMNGGSL SISSATTSSG TFSGTLAGSG TLDISGQATQ YLQTGNKDYD
LAVRDGGVLV LKGTADAPTL NYNSITAGNN GTLRIEATGD AQGSANTTLN VESITFQNGS
TTELIYNFNQ DAPFGAPMLT ADTITVQDGA GFLLSNMEGN AAMNAGSDLH DVILMSATGS
ISGLEDGQSL AARISGLFAV YYQDATLNRD GNDILLNATL RQENLFASAA DTWNSAAGAS
LLWEARKNLD PDSQLAQFMN GVSTMINDGN LSGATRAMAA AAGSTVNALG TAQRDALRDQ
MGWIRNRTTL MGVNPAYVND DLPRFHMWME GTGSYAKLDT RGDESGYQLT TWGGTVGVDA
DLSDRLTVGA AFTASYGDLT AGAADSADGH LDSYYASLFG RYQDRRWAHT LILTGGWNDA
KLNRTVNYGE GSYGTQGSTS GWGFGAMYEL TYDVYLNENR SSVLQPLFNA SVVTTRMDGY
EETGAGNAGL NVGRQDWTTG TLALGGRWMG LVGSNIFGRE ALAEIRVNAA QDLGDRRGET
NVSLLGNPGF AQSVRGAKVG TTALQLGAGL SVPVGTKGTI YVNGNADIRD GSSALNGSIG
YRHDF