Gene Information |
Locus tag | Amuc_1565 |
Symbol | |
ID | 6273658 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1883661 |
End bp | 1890038 |
Gene Length | 6378 bp |
Protein Length | 2125 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613625 |
Product | alpha-2-macroglobulin domain protein |
Protein accession | YP_001878167 |
Protein GI | 187736055 |
COG category | [R] General function prediction only |
COG ID | [COG2373] Large extracellular alpha-helical protein |
TIGRFAM ID | |
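The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS includes a terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Amuc_1565.
length = gene_length_bp(1883661, 1890038)
print(length)                     # 6378
print(protein_length_aa(length))  # 2125
```

Both results agree with the Gene Length (6378 bp) and Protein Length (2125 aa) fields, which supports the inclusive-coordinate assumption.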
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAACAGAT TCTTTTCCGC TCTTTTTCTG GCGGCAGGAT TGCTGGCCTC CGGAATGCCG TCCCCGGCGG ACGCATCCAC CCCTCCGCAA CAACATGCCC TTGCGGAGGG TAAAAATTCC CCTGTTTCCC CGCAGCAAAG GAAGCAAGCT GTTGAAAACC TTGAAAAGCT TCTGGGCAAA AGGCTGTTCA AGAAAGCGGC GGAACAAGCC CTGAAACAAC TCCATGAATA TGACGACCAA TACAGCGGGA CAGATCTGCT TCTTTATTAC CAGGCTCTTA CCCGTCTCAA TGCCCTGGAT GATTCCATCA ATCTGGATTC CATTCTGCAG GAACAGATGA AGCGCCACGG CGCCAATCCC CATTTCCTGA GGGACGCCGC ACTCCTGTAC CAGAGTGCCT GCCATACGTT CAAACTGGTG GACGGCGCCT ATATCCGCGG CGCCGGGCCA TGGGATGGAG AATATTCCGG AGAAGCAAGG GACCGTGTGG AAGCCCTGCG GTGCCTTGTC AAAGCCATGC AGCTTGCGGA GAAGGGCAAC AACATGAAAC TTCTGGGACA ACTGCGCTTC ATCACGGCAC AGGCGTTGCC GGTTAAAGGA AACCGCTACG CTCCCCTTTC CGCCTTCCAG TCTTACGCAG CCCTGGGGAA CCTGACCAAC CTGAAGGAAT TGCCGGATTA CGTCTCCAGA GAGGAAGCAT CCCCTTTTCG AAACGTCGCC ACCGTTCCCG TCATGGTCAA TGCCGGAACA GGAAAGCCGG AAGTTATTTT TTACCATGCT TCTTCCTCCT GGGAAACAGC TAAAAACGAC GGGGAACGCA TGCGCTGGCT GCTGGACGCC GCCATTCAGG CGAATCCGGA ACTGGCCAAC CAGGTCAATT ATTTCACCGC CTCCTGGTGC CGCCGGCTGT TTTCTTATGC CAATACGGCG CCGGACCAGG AATTCGTGTA CGGCCCCGGC AACGCGGGCA TGGTGGCGGG CATCAACCCG GCGGAATTAA AAACGGACCA GACCATCGTT AAAACCGACC GGACGGGCAA CGGTAAATTC CTGCTCACCA ACCTTCCCCC GGATTATAAT TTCATCCGGA TTGCTTCCGC CGTCCGAATA ACGCCGGAGC CGGATTACTA CGTTAACGCC GCCAATCTGG CAGCGGACGA GTTCCTGGCC CGCAACCAAA GGCCCGCTGC CGCGCAGTTT TTAGGAAAAA TCCTTCAAAC CTGGAACGCA CAATCCTGGA ACAAAAAGGA CAAGGAGGAA TTTCTGGACG TGGCGGACAA TCTGAAAAAA CGCATCGCCT CCATCACGGA ACCCAACGGA ACATTTGATA CGGATAAAAG AACGCTGCTG GCGGGAGAAC CGGTGACTGT TTCCTTCTCC TACCGCAACG CTTCCCGGGC CTGCGTAGCG GTCCGCCCCG TGGATATGAA ACGGTGGCAG GAAGAACGCA TGGACAAGGT CCAAACCTCC AAAACGCTGG GAAAAGCTTA CAAAGACAGG TATTCCAACC TGGGGAACCT GCTTTTCAGC CTGTTGCATG ACTCATCCTA TGCCCGCTAT CTGGGGGAGG AAATCAAGGG GGATGAGATA ACCCTGACTC CCGGGAACCG GCACCTGAAC CACATCGCTC ACATTCCCGT ACCCACCCGC AAACCCGGCT GGTACCTGCT CACCGTCACA CTGGAAAACG GCTACCGGTT CCATCGCTTT CTCACTCTGT CAGACATGGT GCTGGTACGC CGTTCCGTTC CGGAAGGCAA TCTCTGGTTC CTGGCGGACG CCAGAACGGG AATGCCCGTG 
GAAGGAGGCA GCCTGCGGCT CCTCCGCTAC CGGCAGGATA AAACGCTCCA AAAACGCCAG GTAAAGGGAA TCACGGACAA GGACGGAGCC ATGATAGAAA CAATCCCTCA CCGCCATTCC AGCGCCCCTC CCTACAACAT GTTCCTGGGC ATCGTTTCCA AGGGGGACAG TTATGTGCTG GCGGGAATAT CTGACTACGG CTGGGAATCC GGCAATTACC TCATGGATAC GGCCAACAAG GCCTCCGACT CCTATTCCTG CTTTTTCCTG GAAAGCCAGC CCGTGTACCG CCCCGGACAG ACGGCCCGTT TCAAAGGCGT CCTCTTCCGC CCGGACATCG CGAATCCCGG AACAAAAGAC TGCGCCGGGA AAAAACTGAC GCTGACCATT GCCTCTCCAA CAGGGGATGA GAGCATTTTC CCTCCAAAGA CCGTCACGAC GGATTCCACA GGCAGCTTTG AACTGGAACT GCTCATTCCC GCGGACGCCC CGCTGGGCCA GTATGGGGCC AAGCTGAAAC TGCATGACTC CGCAGATCCG GATTTCCGGC TGTATGCCCC CCTTTTCCGG ATGGAAGAGT ATAAAAAGCC GGAATTCTCC GTCAGAATGG ATGCCCCCGC CAAACCTATC CGGCTGGGAC AACCCATTCC CGTCGCCATT CAGGCGGATT ATTACGCGGG CGGCCCCGTA AACGGGGGGA AGGCAACCAT CACCATCACC CGGACGCTGG GGGCGAACAT CTGGACGCCT TACTGGAAAT GGAGCTGGCT GTACGACAGG TCTTTCAGCC CGTACTACAT TCCCTTTTTC CCGGATGCGC CTCTGACCGT CCTGGAAAAG ACGGTGCCGC TGGACAAGGA CGGAAAAGCA TCCGTAGAAT TACCCACGGC GCAGGATGCC CGGGATTTCC CCTCCAGCAA CGTCACCTAC CGCGTTTCCG TCAGCGTGAC GGACGCCTCC CTGCGGGAAG TTTCCGCTTC CGGACAAGTC ATTGCCACTT TCCGTCCGTT TAATATCTTC ACCACCCTGA ACCGCGGCTA TGCCCCCGCC GGAACGCCGG TCCAGGCATC CATTACCGCA GCCACGGCGG ATGGCGCCAA AATTGCCCAT GCCAGGGGAA CCTGCGTCCT GCAGCATATC CGTGCGGACG GCAGACGGGA AACGCTGGAA ACCTGGGATA TCGCCACCAG GAAAGATGGG GAAGCCTCCC TGTCCTTCCA GACCGGGGAA TCCGGCCTGT ATGCCCTTTC CACAACCCTG GAGGACGGGC ACGGCAACAA GGTGGAAGAA TCCTTCCAGT TTCTTTCCTA CGGCAAGGGC AAACAGAATC CTTTTAAAAT CAATCCTCTT TCCATCCATC CGGACAAAAA GGAATACGCT CCGGGAGATA CGGCCAGGCT GCTCGTCACC TCTGACTATC CGGACGCCCG CGTCTGGACC TTCCTGCGCA ACTCCTGGAA AAACGAATCC CGCCGTCTGG TTTCCCTGGA CCGGCAGACA GCCCTTGTGG AATGCCGCCT GACCCGGGAA GATATGCCCA ATATGGGAGT CAACGCCTTC ACCGTCAGAA ACGGAGAACT TCATGAGGCT TCCGCCGAAC TTCTGATTCC GCCCGCCGGA CAGCTTCTGG CTCCTTCCGT AGTGCCGGGC AAATCCCAAT ACCGGCCCGG TGAACAAGGC AACGTCACCA TTCAGGTCAA GGGGCCTGAC GGCAAACCCG TCAGTAACGG CATCGTCGCC CTGGCCGTTT ATGACAAGGC TCTGGAATAC ATCGCACGCC CCAATATTAC GGACATCTCC AAAACCGTAT 
GGGGCAGACT GAACGAAACT GGCTTCCTCA GCCTGAAAAA AATGACGGCT TCCGGCACGC AGCAGGATCG CGGTCCGGGA CAGCCCTCTT TCCAATCCCT GCTCTACAGA AATTACGGAC CTATGGCGAG AAAGGCCAAA GGGACCGTTA ACGGCTTTGC GGAGGCAGTG TTCGATTCCG GTGCTGATGC TGCCGCAAGC CGCGCATTGG CTACAGGAGC GGCCGCCCCC GCCGCCGTTC CGGTCATGGC CATGGCCGCG GACAAAGAAA GTGCAGAAAG CGAGTCCCTT GCAAACGGAC AGGGAAATGC GGACGCGCAG GAAAACGGTT CCCCGCACAT TCAACTGCGC ACCAATTTTG CGGACTGCAT CAAGTGGTGC GGCACGCTCA AAACGGATGA AGAAGGAAAC GTTGCCGTAC CCGTGGAAAT GCCGGACAAC CTGACCACAT GGAAGGCCTC AGCCTGGGTC ATCACCCCCG GATTGCAGGT TGGGCAGGCT TCCGCGGAAT TCCTGACCAC GAAGGACTTC ATGGTCAGCA TGCAGGCCCC GCGCTTCTTT GTGGAAAAAG ACATCGTGAT GCTCAGCGCC CTGGTCCGCA ACCGGACGGG AAAGGCTGTC CGCGCCCGCG TTTCCATTTC CCTGAAAGAC GGGTGCCTGG AACTTCTCCC GGCTGACGAT CCGGCCGTGA AGGGGCTTTC CGCAGATACG GATAATTCCG CCGTGCGGGA AGTGGACGTT CCTGCCCAGG GGCAGGCAGT CGTCAACTGG TGGGCTGCCG CCGTCAGGGA GGGAACCGCC GCCGTCGCCA TGGAAGCATC GGCTGGCTCT ACCGGAGACG CCATGCAGAT GAACTTCCCC GTACTGGTGC ACGGCATGAA GCAGCTCCAT GCGGAAAGCG CCGCAGTGCT CTCCGGAGAA CAGGAAAAGA CACTTTCCAT TTCCCTGCCG CAGCAGCGGC GCAGGGAGGA AAGCGAACTG GTCGTCAAGG TCTCCCCCAG CATCGCCCTC AGCATGGTGG AGGCCCTGCC CTATCTGGCG GAATACCCCT ATGGCTGCGT GGAACAGACG CTCAACCGCT TCCTCCCCGC CCTCGTCGTG ACGGACACGC TCAAGCAGCT CGGCCTGAAT CCCGGAGCGG CGCTGAAAAG CCATCGCAGC CTGAATCCCC GGGACATCAG GAACAAGGCC TTCCATGACA GCGTCATGAA AAAACTGGAA CGCAACCCCG TTTATGATGA AGCCGCGCTG AAAAAGATGG CTGCTAGGGG AATCTCCTCC CTCCGGGAAA AACAGCTCTC CAACGGCTCC TGGGGATGGT TCGGAGGGGC GGAAGAGGGA GATCCCGTCA TGACGGCCCA CGTGGCTCAC GGCCTGAAAA TTGCTTCAAA CACGGTCAAC GTGCCGGAAG GCATGCTTTC CGGCGCTGTC CGCTGGCTGA AAAACTATCA GGAACGCCAG ACAGCTCTTC TTGAACAAGG AGACAAATTC CGGAAACTGG AACAACTGCC GGACGGACCT GAAAAGAAGG AAGCCCTCCG CAAACTGGGG AATTACCGCC TGACGGCCTC CGCGACGGAT ACCCTGGTCT ACTCCGTCCT GGCGGAATGC GGCGTGAAAA ACCTGCCTAT GGAACGCTAC CTGTTCCGTG ACAGGCTGGA ACTTCCCGTC ATCAGCCAGA TTCAACTGGC GGAAATCCTG CTGGACGCCC ACCGCATGGA CGACTTCAAC AAGGTAATGC CTGTCATCTC CCAATTCCTC CAGCAGGACG ATTCCCTGCA AACCGCCTGG CTACGCCTTC CCAATGCAGG 
ATACTGGTGG CGCTGGTACG GCAGCAGCGC AGCCACGCAG GCGGCCTACC TGAAACTGAT GTCCAAGAGC GCCCCCGGCA ATCCCGTCAC GGCGCGCCTG GCAAAATGGC TGCTCGACAA CCGCGCCAAC GGTTCCTACT GGGACTCCAC CAAGGATACG GCGGATTGCC TGGAAGCCCT GTCCGCCTAC CTCCTTCAGA CCAGGGAAGG CATGGAGGAC ATGGAAGCGG AAATCCTTTA TGACGGCGTT CCGGTTAAAA CGGTTTCCTG CACGAAGGAA ACCCTGTTCA CCTTTGACAA TGCTTTCCGC ATGAGCGGGA AAGCCCTGGC GGACGGCAGC CACGTCATCA CCATCCGCAG GAAAAAAGGC AGCGGGAACA TCTATGCCAA TTCCACGCTC AGCTACTTCT CCCTGGAAGA CCCCATTCCC GCCGCTGGCA ATGCCGTCAC CGTGGAACGC TCCTATTACC GCATCCGGAA GGAAACGGTG AAAAACGGCT CCGTGAAGGA CACCCAGACG GATGCAGGAG AGCTCGTCTC CCAGGGCAGA GACCTGACAC GGCGCACGCT GTTGAAAAAC GGGGACGTCA TCGCCAGCGG CGACATCATT GAAGTGGTGA TGACCGTGAA AACGAAGAAC GACGTGGAAT ACCTCATGCT CCAGGATCCC AAGCCCGCAG GCTGCGAATC CCGGGAAACC GCCAGTGGCT ACGCCCGGCT GGGAACCGTC TTCGGCTACA GGGAAATAGG GGATGAAGAA ATATGCCTCT TCCTTTCCTC CCTGCCCATG GGCCAATACC AGATCAGCCA CCGCCTGCGC GCGGAACGTC CGGGACGCTT CAGCGCCCTT CCGGCCGTCA TTGAGGCCAT GTACGCCCCG GAACTCCGCG GCAACAGCAG GGAACATAAA ATCGGCATTT CCATGCCTGT TGAACAGAAT AATGACCAGC CGCAGTAG
|
Protein sequence | MNRFFSALFL AAGLLASGMP SPADASTPPQ QHALAEGKNS PVSPQQRKQA VENLEKLLGK RLFKKAAEQA LKQLHEYDDQ YSGTDLLLYY QALTRLNALD DSINLDSILQ EQMKRHGANP HFLRDAALLY QSACHTFKLV DGAYIRGAGP WDGEYSGEAR DRVEALRCLV KAMQLAEKGN NMKLLGQLRF ITAQALPVKG NRYAPLSAFQ SYAALGNLTN LKELPDYVSR EEASPFRNVA TVPVMVNAGT GKPEVIFYHA SSSWETAKND GERMRWLLDA AIQANPELAN QVNYFTASWC RRLFSYANTA PDQEFVYGPG NAGMVAGINP AELKTDQTIV KTDRTGNGKF LLTNLPPDYN FIRIASAVRI TPEPDYYVNA ANLAADEFLA RNQRPAAAQF LGKILQTWNA QSWNKKDKEE FLDVADNLKK RIASITEPNG TFDTDKRTLL AGEPVTVSFS YRNASRACVA VRPVDMKRWQ EERMDKVQTS KTLGKAYKDR YSNLGNLLFS LLHDSSYARY LGEEIKGDEI TLTPGNRHLN HIAHIPVPTR KPGWYLLTVT LENGYRFHRF LTLSDMVLVR RSVPEGNLWF LADARTGMPV EGGSLRLLRY RQDKTLQKRQ VKGITDKDGA MIETIPHRHS SAPPYNMFLG IVSKGDSYVL AGISDYGWES GNYLMDTANK ASDSYSCFFL ESQPVYRPGQ TARFKGVLFR PDIANPGTKD CAGKKLTLTI ASPTGDESIF PPKTVTTDST GSFELELLIP ADAPLGQYGA KLKLHDSADP DFRLYAPLFR MEEYKKPEFS VRMDAPAKPI RLGQPIPVAI QADYYAGGPV NGGKATITIT RTLGANIWTP YWKWSWLYDR SFSPYYIPFF PDAPLTVLEK TVPLDKDGKA SVELPTAQDA RDFPSSNVTY RVSVSVTDAS LREVSASGQV IATFRPFNIF TTLNRGYAPA GTPVQASITA ATADGAKIAH ARGTCVLQHI RADGRRETLE TWDIATRKDG EASLSFQTGE SGLYALSTTL EDGHGNKVEE SFQFLSYGKG KQNPFKINPL SIHPDKKEYA PGDTARLLVT SDYPDARVWT FLRNSWKNES RRLVSLDRQT ALVECRLTRE DMPNMGVNAF TVRNGELHEA SAELLIPPAG QLLAPSVVPG KSQYRPGEQG NVTIQVKGPD GKPVSNGIVA LAVYDKALEY IARPNITDIS KTVWGRLNET GFLSLKKMTA SGTQQDRGPG QPSFQSLLYR NYGPMARKAK GTVNGFAEAV FDSGADAAAS RALATGAAAP AAVPVMAMAA DKESAESESL ANGQGNADAQ ENGSPHIQLR TNFADCIKWC GTLKTDEEGN VAVPVEMPDN LTTWKASAWV ITPGLQVGQA SAEFLTTKDF MVSMQAPRFF VEKDIVMLSA LVRNRTGKAV RARVSISLKD GCLELLPADD PAVKGLSADT DNSAVREVDV PAQGQAVVNW WAAAVREGTA AVAMEASAGS TGDAMQMNFP VLVHGMKQLH AESAAVLSGE QEKTLSISLP QQRRREESEL VVKVSPSIAL SMVEALPYLA EYPYGCVEQT LNRFLPALVV TDTLKQLGLN PGAALKSHRS LNPRDIRNKA FHDSVMKKLE RNPVYDEAAL KKMAARGISS LREKQLSNGS WGWFGGAEEG DPVMTAHVAH GLKIASNTVN VPEGMLSGAV RWLKNYQERQ TALLEQGDKF RKLEQLPDGP EKKEALRKLG NYRLTASATD TLVYSVLAEC GVKNLPMERY LFRDRLELPV ISQIQLAEIL LDAHRMDDFN KVMPVISQFL QQDDSLQTAW 
LRLPNAGYWW RWYGSSAATQ AAYLKLMSKS APGNPVTARL AKWLLDNRAN GSYWDSTKDT ADCLEALSAY LLQTREGMED MEAEILYDGV PVKTVSCTKE TLFTFDNAFR MSGKALADGS HVITIRRKKG SGNIYANSTL SYFSLEDPIP AAGNAVTVER SYYRIRKETV KNGSVKDTQT DAGELVSQGR DLTRRTLLKN GDVIASGDII EVVMTVKTKN DVEYLMLQDP KPAGCESRET ASGYARLGTV FGYREIGDEE ICLFLSSLPM GQYQISHRLR AERPGRFSAL PAVIEAMYAP ELRGNSREHK IGISMPVEQN NDQPQ
|
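The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be cleaned up, translated, and checked for GC content follows. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares with the standard code apart from its alternative start codons. All function names here are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Note: an alternative start codon (e.g. GTG) would be read here as its
    ordinary amino acid, not as M; this gene starts with ATG.
    """
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate("ATGAACAGAT TCTTTTCCGC TCTTTTTCTG"))  # MNRFFSALFL
```

The translated prefix matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field (58%).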