Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2152 |
Symbol | |
ID | 6273736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2620771 |
End bp | 2626509 |
Gene Length | 5739 bp |
Protein Length | 1912 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642614213 |
Product | YD repeat protein |
Protein accession | YP_001878741 |
Protein GI | 187736629 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.0482725 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCCC CTCATCGTTC TGCCGTTCCC GCTTACCGGA AATGGTACTT TGGAGCCGGC GGCAGCGTCG GCAGGCGGGG CATGGAGGCA GCTCCGGAGC CCGATCCGCT ACCGTTTGAA GCGATCCGGA TCCATGGGGA AAGAACCGTG GGGCCGCCGA GCACGTCGGC GGACGGGCAC AGCGCGCACG GGACATTCAC CATCCCGGAA GGGGAGGGGG GAAAAAAGCT CTACGGCACC TGCTCGCTCT TCCTTGGGGT GGACGACTGG GGAAGCCTGG AAGTAAAGGA CTCCGGCGGC AACGTGGTGG CGCAGGTGGA TCTGAAAGAA AACCCGCAGA CGGCGGGCGA ACAGGGCGGG CACAAATACC ACACGGGGAC TGGCGGGGCG CAGCTCCCCT CCGGCACCTA CAGCTGGGAA GTCAGCCAGA CCAACATCGA CTACAATCCG GCAAGCGGCA ACACCTCCAT CTGCAACTAC AGCATCGACG TGGTGCCGAC GGAACCGGGG GGCAGGAAAG AACCCGAACC GTGTCCCTGC GAGGGAGACA CGTGCGACAA CAGCGGCGGA ACGCCGCCCT CCCCGCCGCA GGCCCGGTCG TGCCCTGAGG CGGGCATGGA AAGCGGCGCC CTGGGAAACT ACAGCTCGGC GGGGTGCAGC GTGACGGCGG AAAGCACGGC CACGCTGATG TACTGGTCCT GCAACTTCGG AGCGTTCCGT GGACTTGGGG GCCTTCCGGC CGGAAGGGTG GAACTGAGGG CTGAACAAAA CGTCTCCGGC CTGGAAAGCC CCTCCTCGCT GGCCTACAAC CATCCCCTGA ACAGCCGTCT GGACGTGCCG GAAGGGGGCA TTGCGCCGGG AGTGCGGTTC AACCTGGTGC AGGGAGACCG GGTGATTGCC ATGCGCTGCT ACACGGACGG GTCCGTGCTG CCCATCGGGG TGGACACGTC GGGCGGAGGG CGTGCGGCGC TGGCCACGGT GGAAGGACAA TCCTGCCTGC GCTGGGTGGT GGAAGACGGC AGCCAATACC TCTTCTCGGC GGAAACGGGA ACGCTCCTCT CCTACACCAC CACGGACAGG CAGGTCATCT CCAACGCGTC ATCCTATCTG GACGTCAGGC ATGCCGGAGA CGGCTCGCTG AGGCAAATCT GGAACCTGTG GGACGGCCTG CTCAACGTGG AAAACGTCAC CTCCACGGGC TACACCATCG CGCTCTATAC CCCTGGGCAA ATCACCGGAA CGGACGAACA GGGATTTTAT ACCGTTACGG GCGCTCCCTT GAAAACATTT ATTCTTTCCC TGGATGCTGA GGAAAAGTTC ACCATCACGG AACAGGCGCC TGCCAGGCAG CCCTACGCCG TCACCTGGTG GAACGACGGC CTGGCGTGGA ACATGCGGCA GGGCACGGGG GAAGACGCCC TCACGACCCT CCGCACGCGC ACGGAGCTGG AACCGGAAAA CTCGGTCTGG CAGCTGGTCA CGGAAATCTC CAAAAACGGA ATCGTGGCGG CGCGCACCTG CGCCATCTAC CAGACCACGG ACGTGGGCGA CCTGCTGCTC ACGCTGGCGG AAGGCTACGG AAGCCCGGAG GAGCAAACCA CGCAATACGC CTACGACCAG TGCGGACGGC TCAGAACGGA AACGGCCCCG GGCGGCAGCC AGACTCATTA CGCCTATGAC CTCTACGGCC GCCTGCTCAG CCGGGACGAA CCGTGGGCGG AAAGCGGCAG GCGCATCACG CGCTACACCT ACGCCTGTTC GGGAGAAGCC GACTTCAGCA ACGAACCCGC CACGGAAACG GCAGACCTGC TTCCGCTGGA AGGACACGTC AAAACGCTGA CATCCACCAC CTGGAAATAC ACGACGGCCA ACCACATCAA AAGAACGGAA CGGCGGGTCA CCGGACTGGG CGTGACGGGC ACGCGCCTGA CGGCGGAGGA ACAATGGCTG GCCGGAGCCG CCAACATCCA TGCCCGCGGA CGCACGCGGT TCAGCCGGGA CCTCGACGGC GTGCAAACGT GGCACGACTA CGCGGCCACG ACGGAGCACG GCGCCCTCTA CACGGAAACG GTGGAAACGC GCATCAACGG AGAAGCCGTG CCGGGACAAA GCACGCGCGC CGTCACCTGG ATCACGGCGG AAGGGCAGCG CGTCAGGGAA GAAAACTACC TCCTGCTTTC CACCGGGCAA TGGGCGCTCA CGGGCAGCGC CGTCTACGAA TTTGACACGC AGAACCGGTG GGTGAAGCGG ACGGCGGGCA ACGGCCGGCT CACGGAACGC GAACTGATGT GCGACGGAGG CCTGCTGTGG GAAATCGATG AAAACGGCAT CAGGACGGAC TACGCCTACG ACACGGCGCG CCAACTGGTG GAAGTCACGC GTTCCGCCGT GATGGACGGG GAAACCGTCA TCACGCCGGA AACCATCACC ACCTACGTCC GGGATGCGGC AGGGCGCGTA CTCTCCACGC GTCAAGACAC GGGGGCGATG ACCACGCGGG AAAGCGCCAC CTACGACCTT CTTGGCAGAA CAACCTCCAC CACGGACGTC CTGGGCCGGG TCACTACCTA CGCCTACAGC CAGGACGGCT TGACGGTCAC GCAAACCGTC CCTTCCGGGG CTACATTCAT CACGCGCAGC GCGCCGGACG GAACGGTGAT GGAAGAATCC GGCACGGGGC AGCGGCACGT CATCTACGCC ATCGACCTGG TCAGCGACGG TGTGCGGACC TTCACGAAAG CCGTCTCCGG GGAAACGCAA ACCGAGCTGC AGCGCAGCAT TGTCAACGGA GCCGGGGAAA CCCTGCGCAC GGGCGTCCCC AACACCACCG GTGGCGTCAT TTACACGAGG AACACCTACA ACGCCAGGGG GCAGCTCACC AAAACGCAGA CGGACGCGGG CAATGCGGCC ACGACGATGG CCCCGACCCT GTGGGAATAC GACGCCTTCG GCAACAAAAC GAAAGAAACC TGGAAACTCG CCGATCCGGC CACGACATCC AACTCGCGCA TCACCACGTG GAGCTACGGC GTGGAACAGG CCCAGGATGA AGTATACCGC GTTGTTACGG CGACCAGGAA CAACAGCCGG GGAACGACCT ATAACGAAAC GCAGAAAACG CTGGCTTCCT CCCTCTCGTC CACGCTGGAA AGCAAAGTCA TTTCCATCGA CCCCAGGGGA AACGCTTCCG AACAATGGAG CGAATACGGT CCGGGCGCCG TCCGGACGCA GAAAAGCAGC ATCCCCACCT CCGACATCAC GGCCGCCGCT ACGGTCATCG ACGGTTTTAT CATCTCGCAA ACGGACCATG CGGGCGTCAC GGCCACGCAT ACCCGCGCCT ACACGGAAAC CGGCGTCATC TACGCCAGCA CGGACGGCCG GGGCAACACG GTCACGACGC ACACCGACCT TACCGGGCGC ACGATCTCGG TGACGGACGC GGCGGGCAAC ACGACTTCTA CCGCCTACGG CCCCTGGTTT GACCAGCCTG CCGTCGTCAC CAACGCCCTG GGCAACACGA CCTGCTACGG CTACGACCTC CGGGGCCGCA ACACGGCGCA ATGGGGAACG AGGGCCCAGC CCCTGCTCTT CGGCTATGAC GAGGCGGACA GGATGATAAG CCTCACCACG TTCCGGGAGG ACGCGGGCGA CATCACCGCC GACCCCACGG GACGCACGGA CGGGGACGTC ACTACGTGGA GCTACGATGA CGCCACGGGC CTGCTCATCC GCAAAACCTG GGCGGACGGC ACCCATGAAG ACACCGCCTA CAATGCCCTG AACTTCAAAT CCACGCTCAT GGACGCGCGG GGGGTGGTCA CCACCTGGGG CTACAACCTG AAGAAGGGGG TCAACAACTC CGTCTCCTAC AGCGACTCCA CGCCCGGCAT CCAGTACGCC TACAACCACC TCAACCAGCT GACCCAGGTC ACGGACGCCT CCGGCTCGCG CGTCCTCACG TACACCCCCT GCAACGAACC GGACACCGAC AGCATCACCA TCGGAGGGAG CTCTTACCAG CTCCAGGAAC ACTACGACAC TTACGGACGC TCCTCCGGCT ATACCCTGAA ACAGGGAACC GACGTCCTCC AGGAAGCCAG CCAGGGCTAT GAAACCGACG GAAGGCTGGC CAGCGCCGGA ATCAGGCACG GGGGAACGGA GCAAAGCTTC GCCTACGGCT ACCTGGCAGG AAGCAGCCTG CTCTCCAGCC TTGCGATGCC CGACGGCATC GTCCGGGAAC TTGCCTATGA ACAGCGCCGC AACCTGGTCA CGGCAATCAA CTGCCGCCTG GGGGAAACCG TGCTGGTCTC CCGCAGCCAG GGCTACGATG CCCTGGGACG CCCGGTCACC CGCACCCAGC AGCGTGGAAC GGAACCCGCC CGCAGCGACA GCTTCAGCTA CAACGGCAGA AACGAACTCA CCGCCGCTAC CCTGGGCGCC GCCCCCTACG GCTACAGCTA CGACAACATC GGCAACCGCA AGACGGCACG GGAACCGGCC GAAGAACTCG CCTACGCGGC CAACGGGCTC AACCAGTACA CCGGCATTGA AGAAAGCGGG GAAGCTCCTT TTGTGCCGAC GTACGACGCC TCGGGCAACC AGACCCTCAT CAAGACGTCA ACGGGCATCT GGACGGCCGT GTACAACGCG GCCAACCGCG CGGTGAGCTT CACCAGCCGG GACGGCGCGA CAGTCGTGGA ATGCGGCTAC GATTACCAGG GACGCCGCTA CATGAAGAAA GTGACCCAAA ACGGCACGGT CGCCAGCCAC GAACGCTATC TATACCGCGG CTATTTACAA ATAGCGGCAT TGGATATGCT GGACAACCGT AACGTGCTTC GCACGCTGTT GTGGGATCCT CTGGAACCGG TGGCCACCCG CCCCCTGGCC CTCGCGCAGG GCGCTTCCCT GTACTGCTAC GGCATGGACT TCAACAAGAA TGTGTCGGAG GTCTTCGACG CACAGGGAAC GATCGCGGCG GCTTACGACT ACTCGCCCTA TGGGATAGTT GGCAGCACAG GCAACCTCGT CCAACCCGTA CAGTGGTCCG GCGAGATGCA CGACGAAGAA CCCACCCTGG CCTATTATAA TTACCGCTTT TACAACCCCA AAGACGGCAG GTGGATCAAT AGGGATCCCA TCGCTGAACA GGGAGGATGG AATTTTTATG CGTTCGTAGG GAACAGCCCT CAAGATAAGT TTGATGCTTT GGGGTTAGAA GATAAGAAAA AAGATAAAGA ATTTCTTGGA TATGTTTATG ATCAAACTTT AGAAGGAACA GACATATACA TTTGTGAAAC AACAGGATTG AAAATAAATG ACAAAATTTT AAATAATACA GTTTCAGAAT CCATGAACGC TTTTAAGGAA GCTAATAGTG CTAATCAAGC GGCAAGCAAT ATTAGATATG CAAAAGATTT TAGTAAAAAA GTGTCTAAAC TTTATAAAGT AACTCCTGGA ACAAAAATTC CCGGAGTAAA AATTTCTTCA ATTAAAGATA TAATTGATTT TGTTTATATC AAAAGAGATG AAAAAGTTGC ATATCAACAA TTTTATGAAG CTGCTACACG ATATCAAACG CTAAAGAAAA AGACATTCTC AAGTTGTTTT GAATTATGCG TAGCCATGGG AGAGTATGCA AAAGTATTTT TAAAAAATGA TTTAGGAAAA GGAATAGTTA AATTTTCTAC TGACTATTGT ATTTCTAAAT GTAATAAAAG GCATAAAACA TTAGATTAA
|
Protein sequence | MNPPHRSAVP AYRKWYFGAG GSVGRRGMEA APEPDPLPFE AIRIHGERTV GPPSTSADGH SAHGTFTIPE GEGGKKLYGT CSLFLGVDDW GSLEVKDSGG NVVAQVDLKE NPQTAGEQGG HKYHTGTGGA QLPSGTYSWE VSQTNIDYNP ASGNTSICNY SIDVVPTEPG GRKEPEPCPC EGDTCDNSGG TPPSPPQARS CPEAGMESGA LGNYSSAGCS VTAESTATLM YWSCNFGAFR GLGGLPAGRV ELRAEQNVSG LESPSSLAYN HPLNSRLDVP EGGIAPGVRF NLVQGDRVIA MRCYTDGSVL PIGVDTSGGG RAALATVEGQ SCLRWVVEDG SQYLFSAETG TLLSYTTTDR QVISNASSYL DVRHAGDGSL RQIWNLWDGL LNVENVTSTG YTIALYTPGQ ITGTDEQGFY TVTGAPLKTF ILSLDAEEKF TITEQAPARQ PYAVTWWNDG LAWNMRQGTG EDALTTLRTR TELEPENSVW QLVTEISKNG IVAARTCAIY QTTDVGDLLL TLAEGYGSPE EQTTQYAYDQ CGRLRTETAP GGSQTHYAYD LYGRLLSRDE PWAESGRRIT RYTYACSGEA DFSNEPATET ADLLPLEGHV KTLTSTTWKY TTANHIKRTE RRVTGLGVTG TRLTAEEQWL AGAANIHARG RTRFSRDLDG VQTWHDYAAT TEHGALYTET VETRINGEAV PGQSTRAVTW ITAEGQRVRE ENYLLLSTGQ WALTGSAVYE FDTQNRWVKR TAGNGRLTER ELMCDGGLLW EIDENGIRTD YAYDTARQLV EVTRSAVMDG ETVITPETIT TYVRDAAGRV LSTRQDTGAM TTRESATYDL LGRTTSTTDV LGRVTTYAYS QDGLTVTQTV PSGATFITRS APDGTVMEES GTGQRHVIYA IDLVSDGVRT FTKAVSGETQ TELQRSIVNG AGETLRTGVP NTTGGVIYTR NTYNARGQLT KTQTDAGNAA TTMAPTLWEY DAFGNKTKET WKLADPATTS NSRITTWSYG VEQAQDEVYR VVTATRNNSR GTTYNETQKT LASSLSSTLE SKVISIDPRG NASEQWSEYG PGAVRTQKSS IPTSDITAAA TVIDGFIISQ TDHAGVTATH TRAYTETGVI YASTDGRGNT VTTHTDLTGR TISVTDAAGN TTSTAYGPWF DQPAVVTNAL GNTTCYGYDL RGRNTAQWGT RAQPLLFGYD EADRMISLTT FREDAGDITA DPTGRTDGDV TTWSYDDATG LLIRKTWADG THEDTAYNAL NFKSTLMDAR GVVTTWGYNL KKGVNNSVSY SDSTPGIQYA YNHLNQLTQV TDASGSRVLT YTPCNEPDTD SITIGGSSYQ LQEHYDTYGR SSGYTLKQGT DVLQEASQGY ETDGRLASAG IRHGGTEQSF AYGYLAGSSL LSSLAMPDGI VRELAYEQRR NLVTAINCRL GETVLVSRSQ GYDALGRPVT RTQQRGTEPA RSDSFSYNGR NELTAATLGA APYGYSYDNI GNRKTAREPA EELAYAANGL NQYTGIEESG EAPFVPTYDA SGNQTLIKTS TGIWTAVYNA ANRAVSFTSR DGATVVECGY DYQGRRYMKK VTQNGTVASH ERYLYRGYLQ IAALDMLDNR NVLRTLLWDP LEPVATRPLA LAQGASLYCY GMDFNKNVSE VFDAQGTIAA AYDYSPYGIV GSTGNLVQPV QWSGEMHDEE PTLAYYNYRF YNPKDGRWIN RDPIAEQGGW NFYAFVGNSP QDKFDALGLE DKKKDKEFLG YVYDQTLEGT DIYICETTGL KINDKILNNT VSESMNAFKE ANSANQAASN IRYAKDFSKK VSKLYKVTPG TKIPGVKISS IKDIIDFVYI KRDEKVAYQQ FYEAATRYQT LKKKTFSSCF ELCVAMGEYA KVFLKNDLGK GIVKFSTDYC ISKCNKRHKT LD
|
| |