Gene Information |
Locus tag | Amuc_1143 |
Symbol | |
ID | 6273894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1365798 |
End bp | 1371773 |
Gene Length | 5976 bp |
Protein Length | 1991 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642613195 |
Product | YD repeat protein |
Protein accession | YP_001877750 |
Protein GI | 187735638 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
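The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in Protein Length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1365798
END_BP = 1371773
GENE_LENGTH_BP = 5976
PROTEIN_LENGTH_AA = 1991


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one terminal stop codon, for an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1371773 - 1365798 + 1 = 5976 bp, and 5976 / 3 - 1 = 1991 aa, which agrees with the Gene Length and Protein Length fields.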
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0172483 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTACCA ACGATCAACA AAATGACGCC CCCTCTTTGA ACAGGGGTTC CGCAAACATG ATTAACTCCA ACCCCGCCGC CGGGCTTCCC GATGCGGCCA GCCAGCCCGG CGCGCCGAGC GCCGCAGCGC CCCTCAATGC CATGTCCACA CCCATTCAGC CATACGGCGC GGATTCCGTT ATTTATGAGA AATCCGACAA CTTCGTTCAG ACGTCTGCCG GAGCCGATGT CTTCATGTCT CCAGTCAACG ACACCTTTAC CGTCCCGGAA GGCGGGGCCA CGGCTGTGGC CAGCCTGACG GTGGATGACT GGGGCAAGCT GACCATTTCC GGCCCGGGCG GCACGTTTGA GCTGGACCTG ACCTCCGCCG CCGATGAACC CGGAGAACTG GGAGGCCACC AGGAATGGTC CAAATCAGGC TCCTTTGAGT TATCCGAAGG CACGTACACT CTCTCCATCA CGCACCAGAA CATCGACATG CCGCACAACG AATACAACCA GTCCGTGTGC AGGTACTCCG TGACGGTGAC CGCGCATGGG GGCTCCAGCA GCAGTTCAAG CAGTTCGGAC ACCCCTCCTT CTTCCCTGTC GAGCAGCTCC TCTTCGGACG TGCCGCCGGA GGAAAAGGAA ATCTGCTGCC GTTGCGGTTG TTGTACAGAC GCGGAAGGCA ATGAATACAC CATTGCGGCG GAAAAACTGC CGGGTGACCC CGGCGTGGAA ATCTGCATGT CCCAGGCCGA ATTCCTGGCC AGGGGAGGGG CTTCGGCTCC GTCTCCCGCG GCCTTCAGCC TCCGCTCCGC GGAAAACGCC CGTGAAACGG CGGAAGCCTG CGGCGGCCTG AAATACGTGA GCCCGTGGGC CTGGCGCGCC CATCTGGACG AAACGTCCGG CCTCATCACC ATGGTGCCGC CGGCGGGCGC CGCGCTTTAC TTCAACGTGC AGGCCGGTTC CGATACGGCC CTGCCTGCGG GCATCTCCCG CAAGCGCGAC TTCAGGGTGC AGCTGCTGGA TGAAACTCTG GCTCCTGCCG CTTCCGGCGC CCCCGCCTAC CTTTCCCTGG TGGACGCGGA CGGGCAGAAA ATCCGCTTCT CCGCGGAAAC CGGCGCTGTG GTCGGCATGA CCTCCGCCTC CGGCAGGGTT CTTCTGGCGG AAGACTACTT CCGGAATGTG AGCAATACGT ATGACCATGA GGGCAGCCTG GTAAGCAGCT ACAGCGCCGC GGAGGGGCTG ATGCGCACCC GGACCGGAGC GGACGGAGAA CTCGTCATGG AATGGTACGC TCCCGCCGCC GTCACGGTCC TGGCCGACGG AACATACGAG GTAACGGGGG AACCTTATAA AACCTCCTCC TTCCTGTCCT CGGAAGAAAA CGGCGTGCGG ACCACCGTCA TCACGCGCCA GCAGCGCGGA CTGCCGGCCC ACACCATCAC CCGTACGGAA GAACCCGGCA GAGTCAGCAT CGCCAAAGGC CAGGGGGACG ACACCATCAT CCGTACCATT GAAACCAACC GCCTCTACGG AGGCCTCTCG GAACGCATTG AGACCGTCAG GGGCATCAAC GATGCCGAGC CTGTTTCCTG CAGCCGCAGC GTCAGGCAAT ACACGGACGG CGGCTGGCTG CTGGTAAGCG AAACGGAAGC CTTCAACACG CCGCTGGCGC GGACGACCTC CTACGAATAC AACAGCCAGT ACCGCGTCTC CCGGATCAAC CGCCCGGACG GAGGCTACAC GCGCTATGAA TACGACGGCG AAGGACGGGT CACGCTTGAG GCGGCGCCCT GGGCCGGCGG CGGAGAACAG 
GTAACCCGGA CCGAATACGC CGGCCTGCGC TTCTACGACA ACCGTCCGGT GCGTGTCGCC GAATCCCGGG TGCTGTCCGA CGGCACGGAA ATCGAACTGA CGGCCGTCGC TTACGCCTAT GAGCAGTCTC CCCTCATGGA ACGGGTTGTC AAAACCGTGA CTGCCGCCGG TTCCAGCCAG GAACAGACAA GCGTGGAAGA AACGTATGGA GAAGCCGCCG CCTATCCCTA TGCCGCCGGG CAAATGAAAT TCACCCGGGA CATTGCCGGA GTGGAAACCT CCTATGACTA TGAGGCGGCC GCGGAACACG GCGCCGCGCA TAAAAAGACG GCCATCACCA AAGCCGGCGG CGGACTGGTG GCCGGACAGA GCCGCAAGAC GGAATCGTTC ATTGCCGCCA ATGATACGGT GCTTTTTGAG CAGGAAAGCA TCTGGGATGG TGAAAACTGG CTGCTGCTCT CAAGCGGCGC CCATGAATAC GATGAAGAAG GCCGCCGCAC GAAAACCACG CGGGGCAACG GCCGCGTCAG CGTCACGTCC TGGATGTGCT GCGGCAAGCT CTCGGAAACG GACGAAGATG GTGTCCTGAC GTCGTACGGT TACAACAGCG CCCACCAGCT GGTGGAAACC ATCCGTTCGG AAATCAGCGA CGGAGACACG GTCGTTACCC CGGAAACCAT CACCACCTAC ACCCGGGACG CCTCCGGGCG CGCCCTGCAG ACGCGCCGGG ACAGGGGAGC CATGACCACG ACGGAAAGCG TGGAATACGA CAGGCTCGGC CGTATCGTCA GGCAAACGGA TGTGCTGGGT CGGGTGACGG CGACAGCCTA CAGCGAAGAC GGCCTTACGG AAACCGTCAC GACGCCCTCC GGCGCTACCC TCGTCACGGA ATATCATGCC GACGGCTCCG TACTTCATGA ATACGGCACG GGACAGCGCG AACGCTGCCA TGTCTATGAC ATTGACAATA ACTGTTTGAG GGAAACCGTT ACCCTGGCCG GCCAGACCAT CATCCTTTCC CGGACCCTGG TCAACGGCTT CGGACAAAGC GTCGTGCAAG TGACGCCCAC GACTGCCGGG TTCCTGTATG ACCGTTCCGA ATACGATGAA CAGGGGAGTC TCATCCGCTC ATGGAGGGAT GCGGGAACGC AGGAGGGGGC CGTCGCCATG GCGCCTGCGC TCTATGAATA CGATGCCTTT GGCAACATGA CCAGGGAAAC GCTCGCCCTG GCGGAGCAGC CCGCTCCGGA CAACAGCCCC ATCCGGGAAT ATGCCTTCAG CGTGGAAAAC GCGGAGGACG GCGTCTATAT GGTGACGGCG CAAATCCGCT ACAATGCTGA GGGACAGCCG CTTGTCTCCG TACGGAAGCA GCTCCTGTCC GAACTCTCCG GAGTTCTGGA AACAAAAACG GTCATCGTTA ACGAACGCGG CTTGACTTCG GCGGAATGGA CGGAGTATGC TGGAAATACG AAAAGAATCC AAAAGAGCGT TATCCCCTCT TCCAGCGTCA CGGCTCAAAC GGTGGCGATG GATGGTTGGG TGCTCTCGCA GCAGAACCAC GCGGGCATCA CGGAGACGGC CGCCCGCGCC TATACGGCCT CGGGCATGAC TCTGACCCGC ACGGACGGCC GCGGCAACAC GGTCACGACC CGGACCGACC TGGCCGGTCG GGCCGTCAGC GTGACGGACG CCGCGGGGAA TGAAACCGTG ACGCAATACG ACTCCTGCCA CGACCTGGCT GCCGTAGTGA CGGACGCGCT GGGCAATACG AAATGCGCCA GATACGACGC CAGAGGCCGG AAAACGGCCG 
AATGGGGGAC GGGGACGCAG CCCCTGCTCA TGGGCTATGA CGAGGCCGAC CGCCTGGTGA GCCTGACCAC CTTCCGCGCG GCGCAGGAAG GCGACATCGC GGAGGACCCC TCCGAGCGCG CGGACGGCGA CACCACCACC TGGAACTATG ACGAAGCCAC GGGGCTGGAA ACGCGCAAAA CCTATGCCGA CGGAACGCAC GTGGACAAAA CCTGGGACGC CTTCAACAGG CTTGCTACGG AAACAAACGC CCGCGGCATC GTCAAGACCT GCACTTACGA ACAGCCACGC GGGCTGCTGG TGGGAATCAG CTACTCAGAC GCCACGCCCG GCCAGAGCTT CGCCTACGAT CACCTCGGTC AATTGACGCA AATCACTGAT GTTGCCGGAA CGCGAACCTT CGCCTACAAT CTCTACGGAG AACCGGAAAC CGACAGCCTT GCGGCAAACG GCATCGCCTG GCAGGTCTCC GAGCGCTATG ACGGGCTTGG CCGTCAGGCG GGGTACGAAT TAAGCGCGGA CGGCCGCCGC GTCCAGCAGA CGCACCTGTC CTATGACGGG AAAGGCCGCC TCTCCACCCT CACGGCGGAA GGCATGGAAA CGCCCTTCTC CTGGACTTAC TCCGAACATG GAGGGCTTGT GGAACAACTC GCCTACCCCA ACGGCATGAC CCGGGTCAAC ACCTATGAAG ACAGCCGCGA CCTCCTCTCC GTCATCGACT ACCAGAGGCC CGGAAGCGCC AACCCGCCGG CAAGGCACGA ATACGACTAC GACGCGCTGG GCCGTCCTGC ACGGCGCAGG GACACGTGGA ACACGGCGGC GCCCAAAACG ACGCGTTTGT TCACCTACAA CAGCCGTGGC GAACTGGTCG GAGATCAGCT CAGGCCCGGC GGCCGCTTTG GCTATCAGTA CGACAACATC GGCAACCGGA AAGAAGCCTT CGAATTCGGC AGCACCACGG ACTATGAAAC CGATGAACTC AACCGGTATG CGGGCATCGT CAGAAATAGA GGGGAAGCCT TTACACCCCA ATACGACGCG GACGGCAACC AGACGCTGGT AAAAACATCC ACGGGCATCT GGGAAGTCAC CTACAACGCG GAAAACCGGC CCGTGAAATT CGAAAGCGAA GACGGAGGGA CAACCGTGGA ATGCGCCTAC GACTCCATGG GCAGGAGATT CGAGAAAAAA GTGACGGTTG GAGGGACAAC GGGCTTCCAC GCGCGCTACC TCTACCGTGA CTACCTGCAG GTGGCGGAGT GCGACTTGAC CGGGGAAACG CCGGAGGTTG TGCGCAGTTA CATCTGGGAC CCCTCGGAAC CTGAGGCCAC GCGCGTCCTG TCCATGACGC GCTGGGAAGC GAACGGGACG CAGGAGAAAG AGCATCTCTA CTGCATGCAC GACGCGATGA AAAACGTCAC CTCCCTCTTC GGGGAAGCGC GCGGACGCCG CGCCCTGTAT GAATACCGGC CGTACGGAGG TCTGATCACG TCGGAAGGCA ACATGGCGGA AGAGAACAAA TTCCGCTTCT CCAGCGAATA CATGGACGAC GAACTTGGGC TGGTCTACTA CAACTACCGG CATCTCAATC CGCTTGACGG CAGGTGGATC AGCCGCGATC CCATTGAGGA AGAAGGTGGT TGGAATTTGT TCGCGTTTGT AGGAAATAGA ATTTTTAATC AAGCTGATAT TTTAGGGTTG TGGCCATGGT CCCAGAAACA ACCAGATCCT CCAACCTTTA CAACAGAAAC AAAAAAATGT CCAGATAAAA ATACGATAAG CGTAGTTGTG CGTAGAAGTA ACGAAATTAC 
GGTGGATGCA GACGGTTCTC CTCGTGCGTA TCATCCAAAA AACATAGGGT TAGATGATAA TAGAAATGGA GGAATAGGAA AAGATAATTA CGGTATTGTT AGTCCTGATG TTATTCAAGG GAAAAATGAT CCTGCTCCAG GTTATTATGT ATCAGTTACA GCATTATTCG ATCCCCGGAA AAAGAAAACA GACCCTCGTA GATATGTAAA TTCAGAAGTA ATTCCATATC TTGTTTTTAA TAAAGAGGAT AGAAAAAAAG GTGCTAAGGC CGGTGATTAT GCAACAGTTA CTAAAAAGAT GCCAAATGGT GATCTTTTAA TTGTTCACGC TATTGTTGCA GATTATAACC CTTATTCTAA AGGGGAAGGT TCTATAAAAT TAGTAAAGGA ATTAGGAGGA AATCCGGATC CTAGAAGAGG AGGGGTAAAA TGTAAGGAAG GTTTTACTAT TTACGTGTAT CCTGGGACTG CAGAAAAATT TGATAGCGAT AAAGTTTCTC ATGAAACTAT TCAAAAAAAA GGTAAAGAAA TTTGGGATAA GCAGCATAAC AAATAA
|
Protein sequence | MFTNDQQNDA PSLNRGSANM INSNPAAGLP DAASQPGAPS AAAPLNAMST PIQPYGADSV IYEKSDNFVQ TSAGADVFMS PVNDTFTVPE GGATAVASLT VDDWGKLTIS GPGGTFELDL TSAADEPGEL GGHQEWSKSG SFELSEGTYT LSITHQNIDM PHNEYNQSVC RYSVTVTAHG GSSSSSSSSD TPPSSLSSSS SSDVPPEEKE ICCRCGCCTD AEGNEYTIAA EKLPGDPGVE ICMSQAEFLA RGGASAPSPA AFSLRSAENA RETAEACGGL KYVSPWAWRA HLDETSGLIT MVPPAGAALY FNVQAGSDTA LPAGISRKRD FRVQLLDETL APAASGAPAY LSLVDADGQK IRFSAETGAV VGMTSASGRV LLAEDYFRNV SNTYDHEGSL VSSYSAAEGL MRTRTGADGE LVMEWYAPAA VTVLADGTYE VTGEPYKTSS FLSSEENGVR TTVITRQQRG LPAHTITRTE EPGRVSIAKG QGDDTIIRTI ETNRLYGGLS ERIETVRGIN DAEPVSCSRS VRQYTDGGWL LVSETEAFNT PLARTTSYEY NSQYRVSRIN RPDGGYTRYE YDGEGRVTLE AAPWAGGGEQ VTRTEYAGLR FYDNRPVRVA ESRVLSDGTE IELTAVAYAY EQSPLMERVV KTVTAAGSSQ EQTSVEETYG EAAAYPYAAG QMKFTRDIAG VETSYDYEAA AEHGAAHKKT AITKAGGGLV AGQSRKTESF IAANDTVLFE QESIWDGENW LLLSSGAHEY DEEGRRTKTT RGNGRVSVTS WMCCGKLSET DEDGVLTSYG YNSAHQLVET IRSEISDGDT VVTPETITTY TRDASGRALQ TRRDRGAMTT TESVEYDRLG RIVRQTDVLG RVTATAYSED GLTETVTTPS GATLVTEYHA DGSVLHEYGT GQRERCHVYD IDNNCLRETV TLAGQTIILS RTLVNGFGQS VVQVTPTTAG FLYDRSEYDE QGSLIRSWRD AGTQEGAVAM APALYEYDAF GNMTRETLAL AEQPAPDNSP IREYAFSVEN AEDGVYMVTA QIRYNAEGQP LVSVRKQLLS ELSGVLETKT VIVNERGLTS AEWTEYAGNT KRIQKSVIPS SSVTAQTVAM DGWVLSQQNH AGITETAARA YTASGMTLTR TDGRGNTVTT RTDLAGRAVS VTDAAGNETV TQYDSCHDLA AVVTDALGNT KCARYDARGR KTAEWGTGTQ PLLMGYDEAD RLVSLTTFRA AQEGDIAEDP SERADGDTTT WNYDEATGLE TRKTYADGTH VDKTWDAFNR LATETNARGI VKTCTYEQPR GLLVGISYSD ATPGQSFAYD HLGQLTQITD VAGTRTFAYN LYGEPETDSL AANGIAWQVS ERYDGLGRQA GYELSADGRR VQQTHLSYDG KGRLSTLTAE GMETPFSWTY SEHGGLVEQL AYPNGMTRVN TYEDSRDLLS VIDYQRPGSA NPPARHEYDY DALGRPARRR DTWNTAAPKT TRLFTYNSRG ELVGDQLRPG GRFGYQYDNI GNRKEAFEFG STTDYETDEL NRYAGIVRNR GEAFTPQYDA DGNQTLVKTS TGIWEVTYNA ENRPVKFESE DGGTTVECAY DSMGRRFEKK VTVGGTTGFH ARYLYRDYLQ VAECDLTGET PEVVRSYIWD PSEPEATRVL SMTRWEANGT QEKEHLYCMH DAMKNVTSLF GEARGRRALY EYRPYGGLIT SEGNMAEENK FRFSSEYMDD ELGLVYYNYR HLNPLDGRWI SRDPIEEEGG WNLFAFVGNR IFNQADILGL WPWSQKQPDP PTFTTETKKC PDKNTISVVV 
RRSNEITVDA DGSPRAYHPK NIGLDDNRNG GIGKDNYGIV SPDVIQGKND PAPGYYVSVT ALFDPRKKKT DPRRYVNSEV IPYLVFNKED RKKGAKAGDY ATVTKKMPNG DLLIVHAIVA DYNPYSKGEG SIKLVKELGG NPDPRRGGVK CKEGFTIYVY PGTAEKFDSD KVSHETIQKK GKEIWDKQHN K
|
| |
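The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, not the method used to produce the GC content field: it strips the display whitespace and computes the G+C fraction. The helper names are hypothetical, and the demo uses only the first three blocks of the gene sequence rather than the full 5976 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T bases; any other symbol is ignored."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("no A/C/G/T bases found")
    return sum(base in "GC" for base in counted) / len(counted)


# First three 10-base blocks of the gene sequence above, used as a small demo.
demo = clean_sequence("ATGTTTACCA ACGATCAACA AAATGACGCC")
print(len(demo), round(gc_fraction(demo), 2))
```

Pasting the complete Gene sequence field into `clean_sequence` should give a string whose length can be compared with the Gene Length field, and whose `gc_fraction` can be compared with the rounded GC content field.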