Gene Amuc_0983 details


Gene Information

Locus tag: Amuc_0983
Symbol:
ID: 6274152
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1171299
End bp: 1177088
Gene Length: 5790 bp
Protein Length: 1929 aa
Translation table: 11
GC content: 60%
IMG OID: 642613034
Product: YD repeat protein
Protein accession: YP_001877593
Protein GI: 187735481
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
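
The gene and protein lengths listed above follow from the coordinates. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS includes its stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1171299, 1177088)
print(length)                  # 5790
print(protein_length(length))  # 1929
```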


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.474988
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.984855
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCGC CTCATCGTTC CTCCTCCGCC CATCGTTCTG CCGTTCCCGC TTACCGGAAA 
TGGTACTTTG GTTCCGGTGC CAGCCAATCC GGCTCTACTC GCCGCGTGCG CTTATCCGGC
CAGCCGGATC CCGATCCGCT GCCGTTTGAA GCGATCCGGA TCCATGAGGA AAGGAAAGTG
GGGCCGCCGA GCACGTCGGC GGACGGGCAC AGCGCGCACG GGACATTCAC CATCCCGGAA
GGAGCGGAGG GCAAAAAACT CTACGGCACC TGCTCGCTCT TCCTTGGGGT GGACGACTGG
GGAATCCTGG AGGTGAAGGA CTCCGGCGGC AACGTGGTGG CGCAGGTGGA TCTGAAAGAA
AACCCGCAGA CGGCGGGCGA ACAGGGCGGG CACAAATACC ACACGGGGAC TGGCGGGGCG
CAGCTCCCCT CCGGCACCTA CAGCTGGGAA GTCAGCCAGA CCAACATCGA CTACAATCCG
GCAAGCGGCA ACACCTCCAT CTGCAACTAC AGCATCGACG TGGTGCCGAC GGAACCGGGG
GGCAGGAAAG AACCCGAACC GTGTCCCTGC GAGGGAGACA CGTGCGACAA CAGCGGCGGA
ACGCCGCCCT CCCCGCCGCA GGCCCGGTCG TGCCCTGAGG CGGGCATGGA AAGCGGCGCC
CTGGGAAACT ACAGCTCGGC GGGGTGCAGC GTGACGGCGG AAAGCACGGC CACGCTGATG
TACTGGTCCT GCAACTTCGG AGCGTTCCGT GGACTTGGGG GCCTTCCGGC CGGAAGGGTG
GAACTGAGGG CTGAACAAAA CGTCTCCGGC CTGGAAAGCC CCTCCTCGCT GGCCTACAAC
CATCCCCTGA ACAGCCGTCT GGACGTGCCG GAAGGGGGCA TTGCGCCGGG AGTGCGGTTC
AACCTGGTGC AGGGAGACCG GGTGATTGCC ATGCGCTGCT ACACGGACGG GTCCGTGCTG
CCCATCGGGG TGGACACGTC GGGCGGAGGG CGTGCGGCGC TGGCCACGGT GGAAGGACAA
TCCTGCCTGC GCTGGGTGGT GGAAGACGGC AGCCAATACC TCTTCTCGGC GGAAACGGGA
ACGCTCCTCT CCTACACCAC CACGGACAGG CAGGTCATCT CCAACGCGTC ATCCTATCTG
GACGTCAGGC ATGCCGGAGA CGGCTCGCTG AGGCAAATCT GGAACCTGTG GGACGGCCTG
CTCAACGTGG AAAACGTCAC CTCCACGGGC TACACCATCG CGCTCTATAC CCCTGGGCAA
ATCACCGGAA CGGACGAACA GGGATTTTAT ACCGTTACGG GCGCTCCCTT GAAAACATTT
ATTCTTTCCC TGGATGCTGA GGAAAAGTTC ACCATCACGG AACAGGCGCC TGCCAGGCAG
CCCTACGCCG TCACCTGGTG GAACGACGGC CTGGCGTGGA ACATGCGGCA GGGCACGGGG
GAAGACGCCC TCACGACCCT CCGCACGCGC ACGGAGCTGG AACCGGAAAA CTCGGTCTGG
CAGCTGGTCA CGGAAATCTC CAAAAACGGA ATCGTGGCGG CGCGCACCTG CGCCATCTAC
CAGACCACGG ACGTGGGCGA CCTGCTGCTC ACGCTGGCGG AAGGCTACGG AAGCCCGGAG
GAGCAAACCA CGCAATACGC CTACGACCAG TGCGGACGGC TCAGAACGGA AACGGCCCCG
GGCGGCAGCC AGACTCATTA CGCCTATGAC CTCTACGGCC GCCTGCTCAG CCGGGACGAA
CCGTGGGCGG AAAGCGGCAG GCGCATCACG CGCTACACCT ACGCCTGTTC GGGAGAAGCC
GACTTCAGCA ACGAACCCGC CACGGAAACG GCAGACCTGC TTCCGCTGGA AGGACACGTC
AAAACGCTGA CATCCACCAC CTGGAAATAC ACGACGGCCA ACCACATCAA AAGAACGGAA
CGGCGGGTCA CCGGACTGGG CGTGACGGGC ACGCGCCTGA CGGCGGAGGA ACAATGGCTG
GCCGGAGCCG CCAACATCCA TGCCCGCGGA CGCACGCGGT TCAGCCGGGA CCTCGACGGC
GTGCAAACGT GGCACGACTA CGCGGCCACG ACGGAGCACG GCGCCCTCTA CACGGAAACG
GTGGAAACGC GCATCAACGG AGAAGCCGTG CCGGGACAAA GCACGCGCGC CGTCACCTGG
ATCACGGCGG AAGGGCAGCG CGTCAGGGAA GAAAACTACC TCCTGCTTTC CACCGGGCAA
TGGGCGCTCA CGGGCAGCGC CGTCTACGAA TTTGACACGC AGAACCGGTG GGTGAAGCGG
ACGGCGGGCA ACGGCCGGCT CACGGAACGC GAACTGATGT GCGACGGAGG CCTGCTGTGG
GAAATCGATG AAAACGGCAT CAGGACGGAC TACGCCTACG ACACGGCGCG CCAACTGGTG
GAAGTCACGC GTTCCGCCGT GATGGACGGG GAAACCGTCA TCACGCCGGA AACCATCACC
ACCTACGTCC GGGATGCGGC AGGGCGCGTA CTCTCCACGC GTCAAGACAC GGGGGCGATG
ACCACGCGGG AAAGCGCCAC CTACGACCTT CTTGGCAGAA CAACCTCCAC CACGGACGTC
CTGGGCCGGG TCACTACCTA CGCCTACAGC CAGGACGGCT TGACGGTCAC GCAAACCGTC
CCTTCCGGGG CTACATTCAT CACGCGCAGC GCGCCGGACG GAACGGTGAT GGAAGAATCC
GGCACGGGGC AGCGGCACGT CATCTACGCC ATCGACCTGG TCAGCGACGG TGTGCGGACC
TTCACGAAAG CCGTCTCCGG GGAAACGCAA ACCGAGCTGC AGCGCAGCAT TGTCAACGGA
GCCGGGGAAA CCCTGCGCAC GGGCGTCCCC AACACCACCG GTGGCGTCAT TTACACGAGG
AACACCTACA ACGCCAGGGG GCAGCTCACC AAAACGCAGA CGGACGCGGG CAATGCGGCC
ACGACGATGG CCCCGACCCT GTGGGAATAC GACGCCTTCG GCAACAAAAC GAAAGAAACC
TGGAAACTCG CCGATCCGGC CACGACATCC AACTCGCGCA TCACCACGTG GAGCTACGGC
GTGGAACAGG CCCAGGATGA AGTATACCGC GTTGTTACGG CGACCAGGAA CAACAGCCGG
GGAACGACCT ATAACGAAAC GCAGAAAACG CTGGCTTCCT CCCTCTCGTC CACGCTGGAA
AGCAAAGTCA TTTCCATCGA CCCCAGGGGA AACGCTTCCG AACAATGGAG CGAATACGGT
CCGGGCGCCG TCCGGACGCA GAAAAGCAGC ATCCCCACCT CCGACATCAC GGCCGCCGCT
ACGGTCATCG ACGGTTTTAT CATCTCGCAA ACGGACCATG CGGGCGTCAC GGCCACGCAT
ACCCGCGCCT ACACGGAAAC CGGCGTCATC TACGCCAGCA CGGACGGCCG GGGCAACACG
GTCACGACGC ACACCGACCT TACCGGGCGC ACGATCTCGG TGACGGACGC GGCGGGCAAC
ACGACTTCTA CCGCCTACGG CCCCTGGTTT GACCAGCCTG CCGTCGTCAC CAACGCCCTG
GGCAACACGA CCTGCTACGG CTACGACCTC CGGGGCCGCA ACACGGCGCA ATGGGGAACG
AGGGCCCAGC CCCTGCTCTT CGGCTATGAC GAGGCGGACA GGATGATAAG CCTCACCACG
TTCCGGGAGG ACGCGGGCGA CATCACCGCC GACCCCACGG GACGCACGGA CGGGGACGTC
ACTACGTGGA GCTACGATGA CGCCACGGGC CTGCTCATCC GCAAAACCTG GGCGGACGGC
ACCCATGAAG ACACCGCCTA CAATGCCCTG AACTTCAAAT CCACGCTCAT GGACGCGCGG
GGGGTGGTCA CCACCTGGGG CTACAACCTG AAGAAGGGGG TCAACAACTC CGTCTCCTAC
AGCGACTCCA CGCCCGGCAT CCAGTACGCC TACAACCACC TCAACCAGCT GACCCAGGTC
ACGGACGCCT CCGGCTCGCG CGTCCTCACG TACACCCCCT GCAACGAACC GGACACCGAC
AGCATCACCA TCGGAGGGAG CTCTTACCAG CTCCAGGAAC ACTACGACAC TTACGGACGC
TCCTCCGGCT ATACCCTGAA ACAGGGAACC GACGTCCTCC AGGAAGCCAG CCAGGGCTAT
GAAACCGACG GAAGGCTGGC CAGCGCCGGA ATCAGGCACG GGGGAACGGA GCAAAGCTTC
GCCTACGGCT ACCTGGCAGG AAGCAGCCTG CTCTCCAGCC TTGCGATGCC CGACGGCATC
GTCCGGGAAC TTGCCTATGA ACAGCGCCGC AACCTGGTCA CGGCAATCAA CTGCCGCCTG
GGGGAAACCG TGCTGGTCTC CCGCAGCCAG GGCTACGATG CCCTGGGACG CCCGGTCACC
CGCACCCAGC AGCGTGGAAC GGAACCCGCC CGCAGCGACA GCTTCAGCTA CAACGGCAGA
AACGAACTCA CCGCCGCTAC CCTGGGCGCC GCCCCCTACG GCTACAGCTA CGACAACATC
GGCAACCGCA AGACGGCACG GGAACCGGCC GAAGAACTCG CCTACGCGGC CAACGGGCTC
AACCAGTACA CCGGCATTGA AGAAAGCGGG GAAGCTCCTT TTGTGCCGAC GTACGACGCC
TCGGGCAACC AGACCCTCAT CAAGACGTCA ACGGGCATCT GGACGGCCGT GTACAACGCG
GCCAACCGCG CGGTGAGCTT CACCAGCCGG GACGGCGCGA CAGTCGTGGA ATGCGGCTAC
GATTACCAGG GACGCCGCTA CATGAAGAAA GTGACCCAAA ACGGCACGGT CGCCAGCCAC
GAACGCTATC TATACCGCGG CTATTTACAA ATAGCGGCAT TGGATATGCT GGACAACCGT
AACGTGCTTC GCACGCTGTT GTGGGATCCT CTGGAACCGG TGGCCACCCG CCCCCTGGCC
CTCGCGCAGG GCGCTTCCCT GTACTGCTAC GGCATGGACT TCAACAAGAA TGTGTCGGAG
GTCTTCGACG CACAGGGAAC GATCGCGGCG GCTTACGACT ACTCGCCCTA TGGGATAGTT
GGCAGCACAG GCAACCTCGT CCAACCCGTA CAGTGGTCCG GCGAGATGCA CGACGAAGAA
CCCACCCTGG CCTATTATAA TTACCGCTTT TACAACCCCA AAGACGGCAG GTGGATCAAT
AGGGATCCCA TCGCTGAACA GGGAGGGTGG AATTTGTACG GGTTTGTTGA TAATGGAGTG
GTATTTTCTA TTGATTATCT CGGAAAAGAA ACTAACGAAT TTGGCTACAC AATGACTATT
CCTGCGGGTT ACATACCCAT TCTTCTTATT ATTGAGGGAA GTATATCAAT AGAAAAAAAG
AAAAATTGTG TATGCATTGA AGCAAAATTA CGAACAGATG TTGGTATAGG TGTAGGTATA
GGGCTTAAAA TTAAACAAAA ATGGTGGCCT ATTCTTCCTG ATATTGAATA TTCTTTAAAA
ACTATGATTG CTGGATTAAG TAATGAAAAA ATACTTAAAA TTGACAATTG TAATGGCAAA
ACTTTAACAT CCTATCGGGA AGAATTATTA GGATTTTCTC ACAGAATTGA TGCAGGCATT
ACACTATCAT CTTATATACA AGCATCTTAT TCTATCGATT TTAAGATAAG TTCTGGATTA
ACTTTGAACG CAAAACCAAT ATATTTAACT TTAGATGCAA CGGAATATAT TAAGATTGAC
GCTACTTTAA AAATCCCATT TATCATAGAT CAAAACGAAG AACTTGTTAA CAAATCTTTT
AATGAAAATA TAAAAATATG GTCTAAATAA
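
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be stripped before computing values such as length or GC content. A minimal sketch, assuming only the characters A, C, G and T are present; the helper names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence listing above.
first_line = "ATGAATCCGC CTCATCGTTC CTCCTCCGCC CATCGTTCTG CCGTTCCCGC TTACCGGAAA "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to the full listing should give a length and GC percentage comparable to the values reported under Gene Information.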
 
Protein sequence
MNPPHRSSSA HRSAVPAYRK WYFGSGASQS GSTRRVRLSG QPDPDPLPFE AIRIHEERKV 
GPPSTSADGH SAHGTFTIPE GAEGKKLYGT CSLFLGVDDW GILEVKDSGG NVVAQVDLKE
NPQTAGEQGG HKYHTGTGGA QLPSGTYSWE VSQTNIDYNP ASGNTSICNY SIDVVPTEPG
GRKEPEPCPC EGDTCDNSGG TPPSPPQARS CPEAGMESGA LGNYSSAGCS VTAESTATLM
YWSCNFGAFR GLGGLPAGRV ELRAEQNVSG LESPSSLAYN HPLNSRLDVP EGGIAPGVRF
NLVQGDRVIA MRCYTDGSVL PIGVDTSGGG RAALATVEGQ SCLRWVVEDG SQYLFSAETG
TLLSYTTTDR QVISNASSYL DVRHAGDGSL RQIWNLWDGL LNVENVTSTG YTIALYTPGQ
ITGTDEQGFY TVTGAPLKTF ILSLDAEEKF TITEQAPARQ PYAVTWWNDG LAWNMRQGTG
EDALTTLRTR TELEPENSVW QLVTEISKNG IVAARTCAIY QTTDVGDLLL TLAEGYGSPE
EQTTQYAYDQ CGRLRTETAP GGSQTHYAYD LYGRLLSRDE PWAESGRRIT RYTYACSGEA
DFSNEPATET ADLLPLEGHV KTLTSTTWKY TTANHIKRTE RRVTGLGVTG TRLTAEEQWL
AGAANIHARG RTRFSRDLDG VQTWHDYAAT TEHGALYTET VETRINGEAV PGQSTRAVTW
ITAEGQRVRE ENYLLLSTGQ WALTGSAVYE FDTQNRWVKR TAGNGRLTER ELMCDGGLLW
EIDENGIRTD YAYDTARQLV EVTRSAVMDG ETVITPETIT TYVRDAAGRV LSTRQDTGAM
TTRESATYDL LGRTTSTTDV LGRVTTYAYS QDGLTVTQTV PSGATFITRS APDGTVMEES
GTGQRHVIYA IDLVSDGVRT FTKAVSGETQ TELQRSIVNG AGETLRTGVP NTTGGVIYTR
NTYNARGQLT KTQTDAGNAA TTMAPTLWEY DAFGNKTKET WKLADPATTS NSRITTWSYG
VEQAQDEVYR VVTATRNNSR GTTYNETQKT LASSLSSTLE SKVISIDPRG NASEQWSEYG
PGAVRTQKSS IPTSDITAAA TVIDGFIISQ TDHAGVTATH TRAYTETGVI YASTDGRGNT
VTTHTDLTGR TISVTDAAGN TTSTAYGPWF DQPAVVTNAL GNTTCYGYDL RGRNTAQWGT
RAQPLLFGYD EADRMISLTT FREDAGDITA DPTGRTDGDV TTWSYDDATG LLIRKTWADG
THEDTAYNAL NFKSTLMDAR GVVTTWGYNL KKGVNNSVSY SDSTPGIQYA YNHLNQLTQV
TDASGSRVLT YTPCNEPDTD SITIGGSSYQ LQEHYDTYGR SSGYTLKQGT DVLQEASQGY
ETDGRLASAG IRHGGTEQSF AYGYLAGSSL LSSLAMPDGI VRELAYEQRR NLVTAINCRL
GETVLVSRSQ GYDALGRPVT RTQQRGTEPA RSDSFSYNGR NELTAATLGA APYGYSYDNI
GNRKTAREPA EELAYAANGL NQYTGIEESG EAPFVPTYDA SGNQTLIKTS TGIWTAVYNA
ANRAVSFTSR DGATVVECGY DYQGRRYMKK VTQNGTVASH ERYLYRGYLQ IAALDMLDNR
NVLRTLLWDP LEPVATRPLA LAQGASLYCY GMDFNKNVSE VFDAQGTIAA AYDYSPYGIV
GSTGNLVQPV QWSGEMHDEE PTLAYYNYRF YNPKDGRWIN RDPIAEQGGW NLYGFVDNGV
VFSIDYLGKE TNEFGYTMTI PAGYIPILLI IEGSISIEKK KNCVCIEAKL RTDVGIGVGI
GLKIKQKWWP ILPDIEYSLK TMIAGLSNEK ILKIDNCNGK TLTSYREELL GFSHRIDAGI
TLSSYIQASY SIDFKISSGL TLNAKPIYLT LDATEYIKID ATLKIPFIID QNEELVNKSF
NENIKIWSK
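
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below checks this on the first and last 30 bases of the gene. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` helper is illustrative only.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAATCCGC CTCATCGTTC CTCCTCCGCC"))  # MNPPHRSSSA
print(translate("AATGAAAATA TAAAAATATG GTCTAAATAA"))  # NENIKIWSK
```

Both outputs match the start and end of the protein sequence listed above.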