Gene Amuc_2152 details


Gene Information

Locus tag: Amuc_2152
Symbol:
ID: 6273736
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2620771
End bp: 2626509
Gene Length: 5739 bp
Protein Length: 1912 aa
Translation table: 11
GC content: 60%
IMG OID: 642614213
Product: YD repeat protein
Protein accession: YP_001878741
Protein GI: 187736629
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
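
The start, end, gene length and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 2620771
end_bp = 2626509

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 5739 0 1912
```

Under these assumptions the computed values agree with the listed gene length (5739 bp) and protein length (1912 aa).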


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0482725
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCCCC CTCATCGTTC TGCCGTTCCC GCTTACCGGA AATGGTACTT TGGAGCCGGC 
GGCAGCGTCG GCAGGCGGGG CATGGAGGCA GCTCCGGAGC CCGATCCGCT ACCGTTTGAA
GCGATCCGGA TCCATGGGGA AAGAACCGTG GGGCCGCCGA GCACGTCGGC GGACGGGCAC
AGCGCGCACG GGACATTCAC CATCCCGGAA GGGGAGGGGG GAAAAAAGCT CTACGGCACC
TGCTCGCTCT TCCTTGGGGT GGACGACTGG GGAAGCCTGG AAGTAAAGGA CTCCGGCGGC
AACGTGGTGG CGCAGGTGGA TCTGAAAGAA AACCCGCAGA CGGCGGGCGA ACAGGGCGGG
CACAAATACC ACACGGGGAC TGGCGGGGCG CAGCTCCCCT CCGGCACCTA CAGCTGGGAA
GTCAGCCAGA CCAACATCGA CTACAATCCG GCAAGCGGCA ACACCTCCAT CTGCAACTAC
AGCATCGACG TGGTGCCGAC GGAACCGGGG GGCAGGAAAG AACCCGAACC GTGTCCCTGC
GAGGGAGACA CGTGCGACAA CAGCGGCGGA ACGCCGCCCT CCCCGCCGCA GGCCCGGTCG
TGCCCTGAGG CGGGCATGGA AAGCGGCGCC CTGGGAAACT ACAGCTCGGC GGGGTGCAGC
GTGACGGCGG AAAGCACGGC CACGCTGATG TACTGGTCCT GCAACTTCGG AGCGTTCCGT
GGACTTGGGG GCCTTCCGGC CGGAAGGGTG GAACTGAGGG CTGAACAAAA CGTCTCCGGC
CTGGAAAGCC CCTCCTCGCT GGCCTACAAC CATCCCCTGA ACAGCCGTCT GGACGTGCCG
GAAGGGGGCA TTGCGCCGGG AGTGCGGTTC AACCTGGTGC AGGGAGACCG GGTGATTGCC
ATGCGCTGCT ACACGGACGG GTCCGTGCTG CCCATCGGGG TGGACACGTC GGGCGGAGGG
CGTGCGGCGC TGGCCACGGT GGAAGGACAA TCCTGCCTGC GCTGGGTGGT GGAAGACGGC
AGCCAATACC TCTTCTCGGC GGAAACGGGA ACGCTCCTCT CCTACACCAC CACGGACAGG
CAGGTCATCT CCAACGCGTC ATCCTATCTG GACGTCAGGC ATGCCGGAGA CGGCTCGCTG
AGGCAAATCT GGAACCTGTG GGACGGCCTG CTCAACGTGG AAAACGTCAC CTCCACGGGC
TACACCATCG CGCTCTATAC CCCTGGGCAA ATCACCGGAA CGGACGAACA GGGATTTTAT
ACCGTTACGG GCGCTCCCTT GAAAACATTT ATTCTTTCCC TGGATGCTGA GGAAAAGTTC
ACCATCACGG AACAGGCGCC TGCCAGGCAG CCCTACGCCG TCACCTGGTG GAACGACGGC
CTGGCGTGGA ACATGCGGCA GGGCACGGGG GAAGACGCCC TCACGACCCT CCGCACGCGC
ACGGAGCTGG AACCGGAAAA CTCGGTCTGG CAGCTGGTCA CGGAAATCTC CAAAAACGGA
ATCGTGGCGG CGCGCACCTG CGCCATCTAC CAGACCACGG ACGTGGGCGA CCTGCTGCTC
ACGCTGGCGG AAGGCTACGG AAGCCCGGAG GAGCAAACCA CGCAATACGC CTACGACCAG
TGCGGACGGC TCAGAACGGA AACGGCCCCG GGCGGCAGCC AGACTCATTA CGCCTATGAC
CTCTACGGCC GCCTGCTCAG CCGGGACGAA CCGTGGGCGG AAAGCGGCAG GCGCATCACG
CGCTACACCT ACGCCTGTTC GGGAGAAGCC GACTTCAGCA ACGAACCCGC CACGGAAACG
GCAGACCTGC TTCCGCTGGA AGGACACGTC AAAACGCTGA CATCCACCAC CTGGAAATAC
ACGACGGCCA ACCACATCAA AAGAACGGAA CGGCGGGTCA CCGGACTGGG CGTGACGGGC
ACGCGCCTGA CGGCGGAGGA ACAATGGCTG GCCGGAGCCG CCAACATCCA TGCCCGCGGA
CGCACGCGGT TCAGCCGGGA CCTCGACGGC GTGCAAACGT GGCACGACTA CGCGGCCACG
ACGGAGCACG GCGCCCTCTA CACGGAAACG GTGGAAACGC GCATCAACGG AGAAGCCGTG
CCGGGACAAA GCACGCGCGC CGTCACCTGG ATCACGGCGG AAGGGCAGCG CGTCAGGGAA
GAAAACTACC TCCTGCTTTC CACCGGGCAA TGGGCGCTCA CGGGCAGCGC CGTCTACGAA
TTTGACACGC AGAACCGGTG GGTGAAGCGG ACGGCGGGCA ACGGCCGGCT CACGGAACGC
GAACTGATGT GCGACGGAGG CCTGCTGTGG GAAATCGATG AAAACGGCAT CAGGACGGAC
TACGCCTACG ACACGGCGCG CCAACTGGTG GAAGTCACGC GTTCCGCCGT GATGGACGGG
GAAACCGTCA TCACGCCGGA AACCATCACC ACCTACGTCC GGGATGCGGC AGGGCGCGTA
CTCTCCACGC GTCAAGACAC GGGGGCGATG ACCACGCGGG AAAGCGCCAC CTACGACCTT
CTTGGCAGAA CAACCTCCAC CACGGACGTC CTGGGCCGGG TCACTACCTA CGCCTACAGC
CAGGACGGCT TGACGGTCAC GCAAACCGTC CCTTCCGGGG CTACATTCAT CACGCGCAGC
GCGCCGGACG GAACGGTGAT GGAAGAATCC GGCACGGGGC AGCGGCACGT CATCTACGCC
ATCGACCTGG TCAGCGACGG TGTGCGGACC TTCACGAAAG CCGTCTCCGG GGAAACGCAA
ACCGAGCTGC AGCGCAGCAT TGTCAACGGA GCCGGGGAAA CCCTGCGCAC GGGCGTCCCC
AACACCACCG GTGGCGTCAT TTACACGAGG AACACCTACA ACGCCAGGGG GCAGCTCACC
AAAACGCAGA CGGACGCGGG CAATGCGGCC ACGACGATGG CCCCGACCCT GTGGGAATAC
GACGCCTTCG GCAACAAAAC GAAAGAAACC TGGAAACTCG CCGATCCGGC CACGACATCC
AACTCGCGCA TCACCACGTG GAGCTACGGC GTGGAACAGG CCCAGGATGA AGTATACCGC
GTTGTTACGG CGACCAGGAA CAACAGCCGG GGAACGACCT ATAACGAAAC GCAGAAAACG
CTGGCTTCCT CCCTCTCGTC CACGCTGGAA AGCAAAGTCA TTTCCATCGA CCCCAGGGGA
AACGCTTCCG AACAATGGAG CGAATACGGT CCGGGCGCCG TCCGGACGCA GAAAAGCAGC
ATCCCCACCT CCGACATCAC GGCCGCCGCT ACGGTCATCG ACGGTTTTAT CATCTCGCAA
ACGGACCATG CGGGCGTCAC GGCCACGCAT ACCCGCGCCT ACACGGAAAC CGGCGTCATC
TACGCCAGCA CGGACGGCCG GGGCAACACG GTCACGACGC ACACCGACCT TACCGGGCGC
ACGATCTCGG TGACGGACGC GGCGGGCAAC ACGACTTCTA CCGCCTACGG CCCCTGGTTT
GACCAGCCTG CCGTCGTCAC CAACGCCCTG GGCAACACGA CCTGCTACGG CTACGACCTC
CGGGGCCGCA ACACGGCGCA ATGGGGAACG AGGGCCCAGC CCCTGCTCTT CGGCTATGAC
GAGGCGGACA GGATGATAAG CCTCACCACG TTCCGGGAGG ACGCGGGCGA CATCACCGCC
GACCCCACGG GACGCACGGA CGGGGACGTC ACTACGTGGA GCTACGATGA CGCCACGGGC
CTGCTCATCC GCAAAACCTG GGCGGACGGC ACCCATGAAG ACACCGCCTA CAATGCCCTG
AACTTCAAAT CCACGCTCAT GGACGCGCGG GGGGTGGTCA CCACCTGGGG CTACAACCTG
AAGAAGGGGG TCAACAACTC CGTCTCCTAC AGCGACTCCA CGCCCGGCAT CCAGTACGCC
TACAACCACC TCAACCAGCT GACCCAGGTC ACGGACGCCT CCGGCTCGCG CGTCCTCACG
TACACCCCCT GCAACGAACC GGACACCGAC AGCATCACCA TCGGAGGGAG CTCTTACCAG
CTCCAGGAAC ACTACGACAC TTACGGACGC TCCTCCGGCT ATACCCTGAA ACAGGGAACC
GACGTCCTCC AGGAAGCCAG CCAGGGCTAT GAAACCGACG GAAGGCTGGC CAGCGCCGGA
ATCAGGCACG GGGGAACGGA GCAAAGCTTC GCCTACGGCT ACCTGGCAGG AAGCAGCCTG
CTCTCCAGCC TTGCGATGCC CGACGGCATC GTCCGGGAAC TTGCCTATGA ACAGCGCCGC
AACCTGGTCA CGGCAATCAA CTGCCGCCTG GGGGAAACCG TGCTGGTCTC CCGCAGCCAG
GGCTACGATG CCCTGGGACG CCCGGTCACC CGCACCCAGC AGCGTGGAAC GGAACCCGCC
CGCAGCGACA GCTTCAGCTA CAACGGCAGA AACGAACTCA CCGCCGCTAC CCTGGGCGCC
GCCCCCTACG GCTACAGCTA CGACAACATC GGCAACCGCA AGACGGCACG GGAACCGGCC
GAAGAACTCG CCTACGCGGC CAACGGGCTC AACCAGTACA CCGGCATTGA AGAAAGCGGG
GAAGCTCCTT TTGTGCCGAC GTACGACGCC TCGGGCAACC AGACCCTCAT CAAGACGTCA
ACGGGCATCT GGACGGCCGT GTACAACGCG GCCAACCGCG CGGTGAGCTT CACCAGCCGG
GACGGCGCGA CAGTCGTGGA ATGCGGCTAC GATTACCAGG GACGCCGCTA CATGAAGAAA
GTGACCCAAA ACGGCACGGT CGCCAGCCAC GAACGCTATC TATACCGCGG CTATTTACAA
ATAGCGGCAT TGGATATGCT GGACAACCGT AACGTGCTTC GCACGCTGTT GTGGGATCCT
CTGGAACCGG TGGCCACCCG CCCCCTGGCC CTCGCGCAGG GCGCTTCCCT GTACTGCTAC
GGCATGGACT TCAACAAGAA TGTGTCGGAG GTCTTCGACG CACAGGGAAC GATCGCGGCG
GCTTACGACT ACTCGCCCTA TGGGATAGTT GGCAGCACAG GCAACCTCGT CCAACCCGTA
CAGTGGTCCG GCGAGATGCA CGACGAAGAA CCCACCCTGG CCTATTATAA TTACCGCTTT
TACAACCCCA AAGACGGCAG GTGGATCAAT AGGGATCCCA TCGCTGAACA GGGAGGATGG
AATTTTTATG CGTTCGTAGG GAACAGCCCT CAAGATAAGT TTGATGCTTT GGGGTTAGAA
GATAAGAAAA AAGATAAAGA ATTTCTTGGA TATGTTTATG ATCAAACTTT AGAAGGAACA
GACATATACA TTTGTGAAAC AACAGGATTG AAAATAAATG ACAAAATTTT AAATAATACA
GTTTCAGAAT CCATGAACGC TTTTAAGGAA GCTAATAGTG CTAATCAAGC GGCAAGCAAT
ATTAGATATG CAAAAGATTT TAGTAAAAAA GTGTCTAAAC TTTATAAAGT AACTCCTGGA
ACAAAAATTC CCGGAGTAAA AATTTCTTCA ATTAAAGATA TAATTGATTT TGTTTATATC
AAAAGAGATG AAAAAGTTGC ATATCAACAA TTTTATGAAG CTGCTACACG ATATCAAACG
CTAAAGAAAA AGACATTCTC AAGTTGTTTT GAATTATGCG TAGCCATGGG AGAGTATGCA
AAAGTATTTT TAAAAAATGA TTTAGGAAAA GGAATAGTTA AATTTTCTAC TGACTATTGT
ATTTCTAAAT GTAATAAAAG GCATAAAACA TTAGATTAA
 
Protein sequence
MNPPHRSAVP AYRKWYFGAG GSVGRRGMEA APEPDPLPFE AIRIHGERTV GPPSTSADGH 
SAHGTFTIPE GEGGKKLYGT CSLFLGVDDW GSLEVKDSGG NVVAQVDLKE NPQTAGEQGG
HKYHTGTGGA QLPSGTYSWE VSQTNIDYNP ASGNTSICNY SIDVVPTEPG GRKEPEPCPC
EGDTCDNSGG TPPSPPQARS CPEAGMESGA LGNYSSAGCS VTAESTATLM YWSCNFGAFR
GLGGLPAGRV ELRAEQNVSG LESPSSLAYN HPLNSRLDVP EGGIAPGVRF NLVQGDRVIA
MRCYTDGSVL PIGVDTSGGG RAALATVEGQ SCLRWVVEDG SQYLFSAETG TLLSYTTTDR
QVISNASSYL DVRHAGDGSL RQIWNLWDGL LNVENVTSTG YTIALYTPGQ ITGTDEQGFY
TVTGAPLKTF ILSLDAEEKF TITEQAPARQ PYAVTWWNDG LAWNMRQGTG EDALTTLRTR
TELEPENSVW QLVTEISKNG IVAARTCAIY QTTDVGDLLL TLAEGYGSPE EQTTQYAYDQ
CGRLRTETAP GGSQTHYAYD LYGRLLSRDE PWAESGRRIT RYTYACSGEA DFSNEPATET
ADLLPLEGHV KTLTSTTWKY TTANHIKRTE RRVTGLGVTG TRLTAEEQWL AGAANIHARG
RTRFSRDLDG VQTWHDYAAT TEHGALYTET VETRINGEAV PGQSTRAVTW ITAEGQRVRE
ENYLLLSTGQ WALTGSAVYE FDTQNRWVKR TAGNGRLTER ELMCDGGLLW EIDENGIRTD
YAYDTARQLV EVTRSAVMDG ETVITPETIT TYVRDAAGRV LSTRQDTGAM TTRESATYDL
LGRTTSTTDV LGRVTTYAYS QDGLTVTQTV PSGATFITRS APDGTVMEES GTGQRHVIYA
IDLVSDGVRT FTKAVSGETQ TELQRSIVNG AGETLRTGVP NTTGGVIYTR NTYNARGQLT
KTQTDAGNAA TTMAPTLWEY DAFGNKTKET WKLADPATTS NSRITTWSYG VEQAQDEVYR
VVTATRNNSR GTTYNETQKT LASSLSSTLE SKVISIDPRG NASEQWSEYG PGAVRTQKSS
IPTSDITAAA TVIDGFIISQ TDHAGVTATH TRAYTETGVI YASTDGRGNT VTTHTDLTGR
TISVTDAAGN TTSTAYGPWF DQPAVVTNAL GNTTCYGYDL RGRNTAQWGT RAQPLLFGYD
EADRMISLTT FREDAGDITA DPTGRTDGDV TTWSYDDATG LLIRKTWADG THEDTAYNAL
NFKSTLMDAR GVVTTWGYNL KKGVNNSVSY SDSTPGIQYA YNHLNQLTQV TDASGSRVLT
YTPCNEPDTD SITIGGSSYQ LQEHYDTYGR SSGYTLKQGT DVLQEASQGY ETDGRLASAG
IRHGGTEQSF AYGYLAGSSL LSSLAMPDGI VRELAYEQRR NLVTAINCRL GETVLVSRSQ
GYDALGRPVT RTQQRGTEPA RSDSFSYNGR NELTAATLGA APYGYSYDNI GNRKTAREPA
EELAYAANGL NQYTGIEESG EAPFVPTYDA SGNQTLIKTS TGIWTAVYNA ANRAVSFTSR
DGATVVECGY DYQGRRYMKK VTQNGTVASH ERYLYRGYLQ IAALDMLDNR NVLRTLLWDP
LEPVATRPLA LAQGASLYCY GMDFNKNVSE VFDAQGTIAA AYDYSPYGIV GSTGNLVQPV
QWSGEMHDEE PTLAYYNYRF YNPKDGRWIN RDPIAEQGGW NFYAFVGNSP QDKFDALGLE
DKKKDKEFLG YVYDQTLEGT DIYICETTGL KINDKILNNT VSESMNAFKE ANSANQAASN
IRYAKDFSKK VSKLYKVTPG TKIPGVKISS IKDIIDFVYI KRDEKVAYQQ FYEAATRYQT
LKKKTFSSCF ELCVAMGEYA KVFLKNDLGK GIVKFSTDYC ISKCNKRHKT LD
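
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any computation. The sketch below is illustrative only and is not part of the original record. It joins the blocks, computes a GC fraction, and translates in frame 1. The function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (it differs from table 1 only in the permitted start codons).

```python
from itertools import product

# Amino-acid assignments of the standard genetic code in NCBI's TCAG codon order.
# Translation table 11 uses the same assignments; only the allowed start codons differ.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence listed above.
head = clean_sequence("ATGAATCCCC CTCATCGTTC TGCCGTTCCC")
tail = clean_sequence("ACATTAGATTAA")
print(translate(head))  # MNPPHRSAVP
print(translate(tail))  # TLD
```

The two translated fragments match the start and end of the listed protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.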