Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0537 |
Symbol | |
ID | 6275044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 632466 |
End bp | 633545 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642612587 |
Product | peptidase M42 family protein |
Protein accession | YP_001877156 |
Protein GI | 187735044 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.349139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATTA GCGACGACAG CCTTGAATTC TTAACGGAAC TTCTGGAAAC CCCCAGCCCT TCCGGCTTTG AAATAGATGC CCAGCGCATC TGGGCGGACG AACTGCGCAA ATATACGGAA GACGTCCAGT GCGACACTTA CGGCAATACC TGGGCCGTCT TCCACGCGGA CGCGGAAGAA GCCCCCACAT TGATGATTGA AGCCCACGCG GATGAAATCG GCTTCATGAT CCGCCATATC ACCAAGGACG GCTTCCTGTA TGTGGAACGC GTAGGCGGCA CGGATACGGC CATCGCACGG GGGCGCCGCG TGCGCTTCCT GGGTTCCCAG GGAGAAGTGA TGGGGGTGAC CGGAAACACG GCCATCCACT TGCGGGAACC CGGAGAGAAG GAACCCAAAA TCTGGGAAAT TTACGTTGAT GTAGGCGCCT CCTCCGACAA GGAAGTAGCG GAACTCGGTT TGCGCGTGGG CCATGTGGGC GTTTACTGCG ACGGCCCCAT GCTGATGAAT GAAAACAAGC TGGTATGCCG GGCTCTGGAC AACCGGCTGA GCGGCTTCAT CCTGTCGGAA ATAGCCCGCA AGCTGTGCAA GCTGAAAAAG CCCGTCTCCT GGAACGTGGT GCTCGTCAAT GCCGTGCAGG AAGAAGTGGG CTGCATTGGC GCGGGAATGA TTACCCACCG CCTGCGCCCG GATGCGGCTA TCTGCATAGA CGTGACTCAT GCCACGGACT CGCCCGGACT GGACAAGGGC AAATTTGGCG ATATCAGGCT TGGCGGCGGC CCTGCGGTCA TCCACGGCAC GGCCAACCAT CCCAATCTGG TGGCCCGTCT GGAAATCGTG GCGGACAAGA ACAAAATACC CCTCCAGCAT GAAGCCGCCG GACGCCGCAC CGGAACGGAT ACGGACAGTA TCTACATCTC CCGCGACGGC GTAGCCTCCG CGCTGGTGTC CGTCCCCCTG CGCTATATGC ACTCCCCGGT GGAAACGGCC TCTCTGACAG ATGTGGAAAA TACAATCAAG CTGCTGCTGG AATTGGTCAA ATCCCTGATG CCGGGAGACT CTTTCGGGCA CAAGCTGTAA
|
Protein sequence | MKISDDSLEF LTELLETPSP SGFEIDAQRI WADELRKYTE DVQCDTYGNT WAVFHADAEE APTLMIEAHA DEIGFMIRHI TKDGFLYVER VGGTDTAIAR GRRVRFLGSQ GEVMGVTGNT AIHLREPGEK EPKIWEIYVD VGASSDKEVA ELGLRVGHVG VYCDGPMLMN ENKLVCRALD NRLSGFILSE IARKLCKLKK PVSWNVVLVN AVQEEVGCIG AGMITHRLRP DAAICIDVTH ATDSPGLDKG KFGDIRLGGG PAVIHGTANH PNLVARLEIV ADKNKIPLQH EAAGRRTGTD TDSIYISRDG VASALVSVPL RYMHSPVETA SLTDVENTIK LLLELVKSLM PGDSFGHKL
|
| |