Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0867 |
Symbol | |
ID | 6274301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1035676 |
End bp | 1036824 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642612922 |
Product | peptidase M42 family protein |
Protein accession | YP_001877481 |
Protein GI | 187735369 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.937418 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTGG AACTGCTTCG TCAGGTTTGC GTGACTCCGG GCGCTCCGGG GTTTGAAGAC AAAATCCGCG ATTTCATCAT CCAGGAAGTG GCCCCGCTGG TGGACGCCGT GCGCGTGGAC AACATGGGCA GCGTGATTGC CATCGTGGAA GGCAAAAACA CGGAAAAAAC CATGATGGCC GCCGCCCACA TGGATGAAAT CGGGTTCATG GTCCGTCACA TCGACGACAA GGGGTTCATC AAATTCCTGC CACTGGGCGG CTTTGACGCC AAGACGCTGA CGGCCCAGCG CGTCATCGTC CACGGTAAAA AAGACCTCAT CGGCGTCATG GGCGTGAAAC CCATCCACGT CATGTCCCCG GCGGAACGTA CCAAGCTGCC GGAAGTGACC GACTTCTTCA TTGACCTGGG CATGAGCAAG GAGGAAGTGG AAAAATACGT TTCCGTAGGC GACTCCGTTA CCCGTGAACG GGATTTGGTG GAAATGGGGG ATTGCGTGAA CGTCAAATCT CTGGACAACC GCGCCGGATG CTACGTGCTG ATTGAAGCCC TCCGCGCCAT CAAGGCTTCC AGGAAGAAAC CCTCCTGCAA CTTCGTGGCC GCCTTTACCG TTCAGGAGGA AGTGGGCCTG AGGGGCGCGC AGGCCGGCAC GCTGGACATC CAACCGGATT TCTCCATTGC CCTGGATGTC ACCATCGCCT GTGACATTCC CGGAACTCCG GCGCACGACC AGGTTTCCCA CCTGGGCGCA GGCGCCGCCA TCAAGCTGTA TGACGGTTCC GTCATTGCAG ACCGCCGCAT GGTCAAGTTC ATGAAGGCCA TGGCAGACGC CAACAAAATT AAATGGCAGA CGGAAATGCT GCCGGCGGGA GGCACGGATG CCGGAGCCAT GCAGAAATTC GTTCCGGGCG GTTCCATTGC CGGGGCCATT TCCGTTCCCA CCCGCAATGT GCACCAGGTT ATTGAAATGG CTCACAAAGA CGACCTGGAC GCTTCCGTAG CGCTTCTGAC CGCCTGCGCC ATGAACGTGG ACAAATGGGA CTGGTCCTGG AACTCCGTCA ACGAATGCCC GGCGGAAAAA CCCGCCAAGG CCGCAAAAAC GGCAGCCAAG CCCGCCAAAG CCGCTGCCAA GAAGAAAAAG GCCAAATAA
|
Protein sequence | MNLELLRQVC VTPGAPGFED KIRDFIIQEV APLVDAVRVD NMGSVIAIVE GKNTEKTMMA AAHMDEIGFM VRHIDDKGFI KFLPLGGFDA KTLTAQRVIV HGKKDLIGVM GVKPIHVMSP AERTKLPEVT DFFIDLGMSK EEVEKYVSVG DSVTRERDLV EMGDCVNVKS LDNRAGCYVL IEALRAIKAS RKKPSCNFVA AFTVQEEVGL RGAQAGTLDI QPDFSIALDV TIACDIPGTP AHDQVSHLGA GAAIKLYDGS VIADRRMVKF MKAMADANKI KWQTEMLPAG GTDAGAMQKF VPGGSIAGAI SVPTRNVHQV IEMAHKDDLD ASVALLTACA MNVDKWDWSW NSVNECPAEK PAKAAKTAAK PAKAAAKKKK AK
|
| |