Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0161 |
Symbol | |
ID | 6274966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 199559 |
End bp | 200563 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642612206 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_001876786 |
Protein GI | 187734674 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.458265 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAGA AACTTAAACC GGAAACCTGG GTAGGCATCT TTCTGATCGC CGGGATCCTG ATGATTATCG GAGTCATCCT GGGATTCGGC AACATCAAAA CATCCAAGGA GCAAACCTAC CCCATCAACA TCATTTTCAA AGATGCGGCG GGGCTCATCA AGGATTCCCA GATACGCCTG GGCGGCGTCA CGGTGGGGAA AGTCACCAAA GCCCCTGAAC TCCTGCCTTC CGGCAATGAA GTCATGCTGG AAGCCAGCAT TCAGAGCGAC GTGAAAATCC AGCAAGGGTC CGTCTTCCGG GTGGACATGC AGAACATCCT GGGCGACAAA TACATCGATA TCGTCCCTCC GGCCCAGCCC ACGCATGAAT ACATCCTTCC CCATGCCACC ATCATCGGTC AGCCGGAAAG CGATTTCAGC AAAATCAAAA ACAATGCCGT GGGCGCCACA GAGGAAATCC TGAAAATCCT CAAGCAAATC GAGAAAAATT CGGACAACAT TGACGACGCC ATCCTCAATA TCGGAGAGGC GGCCAAAGGG CTGGCCCAGA CGACCAGGCT GATCAACGAG GGCATCCTGA ACCGGGAGAA CATCCAGAAC CTCGGCAGCG TACTGTCTCA AATAAACCGG GCGGGAGAAC AGCTCCCCGG CCTCATGGAA GAAACGCGCT CCTCGGTCGC GGCCATGAAA GACACCGTCC GGGATGCCCG CAAACTCATT GCCGGAGCTG AAGAAAAGCT CAACACTCTG GACCCCGCCG TCAAGGCAAT CCCCCCTACG CTGGCCGCTC TGAGGAAAGC ATCTGAACAA ATCTCCTCCT TCACGGCTGA TGCACGTAAA AACCAGGGCT TTCTTGGGCT GCTTATGTAC GATGCTCGCT TCCGGGCCAA CGCTCAGGAA TTCATCCGCA ATCTGAGGGA TTACGGCATT CTGAGGTACC GGAATCCTAA CGAACCCCAA GTCAAACCGG ACCCCCGCGG CGGTTTTTCA GGAAGCCGCC GCTGA
|
Protein sequence | MKQKLKPETW VGIFLIAGIL MIIGVILGFG NIKTSKEQTY PINIIFKDAA GLIKDSQIRL GGVTVGKVTK APELLPSGNE VMLEASIQSD VKIQQGSVFR VDMQNILGDK YIDIVPPAQP THEYILPHAT IIGQPESDFS KIKNNAVGAT EEILKILKQI EKNSDNIDDA ILNIGEAAKG LAQTTRLINE GILNRENIQN LGSVLSQINR AGEQLPGLME ETRSSVAAMK DTVRDARKLI AGAEEKLNTL DPAVKAIPPT LAALRKASEQ ISSFTADARK NQGFLGLLMY DARFRANAQE FIRNLRDYGI LRYRNPNEPQ VKPDPRGGFS GSRR
|
| |