Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1515 |
Symbol | |
ID | 6275717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1808657 |
End bp | 1809907 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613574 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_001878117 |
Protein GI | 187736005 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.0000000845442 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAAATGGC TGAATGACTG GCTGGGCTTT TTCAAGCCCC TTCAGGTGCG GAACCTGCAA ATATTCTGGT CAGGGCAGGC TTCGGCCCTG ATTGGAATGT GGTTGCAGGT GACGGCCATG GGTATTCTTG TGTACGATAT TTCCGGAGGC TCCGCCACGG CAGTGGGGGT GTTGGCTGCC TTGAACGCTC TTCCTTTCTT TCTGGGCGGC ATGCTGCTTG CCGGGCTGGG GGACCGTTTT GACCGGAGGA AACTCCTCAT AGCCGTGCAG TGCGTCCAGT GGCTGGTGGC TGTAGCCCTG TTTCTGCTGA CTGTGCTGGA TATGCTCCAG CTGTGGCACT TATACGCCGC CGGGCTGGTG ATGGGCGTCA ATCAGACGGT GGGTTTTCCC ACACAGCAGG CTTTTGTGGG AGACCTGATA CCGCGCCGGC AATTACAGGA GGCCGTGGGC ATGTATTCTC TGGTATTTAA TACGTGCCGG GCTATAGGGC CGGCGCTTGC CGGCTACATT ATTGCGGAAT GGGGCGCCGG AACCGCCTTT GGCGGCAATG TGGCGGCAAG CCTTCCGCTG GTTGGCTGCC TGGTTGCCTT AAAGGGACGC GTTGCGGATA CCTCCGCTCC CAGAAAACAG CGCGGCGCAG GAAGGAAAGC ATCCGGCCTC AAGGCGGTGC TTGCCACGCG CAGCCTGCTT TTCATCATGA TCAGCGCTCT GATTCAGAAT ATCTGCGGGC AATCTCTCTA TCAGATTGTT CCGGCATTGA TGCACGGAAA TCCCAGGAAT ACCGGCCTCA TTCTGGGTGC GGTCGGAGCC GGAGCCATGG TGAGCATCCT CTTTGTCCTG CCGTTTGCAC GCAAAAGCGA CAGGGTGGGA GCCAAGCTTT CCTCAGGCAC CCTGTGGATG GGATGCGCAC TGTGCGTGGC AGGCGCCATC CCGGTGGTGG AAGTGCAGGC CCTCTGTTTC TTTTTTGCCG GTTTGGCCAC TTCTTCCCTC TTTGTTACTT CTTCATCCGC CGTGCAGCTT CTGTCGCCCC CGGAACGCAA GTCGGCCATT CTGGGGCTGT TCAGCATTGT CACCATCGGT GTGCAGCCGC TGGCGGCCAT GGGGTGGGGA GCTGTGGTGG ATGCCTGGGG TGTCCAGATG ACGATTGTCG TGGCGGGGGG GCTGGAAGCC TTGTTTTCCA TTTGGATGCT GGGCGTTCCA TTTTGGCGTC ATTTCAAATT TTCTCCGGAT GATTGTCCGG AGACGACATA G
|
Protein sequence | MKWLNDWLGF FKPLQVRNLQ IFWSGQASAL IGMWLQVTAM GILVYDISGG SATAVGVLAA LNALPFFLGG MLLAGLGDRF DRRKLLIAVQ CVQWLVAVAL FLLTVLDMLQ LWHLYAAGLV MGVNQTVGFP TQQAFVGDLI PRRQLQEAVG MYSLVFNTCR AIGPALAGYI IAEWGAGTAF GGNVAASLPL VGCLVALKGR VADTSAPRKQ RGAGRKASGL KAVLATRSLL FIMISALIQN ICGQSLYQIV PALMHGNPRN TGLILGAVGA GAMVSILFVL PFARKSDRVG AKLSSGTLWM GCALCVAGAI PVVEVQALCF FFAGLATSSL FVTSSSAVQL LSPPERKSAI LGLFSIVTIG VQPLAAMGWG AVVDAWGVQM TIVVAGGLEA LFSIWMLGVP FWRHFKFSPD DCPETT
|
| |