Gene Amuc_1515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1515 
Symbol 
ID6275717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1808657 
End bp1809907 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content58% 
IMG OID642613574 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001878117 
Protein GI187736005 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0000000845442 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGAAATGGC TGAATGACTG GCTGGGCTTT TTCAAGCCCC TTCAGGTGCG GAACCTGCAA 
ATATTCTGGT CAGGGCAGGC TTCGGCCCTG ATTGGAATGT GGTTGCAGGT GACGGCCATG
GGTATTCTTG TGTACGATAT TTCCGGAGGC TCCGCCACGG CAGTGGGGGT GTTGGCTGCC
TTGAACGCTC TTCCTTTCTT TCTGGGCGGC ATGCTGCTTG CCGGGCTGGG GGACCGTTTT
GACCGGAGGA AACTCCTCAT AGCCGTGCAG TGCGTCCAGT GGCTGGTGGC TGTAGCCCTG
TTTCTGCTGA CTGTGCTGGA TATGCTCCAG CTGTGGCACT TATACGCCGC CGGGCTGGTG
ATGGGCGTCA ATCAGACGGT GGGTTTTCCC ACACAGCAGG CTTTTGTGGG AGACCTGATA
CCGCGCCGGC AATTACAGGA GGCCGTGGGC ATGTATTCTC TGGTATTTAA TACGTGCCGG
GCTATAGGGC CGGCGCTTGC CGGCTACATT ATTGCGGAAT GGGGCGCCGG AACCGCCTTT
GGCGGCAATG TGGCGGCAAG CCTTCCGCTG GTTGGCTGCC TGGTTGCCTT AAAGGGACGC
GTTGCGGATA CCTCCGCTCC CAGAAAACAG CGCGGCGCAG GAAGGAAAGC ATCCGGCCTC
AAGGCGGTGC TTGCCACGCG CAGCCTGCTT TTCATCATGA TCAGCGCTCT GATTCAGAAT
ATCTGCGGGC AATCTCTCTA TCAGATTGTT CCGGCATTGA TGCACGGAAA TCCCAGGAAT
ACCGGCCTCA TTCTGGGTGC GGTCGGAGCC GGAGCCATGG TGAGCATCCT CTTTGTCCTG
CCGTTTGCAC GCAAAAGCGA CAGGGTGGGA GCCAAGCTTT CCTCAGGCAC CCTGTGGATG
GGATGCGCAC TGTGCGTGGC AGGCGCCATC CCGGTGGTGG AAGTGCAGGC CCTCTGTTTC
TTTTTTGCCG GTTTGGCCAC TTCTTCCCTC TTTGTTACTT CTTCATCCGC CGTGCAGCTT
CTGTCGCCCC CGGAACGCAA GTCGGCCATT CTGGGGCTGT TCAGCATTGT CACCATCGGT
GTGCAGCCGC TGGCGGCCAT GGGGTGGGGA GCTGTGGTGG ATGCCTGGGG TGTCCAGATG
ACGATTGTCG TGGCGGGGGG GCTGGAAGCC TTGTTTTCCA TTTGGATGCT GGGCGTTCCA
TTTTGGCGTC ATTTCAAATT TTCTCCGGAT GATTGTCCGG AGACGACATA G
 
Protein sequence
MKWLNDWLGF FKPLQVRNLQ IFWSGQASAL IGMWLQVTAM GILVYDISGG SATAVGVLAA 
LNALPFFLGG MLLAGLGDRF DRRKLLIAVQ CVQWLVAVAL FLLTVLDMLQ LWHLYAAGLV
MGVNQTVGFP TQQAFVGDLI PRRQLQEAVG MYSLVFNTCR AIGPALAGYI IAEWGAGTAF
GGNVAASLPL VGCLVALKGR VADTSAPRKQ RGAGRKASGL KAVLATRSLL FIMISALIQN
ICGQSLYQIV PALMHGNPRN TGLILGAVGA GAMVSILFVL PFARKSDRVG AKLSSGTLWM
GCALCVAGAI PVVEVQALCF FFAGLATSSL FVTSSSAVQL LSPPERKSAI LGLFSIVTIG
VQPLAAMGWG AVVDAWGVQM TIVVAGGLEA LFSIWMLGVP FWRHFKFSPD DCPETT