Gene Amuc_1064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1064 
Symbol 
ID6274046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1270394 
End bp1271875 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content56% 
IMG OID642613115 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001877671 
Protein GI187735559 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value0.902671 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGAGT CCGCAAACCC CATCCTCAAA TATTTTTCGG AATTCAAAAT CCTGAAAACC 
GTATCCAAGG ATTTCTGGCT TACCAACGTC GTCCAGTTCT TCGACGGAAT GGCCTATTTT
TCCATGATTA CGGTATTCGT CCTCTACCTG ACGGACTATT GTAGCTTCAA CGATGCGGAC
GCCGCCCTCT GGGTAGGCCT GTACACCCTT TTCATCTCCG CCTTCGTCTT CGCGGTGGGT
TCCATCTGCG ACATCATCGG CATACGGCGC ACTTACCTGA TCGGCTTCAT CATCCTCATT
GCCGGGCGCC TTATCATGGG CTTCGGTCCG GACCTCAGCC CGACCGTGGA CTCCGGAAGG
CTGGCCGTCA TGGCGGGCAT TCTGATTATG TCCTTCGGCA CGGCGTTCAT GTCCCCCGTC
ATCCAGACTT CCATCCGGCG CTTCACCCCG CTGAAAGCGC GTTCCACGGG CTTCAATATC
TACTACCTGC TGATGAACAT TTCCGCCGTC ATTGCGAACG TGTTCCTGAT TGAATTTTTC
CGCAAGCACT TCGGCGCCGT GGACGGCGGG TACTGGATCA TCAACTTCGG AACGCTGATG
GTGTTGCTGG GCTGCATCAC CACCCGTTTT ATTAATGAAG ACAACTACGC GGAACCCAGC
GAACGGGAGG CGAACATCAA TGCCCCCCTC CGCAGACCTC TCCAGCTCTT CATGGAGGTA
TGGAAGGAAT CCGCCTTCCG CAAACTTATC CTCTTCCTGG TGCTGACCAT GGGCGTGCGC
ATCGTTTTCA CGCTGCAATT CCTGGTGATG CCCAAATACT ACGTGCGGAC GCTATATGAC
GATTTCGCCA TAGGCTCCAT CAATGCCGTC AACCCGGCCA TCATTGTCTC CGGCCTCATC
CTGCTCATTC CGGTGCTGGG CCGCTTCTCT ACCGTTGGCC TCATGATTGC GGGCATGTCC
ATTTCCGCTT TCTCTCTCGT TTTCATGGCC ATTCCCATTG AATGGTACTA CCTGGTGCCC
GGCATCGAAA CGCGCTCCCA GGCGTATCTG GTAGCCATTG TGGCGCAGAT CCTGGTATTC
GCATTCGGGG AACTGCTCTT CTCTCCCCGC TTCTCCGAAT ATGTGGCGCG GGTAGCTCCG
AAAGACAAAG TGGCTTCCTA CATGTCTCTG GCGGCGCTTC CCATGTTCAT CGCCAAACCC
ATTAATGGCA TCATCGGCGG CCTGCTCGTC GCCTACCTCT GCTATGACGG CATCTGCGCC
AAGATGGATA CCGGGCACAT CGGCTTCTGG GACTCCCCGG AATTCATGTG GACCATTTAC
CTGGCCATGG CCGTAATCAG CCCCATCGCC ATTATCATGA CCCGCAGGAC CATCACCTCG
GACCATCCCG AGGAAGACGC GGCATCGCCA CCCATCAGCG CCATTGAAGC GGAAACGGAT
CCCGCCCTGA CGGCGGAAGA ACTTACGGAA GCCAACTCCT GA
 
Protein sequence
MNESANPILK YFSEFKILKT VSKDFWLTNV VQFFDGMAYF SMITVFVLYL TDYCSFNDAD 
AALWVGLYTL FISAFVFAVG SICDIIGIRR TYLIGFIILI AGRLIMGFGP DLSPTVDSGR
LAVMAGILIM SFGTAFMSPV IQTSIRRFTP LKARSTGFNI YYLLMNISAV IANVFLIEFF
RKHFGAVDGG YWIINFGTLM VLLGCITTRF INEDNYAEPS EREANINAPL RRPLQLFMEV
WKESAFRKLI LFLVLTMGVR IVFTLQFLVM PKYYVRTLYD DFAIGSINAV NPAIIVSGLI
LLIPVLGRFS TVGLMIAGMS ISAFSLVFMA IPIEWYYLVP GIETRSQAYL VAIVAQILVF
AFGELLFSPR FSEYVARVAP KDKVASYMSL AALPMFIAKP INGIIGGLLV AYLCYDGICA
KMDTGHIGFW DSPEFMWTIY LAMAVISPIA IIMTRRTITS DHPEEDAASP PISAIEAETD
PALTAEELTE ANS