Gene Amuc_1552 details


Gene Information

Locus tag: Amuc_1552
Symbol:
ID: 6273653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1864700
End bp: 1865860
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 57%
IMG OID: 642613611
Product: major facilitator superfamily MFS_1
Protein accession: YP_001878154
Protein GI: 187736042
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0738] Fucose permease
TIGRFAM ID:
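
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive (an assumption that is consistent with the listed 1161 bp) and that the final codon is a stop codon that is not counted in the protein length. Variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 1864700
end_bp = 1865860

# Assumption: 1-based, inclusive coordinates, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon and therefore adds no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"gene length: {gene_length} bp")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.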


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATGC CTCTTCCTAA AACGACGCAA GGTGTTTATC GTCTGTCGGT CAGTACCTTT 
TATTTTCTTC AGGGTTTGGT GTTCGCCAGT TGGGCATCCC GCATCCCGGA CATCAAAAGT
GCATTGGGGT TGAACGATGC CGACTTGGGG TCCGTTCTCT TTGCCGTCCC GGTGGGGCAG
ATGTCCGCCA TGGCGCTATC CGGTTACCTG GTTGGCCGTT GCGGCAGCCG GAAAATACTG
ATGGCGGCAT CCGTCTTTTA TCCTGCCGTG CTTGTATGCC TGGGTATGGC GGGGTCTTTC
TGGGAACTGG CGGCCGGGTT ATTTTTCTTT GGGGTAGCCG CAAATCTGAC GAATATATCC
GTCAATACGC AGGGAGTGGG AGTGGAACGG CTGTACCAGT GCAGCATCAT GGCCCGGTTC
CACGGTTTAT GGAGCCTGGC GGGTTTTTTC GGAGCTTTGC TGGGAGCTGC CATGGTGGAC
TGGCATATTT CTGCGGAAAC GCATTTCATC GCCATTTTCC TGATATGCAT GATTATTCTG
GCCGTTTTTT CCCCCTCTCT TCTGCCGAGG GATGCCCGGC GTTCCTCCTC CCAGGGAGGC
GGCATGTTCC GGAGCATGGA TGCTTATGTA CTGGTCATCG GGCTGATCGC CTTCGGAAGC
ATGGTGAGCG AAGGAACCAT GTTTGACTGG AGCGGCGTGT ACTTTGAAAG CGTGGTAAAA
CCCGGTCCGG GGCTGGTGCA GATGGGATAC GTGGCATTCA TGAGCACCAT GGCCCTGGGG
CGTTTTACGG CAGACCGCCT GGTGATGCGC TTCGGGCCTG TGCGGGTTCT GCGCGCCAGC
GGCATCCTTA TTGCCTCCGG ATTGCTCGTC TCCGTCCTGT TCCCGATGCT GTGGTCCGCC
ACGCTGGGCT TTCTGCTGGT GGGTTTTGGC ACCTCTTCCA TTGTCCCGCT CTGCTACAGC
ATGGCCGGGA AATCCCGGAA AATGATTCCC AGCATGGCGC TGGCTTCCGT TTCTACCATC
GGCTTTCTGG GGTTCCTGAT GGGGCCGCCG GTCATTGGTC ATATTGCCCA TGCTTCCTCC
CTCCGGTGGT CTTTCTCCCT GATTGCCCTG GTTGGACTGG GGACGGCATT CATTGCCCCT
TTCCTCAAGA AATATAGGTA A
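
The GC content field in the record can be compared against the gene sequence directly. The following is a minimal sketch, not the database's own method: the helper name `gc_percent` is hypothetical, and it assumes the sequence is pasted as plain text in the 10-base blocks shown above, so whitespace is stripped before counting.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and newlines from the block layout) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


# Example on the first 10-base block of the gene sequence only.
first_block = "ATGAATATGC"
print(gc_percent(first_block))
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content listed in the record.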
 
Protein sequence
MNMPLPKTTQ GVYRLSVSTF YFLQGLVFAS WASRIPDIKS ALGLNDADLG SVLFAVPVGQ 
MSAMALSGYL VGRCGSRKIL MAASVFYPAV LVCLGMAGSF WELAAGLFFF GVAANLTNIS
VNTQGVGVER LYQCSIMARF HGLWSLAGFF GALLGAAMVD WHISAETHFI AIFLICMIIL
AVFSPSLLPR DARRSSSQGG GMFRSMDAYV LVIGLIAFGS MVSEGTMFDW SGVYFESVVK
PGPGLVQMGY VAFMSTMALG RFTADRLVMR FGPVRVLRAS GILIASGLLV SVLFPMLWSA
TLGFLLVGFG TSSIVPLCYS MAGKSRKMIP SMALASVSTI GFLGFLMGPP VIGHIAHASS
LRWSFSLIAL VGLGTAFIAP FLKKYR
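
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this with a hand-built codon table and no external library. Translation table 11 assigns the same amino acid to each codon as the standard table (it differs in which codons may act as start codons), so the standard assignments are used here. The function name `translate` is hypothetical, and only the first and last blocks of the gene sequence are used as examples.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. The amino acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace from the 10-base block layout is removed first.
    Trailing bases that do not form a full codon are ignored.
    """
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence above.
gene_start = "ATGAATATGC CTCTTCCTAA AACGACGCAA"
gene_end = "TTCCTCAAGA AATATAGGTA A"

print(translate(gene_start))
print(translate(gene_end))
```

The two printed fragments match the beginning and the end of the protein sequence listed above. The final codon TAA is a stop codon and contributes no residue.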