Gene Amuc_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1947 
Symbol 
ID6275132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2365259 
End bp2366440 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content59% 
IMG OID642614007 
ProductN-acylglucosamine 2-epimerase 
Protein accessionYP_001878541 
Protein GI187736429 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2942] N-acyl-D-glucosamine 2-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.904714 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value0.0967574 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCAA CTCTTGATAT GCCCTCTTTG GGCAAGTTCT ACCGCCGCCA GCTTCTGGAA 
GACGTTCTGC CGTTCTGGTT CCCCCGCGCC TACGACGAAA AAAACGGTGG TCTTTACCAC
TGCTTTGACG CAGACGGCAC GCTGGTGGAC TCCGACAAAT CCGTCTGGGC CCAGGGCCGC
ATGGCATGGA TGCTGCTGAC AATGTACAAC AGCATCGAAA AGAATACGGA CTGGCTCAAA
TGGGCGGAAA GCGCCTTGGA ATTCCTGAAA ACCAAGTGCG TTGACCCGGC GGACGGCCGC
ATGTTCTTTC ATGTGGCTGC CGACGGCACC CCCATCCGCA AACGCCGCTA CGCCTACAGC
GAATCTTTCG CCGCCATCGC CTTTGCAGCG CACGCCAGGG CGACCGGCAG CCGGGATTCC
GCCCGTGAAG CCCGCCACTG GTTCGACATC TTCACGGACA ACTGCTTCAC TCCCGGCAAA
ATGGTTCCCA AATTCACCGG GGAACGCCCT ACGACCGGTC TGGGCACCCG CATGATTACC
CTGAACACAG CTCAGGAAAT GCGCAAGTAC CTGGAAGATG ACGACGGCTT CTATACCGGC
TGGACAGACC GCTGCATCAA CGACCTCCGC ACCCTGTTCA TGAAACCTGA CATCCAGGCA
GTCATGGAAG TGGTGGGAAC GGACGGTTCC ATCATCGACC ACTTCGACGA ACGCACCCTG
AACCCCGGCC ACACTACGGA GGGTGGGTGG TTCGTGCTGG AAGAAGCCCG CCATCGCGGC
AATGACCCCG AACTCATCAA GGTGGGCTGC GACATGATCG ACTGGGCATT CGCCCGCGGC
TGGGACAAGG AAAACGGCGG CATGCTGTAC TACACGGACG TGTACAACAA GCCCGTTCAG
GAATACTGGC ACAACATGAA GTTCTGGTGG CCGCATGATG AAGCGCTCAT CGCCATGACG
CTCGCCTACA AGCTCACAGG GGAAGAACGC TATGCCATCC GTCACGACAT GGTGCGCAAC
TGGGCTTTCT CCCACTTCCA GGACGTTCAG CATGGCGACT GGTTCGGCTA TCTGAACAAG
GACGGTTCCA GGGCTAACAC CCTCAAGGGG AGCCTCTGGA AATCCTTCTT CCACCATCCC
CGCGCCATGT GGTGCTGCGC CCACTACTGC GGCGCCATTT AA
 
Protein sequence
MAATLDMPSL GKFYRRQLLE DVLPFWFPRA YDEKNGGLYH CFDADGTLVD SDKSVWAQGR 
MAWMLLTMYN SIEKNTDWLK WAESALEFLK TKCVDPADGR MFFHVAADGT PIRKRRYAYS
ESFAAIAFAA HARATGSRDS AREARHWFDI FTDNCFTPGK MVPKFTGERP TTGLGTRMIT
LNTAQEMRKY LEDDDGFYTG WTDRCINDLR TLFMKPDIQA VMEVVGTDGS IIDHFDERTL
NPGHTTEGGW FVLEEARHRG NDPELIKVGC DMIDWAFARG WDKENGGMLY YTDVYNKPVQ
EYWHNMKFWW PHDEALIAMT LAYKLTGEER YAIRHDMVRN WAFSHFQDVQ HGDWFGYLNK
DGSRANTLKG SLWKSFFHHP RAMWCCAHYC GAI