Gene Amuc_2088 details

Gene Information

Locus tag: Amuc_2088
Symbol:
ID: 6275819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2538989
End bp: 2540032
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 57%
IMG OID: 642614150
Product: glycosyl transferase group 1
Protein accession: YP_001878678
Protein GI: 187736566
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
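
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2538989, 2540032)
print(length, protein_length(length))  # 1044 347
```

Both values agree with the Gene Length and Protein Length fields of the record.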


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAGC AATATGACTT CATCTACCTG ACCAATACGC CTTCCTTTTA CAAAGTAAGG 
CTTTGTGAGG AACTGGCGAA AAAACACTCC GTTCTCCTGG TTCTTTACGG CTATGGGGCG
GAAGCGGTCA ATACCCAGCT CTCCGGCAAT GAGGGAGGCT TTGACTACTT CTTTCTGCAT
GAGGGAGATG CGGGAAAAAG AAACAAGGCT CTTGTCCTGC TCAGGCTCCT GAAGCTGATG
GCCCGGGTTC GGGCCCGCAG AGTGCTGTTC TCCGGCTGGA TGGCGCCGGA ATACAACATA
TACAGCTTTT TTTCCCCCAG GCGCCGCAAT GCCGTCATTT GCGAATCGTC AGCCATTGAT
TCCGGCATGA GCGGCTGGAA AAGCCTGCTT AAAAAAGCCG TCATACGCCG CATGAGCGCG
GCGCTGCCTT CCGGTTCCCC CCACCGCGCC CTGTTTGAGC ATATTCGTTA TCCGGGAGAC
ATCCATGTCA CGGGCAGCGT AGGCATCTTT AACATGGAAG GCCGCCGTGC CCTCCGCCAT
TCCCCGTCCG CTCCCCTGAA CTACATTTAC GTCGGGCGTC TCGCGCCGGA AAAGAATCTG
GAACTGCTCA TCAGGGAATT CAACTCCAAT GGGCGGCCTC TGTCCATCGT GGGGGACGGC
CCTCAAAAAG AACTTCTCAA AAACATGGCC AAGGATAATA TCCGCTTTCT GGGCCACGTT
CCCAACGACA GACTCCCGGA AATATACGGA CGGCATGACG TGTTCATCCT CCCCTCCCGC
TATGAGCCGT GGGGGCTGGT CGTGGAAGAG GCCCTCTTCC GGGGGCTGCC CGTCATCGCC
AGCGACAAGG TGGGCAGCGC GGCCGACATG GTTGCCGCTC TGGAAACGGG CGCCGTCTTT
TCCCTGTCCG CGCCGGACGG CCTGAGCAAC GCCATTCATG AAGTTGAAAA GAATTATGAA
ACCATGGCGC GCCGCGTCGC GGACATCAAC TGGAACAGCC GCGTGGAAAC GCAGCTCAAG
GCATACACCT CCCTTTTAGA TTAA
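
The GC content field can in principle be recomputed from the gene sequence as printed, once the spaces and line breaks between the 10-base blocks are removed. The following is a minimal sketch of that calculation. The helper names are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces/newlines separating 10-base blocks and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return 100.0 * gc_count / len(seq)


# Toy input in the same blocked layout as the record: 6 of 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the full gene sequence text to `gc_percent` gives a value that can be compared with the GC content field.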
 
Protein sequence
MNKQYDFIYL TNTPSFYKVR LCEELAKKHS VLLVLYGYGA EAVNTQLSGN EGGFDYFFLH 
EGDAGKRNKA LVLLRLLKLM ARVRARRVLF SGWMAPEYNI YSFFSPRRRN AVICESSAID
SGMSGWKSLL KKAVIRRMSA ALPSGSPHRA LFEHIRYPGD IHVTGSVGIF NMEGRRALRH
SPSAPLNYIY VGRLAPEKNL ELLIREFNSN GRPLSIVGDG PQKELLKNMA KDNIRFLGHV
PNDRLPEIYG RHDVFILPSR YEPWGLVVEE ALFRGLPVIA SDKVGSAADM VAALETGAVF
SLSAPDGLSN AIHEVEKNYE TMARRVADIN WNSRVETQLK AYTSLLD
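
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted. Alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the NCBI layout: bases ordered T, C, A, G with the third
# position varying fastest; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing and newlines
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGAACAAGC AATATGACTT CATCTACCTG"))  # MNKQYDFIYL
```

The output matches the first block of the protein sequence. Translating the final 24 bases of the gene (`GCATACACCT CCCTTTTAGA TTAA`) likewise gives `AYTSLLD`, the end of the protein, with the terminal TAA read as the stop codon.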