Gene Amuc_0973 details

Gene Information

Locus tag: Amuc_0973
Symbol:
ID: 6274176
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1163546
End bp: 1164673
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 63%
IMG OID: 642613027
Product: Monogalactosyldiacylglycerol synthase
Protein accession: YP_001877586
Protein GI: 187735474
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0707] UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase
TIGRFAM ID:
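
The coordinates and lengths in this record are mutually consistent, which a short Python sketch can check. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Amuc_0973.
length = gene_length(1163546, 1164673)
print(length, protein_length(length))
```

With the values from this record the sketch prints 1128 and 375, matching the listed gene and protein lengths.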


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.817942
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value: 0.377546
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAAAG GACGGCTGCC GCGCATTCTG ATTGTGACCG CAGGATACGG GGAAGGGCAC 
AATTCCGCAG CCAGGGGCGT GAGGGATGCT CTCGCAGGGC GCGCGGAAGT CCGCGTTACG
GACCTGTGCG CGGAGGCGAT GCCCCGGATG TTCCGCCTGA CGCGCGCCGC CTACCTGTGG
ACGATCTCCC GCATGCCCCG TCTCTGGAAA TGGATGTACG AGGTGAGCGA CAGGCGCAAT
ATGGCGGAGA AGCCCGTCAG GGGAATTGCC CCCGTGGAGC GCCTGCTGGA ACGTTTGCTG
CGGGAATGGA AGCCGGATGC CGTGGTCTGC ACCTACATGG TATATCCCTA CATGCTGGAT
TCCCTGGCCT CCCGCACGGG CAGGGCGGTT CCTTACCTGA CCGTCGTGAC GGATTCCTTC
GTCATCAATA AATCATGGCT GTGTTCCAAG TCCCCCCTCT GGGCCGTAAC GGACCCCTGG
ACGAGGGCAA TTATGGAGGA AAAGGGGCTG CCGCAGGACA GGCTGCGCGT TACGGGATTT
CCCGTCAATC CCGTGCTGGG AGCGCTGGCG GAGGAACATC CCCTTTCCTG GAAAGAAGGG
GAGCCGTTCC GGGTGCTTTA CTTTGCCCAG CGTTCCGCAC GGCATGCGCG GGCGGAGCTG
GCGGGTATGC TGGATGCGAA TCCCGCCCTG CATGTAACCT GTATTCTGGG GCGCCGGTTC
CGGCGCATTT ATCCCCGCAT CCGGGATTTG CGCGCCAGAT ACGGACGCAG GCTGACGGTG
CGCGGCTGGA CGCGCCGCGT GCCTTCTTAT CTTGCCGCCA GTCACGTAGT GGTCGGGAAA
GCGGGAGGAG CCACCGTACA CGAGGTGCTT GCGGCCGCAC GTCCCATGCT GGTGAACTTT
CTTCTTCCGG GCCAGGAGGA GGGCAATACC CGCCTCCTGG AAAAACTGGG AGGCGGCAGC
CATGTGCCGG ACGCCCGTGC CCTGGCTTCC GCTCTTCAGG AAATGATGGC GGACGGTGGC
GCACAGTGGA GGCGCATGCA TGAAAACCTG CTGCGGGCCG GGATGACCGG AGGAAGCGGC
AGAATAGCGG ATTTGGCCTT GAAACTGGCG GAGGAACATA CTAACTGA
 
Protein sequence
MQKGRLPRIL IVTAGYGEGH NSAARGVRDA LAGRAEVRVT DLCAEAMPRM FRLTRAAYLW 
TISRMPRLWK WMYEVSDRRN MAEKPVRGIA PVERLLERLL REWKPDAVVC TYMVYPYMLD
SLASRTGRAV PYLTVVTDSF VINKSWLCSK SPLWAVTDPW TRAIMEEKGL PQDRLRVTGF
PVNPVLGALA EEHPLSWKEG EPFRVLYFAQ RSARHARAEL AGMLDANPAL HVTCILGRRF
RRIYPRIRDL RARYGRRLTV RGWTRRVPSY LAASHVVVGK AGGATVHEVL AAARPMLVNF
LLPGQEEGNT RLLEKLGGGS HVPDARALAS ALQEMMADGG AQWRRMHENL LRAGMTGGSG
RIADLALKLA EEHTN
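
The sequences are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal Python sketch, not a tool provided by the source database, that strips the whitespace, computes GC content, and translates a CDS. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code, and it stops at the first stop codon; all names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Amino acid assignments of table 11 match the standard code (assumption stated above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last blocks of the gene sequence shown in this record.
print(translate("ATGCAGAAAG GACGGCTG"))
print(translate("GAGGAACATA CTAACTGA"))
```

The two calls print MQKGRL and EEHTN, which agree with the start and end of the protein sequence listed in this record. Passing the full gene sequence to `translate` and `gc_content` allows the same comparison against the complete protein sequence and the listed GC content.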