Gene Amuc_0968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0968 
Symbol 
ID6274187 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1155091 
End bp1156467 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content56% 
IMG OID642613022 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_001877581 
Protein GI187735469 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.066014 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.648575 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATTC AAAATGCAGA CCGTACTACC TACGGGATTA ATAAGATTGT TGTTTTCGTC 
CTTGCGTTCG TCGTTTTCTT CGTTGTCGGA TTTGTGTCCT GGCAATGGGA TGGGGTCGCT
CCGGAACCGT GGCGTCTGCG GACGCTGGTG CTGACGCCGC TGACGATGTC TCTGCTGCTG
TATGTGTGCT GTGACCAGTT CGGCGTTTAC TACCGCTGCC AGACGCTCTG GGAAAGCATC
CGGCGGTTTG TGGTGGCATA TGCCTGTTTT GTGGTGTTAA GCGTGATGAT ATGGAAGTTC
TTTGTTCCCT TTTCCGCGGA TGTCGGTGTC CAGTCCCTGG CGTATCTGCT TACCTTCCTG
GGTTCCCTCT GGCTGCGTAT CAGCCGGTTC CGGGCGGAAA AAAGGCTGGT GGAGGAAGCC
CCTACGCTGG TGGTGGGAAC TCCGGATCAT GTGAAAGAAT TCCGCAGGAC GTTGGAGAGC
AATCATGTGG ACTTCAGGAA TGGGTTGCTG GTCATGGCGC CTGAAGAGGT GGACAGTGCG
GTAGTCAGGA ATTTGCTGAT TAAGAACTGC GTCAGCAAGG TGGTGTTTTT GCCGGAGGAG
GTGGATTCTT CCGTGGCGCG GTGCCTGGTG GAACTGTGCG GCAAGATGGG AGTCGATTTT
TACGCGAGCA TGGTGGTAAG CATGCCCGCC GTGCATAAAA CCTATTTCGG CGTGATTGGA
GGAACAAGAA TGCTGGTGTA CAAATCCACC CCCATTCCCT ATACCACCTC CTGGCAGTTG
AAGAAAATGC TGGACTGGAC GGGAGCGCTG GCGCTTCTGG TGGGAACGTC ACCCCTGTGG
GTGCTGGCGG CTGTGGGAAT CAAGCTGTCG GATCGCGGCC CGGTTTTTTA CCGCCAGAAA
CGTTCCGGGC TGTATGGCCG GGAGTTTGGC ATGTGGAAAT TCCGCACCAT GTACCGGGAT
GCGGATAAAA GGCTGGATGA AGTGAAAGCC CAGTACGGCA ATGACATGGA CGGCCCCATC
TTTAAGCTGG AGCACGACCC TCGCATTTTC TCCTTCGGGC GCTTCCTGCG CAAATTCAGC
ATCGACGAGC TTCCCCAGCT CATTAATGTG CTGAAAGGGG AAATGAGCCT GGTCGGCCCG
CGCCCGCTGC CCGTTTATGA AACGGAAGCC TTTACCAGCG ATGCCCACCG CCGCAGGTTG
AGCGTGCTGC CCGGCGTGAC GGGGTACTGG CAGATTGCCG GACGCAGCAA CATCCGGGAA
TTTGAAAAGC TGGTGGAATT GGATATGAAG TATATTGACA ACTGGTCCCT GTGGCTGGAT
ATCAAACTGC TTCTGAAAAC CGTCCCGGCA GTGCTTTTCG CCCGTGGGGC AAAGTAG
 
Protein sequence
MLIQNADRTT YGINKIVVFV LAFVVFFVVG FVSWQWDGVA PEPWRLRTLV LTPLTMSLLL 
YVCCDQFGVY YRCQTLWESI RRFVVAYACF VVLSVMIWKF FVPFSADVGV QSLAYLLTFL
GSLWLRISRF RAEKRLVEEA PTLVVGTPDH VKEFRRTLES NHVDFRNGLL VMAPEEVDSA
VVRNLLIKNC VSKVVFLPEE VDSSVARCLV ELCGKMGVDF YASMVVSMPA VHKTYFGVIG
GTRMLVYKST PIPYTTSWQL KKMLDWTGAL ALLVGTSPLW VLAAVGIKLS DRGPVFYRQK
RSGLYGREFG MWKFRTMYRD ADKRLDEVKA QYGNDMDGPI FKLEHDPRIF SFGRFLRKFS
IDELPQLINV LKGEMSLVGP RPLPVYETEA FTSDAHRRRL SVLPGVTGYW QIAGRSNIRE
FEKLVELDMK YIDNWSLWLD IKLLLKTVPA VLFARGAK