Gene Amuc_0753 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0753 
Symbol 
ID6275013 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp886436 
End bp887422 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content55% 
IMG OID642612804 
Productglycosyl transferase family 8 
Protein accessionYP_001877370 
Protein GI187735258 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.833822 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.756645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATC CGATGAAAAA GAATGAATTT GCCGTCGTCC TGGCCAGTGA CAACAGGGGC 
ATTCTACCTT TGAGCGTTAC TGTCTTTTCT TTGCTGAATA CTGCCGGTCC GGAGACTTTT
TACAAAATTT ACGTTCTTTC CGACGGCATT GACGGAGAGA ACTGGGCGAG CGTGGAGCGG
CTGGCCGCTC CGTTCGATTG CCGTCTGGAG TTCATAGACG TTTCCGGCAT TTTGGAAAAG
CACGACTTCC CCCATACGGA ACAATGGCCG GTGCCAGCCT GGGGACGCGT GTTCATTCCG
GAATTATTGA AGGAAGAGCG GGGCAATATT CTGTATCTGG ACATTGACGT TCTGGTTTGC
CGGGATTTGA CGGAGCTGTT CCGGACGAAT ATGGACGGAA AAGCAATAGG GGTGGTGTTT
GAGAATTTTT CCAGACCCGG TTCCCATTTT AACGAACGTC TTGAGATGCC GCTGACCTGC
ACGGGATATT TCAATTCCGG CGTGCTGCTG ATGAATGTGG ATGTCTTCCG GGAGAAGAAT
CTGGTCAGGG CTGTGCTGGA TTATGCCGTC ACTCACCGGG ACAGGCTGAC ATGTCCGGAC
CAGGATGCCT TGAACGGAGC CCTGTGCGAG CTGACTGTGC CGCTGCACCC GCGCTGGAAC
TGGCATGACG GTTTGACGCG CCGCATTTTG AAAAACGATC CCCGGGAACA GTTCTGGCGC
GGCGTCACGC CGCGCCAGGC GGTGGAAGCG GCTTTGGAAC CGGGCATTCT TCATTACCAG
GGGGTGCACA AGCCCTGGCG GTATAATTGG CGTTATGAAG GGGAACGTTA CGAACGGGTT
ATGCGTGAAG CCGGGCTGCT GCGCGGTCCG CTGCCCGGAA GAACGCTCCC GGCCGTCTTG
AAAAAGCACC TTTACCGGCC TGTTTACCGG ATGACGGCCA GAAAGATTTT AAGGCTGAAG
GAAGGGTTTG ATAACCGGCT CCTTTGA
 
Protein sequence
MNNPMKKNEF AVVLASDNRG ILPLSVTVFS LLNTAGPETF YKIYVLSDGI DGENWASVER 
LAAPFDCRLE FIDVSGILEK HDFPHTEQWP VPAWGRVFIP ELLKEERGNI LYLDIDVLVC
RDLTELFRTN MDGKAIGVVF ENFSRPGSHF NERLEMPLTC TGYFNSGVLL MNVDVFREKN
LVRAVLDYAV THRDRLTCPD QDALNGALCE LTVPLHPRWN WHDGLTRRIL KNDPREQFWR
GVTPRQAVEA ALEPGILHYQ GVHKPWRYNW RYEGERYERV MREAGLLRGP LPGRTLPAVL
KKHLYRPVYR MTARKILRLK EGFDNRLL