Gene Amuc_0757 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0757 
Symbol 
ID6275321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp890740 
End bp891894 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content55% 
IMG OID642612808 
Productglycosyl transferase family 2 
Protein accessionYP_001877374 
Protein GI187735262 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.122576 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCCG ATGTTTCCAT CATTGTTCCC TGCTACAATG TAGCCGCTTA CGTGGACAGC 
TGCCTGGAGA GCCTGGTACG CCAGACTCTC CGGAACATAG AAATTATCTG CATCAATGAC
GGCTCCACGG ATGAGACCTG GACACATCTG CTGCGCTGGA AAGAAAAGGA CAGCAGAATC
ATCCTTCTGA ACCAGCGGAA CGCGGGCGTC TCCGCAGCAA GGAATGCAGG GCTGGATGCC
GCCCGCGGCC TTTATGTCGG TTTTGCGGAT CCGGACGACT ACATGGATCC GGAAATGTAT
TCCCGCCTTT TTTCGGCGGC TCTGGAATAT GACGCAGACA TCGTGGAATG CGGCAACCAT
GTTTTTGAAG ACTCGTCAGA CCGGATTATC GAAGCCAAAA GAAGATCACC CTCCCGGCAT
TTTGAAGAGA ACGCCTCTCC GGCCAGCTTC TTCCGGGATT CCATCTGGGG GAAAATGGAT
ATCTGCGTGT GGAGCAAACT GTTCCGGAAA AGCATGCTGG ACGCCCACCG CCTCCGCTTC
AACGTACATC TGAAATCCGG CGCGGAGGAT GAAACCTTCC GGCTGATGGC CGTTCCCCAT
GCCTCCCGTC TCCTGTTCAT TCCCGACTGC CTGTATTACT ACCGCCTTAT GCGCAACGGC
TCCCTCTCCC GCCGCTGCAA CGTTCCCACC TACTCCAAAT GCGTGCAGGA ATTCCAGCGG
CTGCTGTACA TTGTGGACTA CTGGCAGAAA CAAGGATGGC AGAATGAAGG CCTGTTCGCT
TACGGCGTCC GGAAAATCAG GCCTTTTTTT GTTTCCAAGC ATCCCCTTTT CCATCAGATG
ACCGCTGTCC AGCAACGTTC CGCGCTGGAC TGGTGGAGCC TGTTCTATCA GAAGGCGGAA
GGAAAACGTT TCCTCTCCGC TCTATCGGGA CGGGACAGGC AACTGGCGGA CCTGTTGAAC
TCGGCGGAAC CGGTCCCCAA CGGCTGGGGG CGCATTCTGC TGGCAGCCTG CTCCCTGCTT
CCCGGACAAA AAGGACGTTA TTACTCCTGC AAAAAAATGC TTGCGGAACA TTTTTCCCAA
ATCTGCCCCA ATGAGTTTCA AAAAGAAAAC GCTTCTCTGG AAGAGCCGTT TGACACATCG
CCGCCTTCCC TGTAA
 
Protein sequence
MIPDVSIIVP CYNVAAYVDS CLESLVRQTL RNIEIICIND GSTDETWTHL LRWKEKDSRI 
ILLNQRNAGV SAARNAGLDA ARGLYVGFAD PDDYMDPEMY SRLFSAALEY DADIVECGNH
VFEDSSDRII EAKRRSPSRH FEENASPASF FRDSIWGKMD ICVWSKLFRK SMLDAHRLRF
NVHLKSGAED ETFRLMAVPH ASRLLFIPDC LYYYRLMRNG SLSRRCNVPT YSKCVQEFQR
LLYIVDYWQK QGWQNEGLFA YGVRKIRPFF VSKHPLFHQM TAVQQRSALD WWSLFYQKAE
GKRFLSALSG RDRQLADLLN SAEPVPNGWG RILLAACSLL PGQKGRYYSC KKMLAEHFSQ
ICPNEFQKEN ASLEEPFDTS PPSL