Gene Amuc_0442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0442 
Symbol 
ID6275614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp524968 
End bp526080 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content59% 
IMG OID642612492 
Productglycosyl transferase group 1 
Protein accessionYP_001877061 
Protein GI187734949 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.149003 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones81 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAG TACATCTCGT CCCCTCCATG GAATCCGGAG GCGTGGAACA AGTCGTTATG 
GAACTGGGCA GCGGCCTTTC TTCCCGGGGC GTGGAAAATA TCGTCGTTTC CGGAGGCGGA
CGCCTGGTGC CCCGTCTGGA AAAGGAAGGC TCCCGCCACA TCCTGATGCC GATAGGCAAA
AAAAGCATCT CCACTCTCTT CCGCATCGGG GCCCTCCGCG CCCTGCTTCA GGCCGTCAGG
CCGGATATCC TGCATCTCCA TTCCCGCGTT CCTGCGTGGG CAGGCTACCT GGCATGGAAA
AAGCTCCCGC CGGAAGACCG CCCCGGCCTC GTCACCAGCG TTCACGGCTT CTACTCCGTC
AACCGGTACT CCGCCATCAT GAGCCGAGGA GAGCGGGTGA TCGCCGTCTC CAACTGCATC
AGGGACTACA TCCTTGACCA TTATCCGTCC ACCCCTCCGG ACCATATCAG AATCATACCC
AATGCTATTT CCCCGGACCA ATATCACCCG GCCTACTCCC CCTCCCGGGA ATGGCTCACG
GGCTGGTTCA TGTCCTATCC TGAACTGAAG GGGAAATTCA CCCTGTGCCT GCCGGGCCGC
ATCACGCGCT TGAAAGGGCA TCTGGATCTG ATTCCGGTCG TCAGGCAGCT TCTGGAACAG
GGAATCCCGG CCCACGCCGT CATTGTAGGA GAAGCAAAGA AGGGAAAAGA AGAATATAAA
AACGAGGTCC TGCGGGCAAT AGAACGTTCC GGCGTCTCCC AGTCCTTCAC CTGGACAGGC
CATCGCCAGG ATCTGAGGGA AATCCTTTCC ACATGTTCCG TCACCCTCTC CCTGACCAAA
AGCCCGGAAG CCTTCGGCAA ATCAACCCTG GAGGCGCTCG CCCTGGGCAA ACCCGTAGCC
GGATACGCCC ACGGCGGAGT CAAGGAACAG CTGGACGCCT TCCTTCCTGA AGGGAACGTC
GCCGTAGGAG ATACCGCCGC CATGGCGAAC CTGCTGGCCC GCTGGCATAC CCAGCCCCCC
CCCCTGCCCC GGCAAATTCC TTCCCCTTAC AATATGCAGG ATATGATTCA AGCCCATCTG
GACGTTTACC AGGAACTGAC ACCTTATTCA TGA
 
Protein sequence
MKIVHLVPSM ESGGVEQVVM ELGSGLSSRG VENIVVSGGG RLVPRLEKEG SRHILMPIGK 
KSISTLFRIG ALRALLQAVR PDILHLHSRV PAWAGYLAWK KLPPEDRPGL VTSVHGFYSV
NRYSAIMSRG ERVIAVSNCI RDYILDHYPS TPPDHIRIIP NAISPDQYHP AYSPSREWLT
GWFMSYPELK GKFTLCLPGR ITRLKGHLDL IPVVRQLLEQ GIPAHAVIVG EAKKGKEEYK
NEVLRAIERS GVSQSFTWTG HRQDLREILS TCSVTLSLTK SPEAFGKSTL EALALGKPVA
GYAHGGVKEQ LDAFLPEGNV AVGDTAAMAN LLARWHTQPP PLPRQIPSPY NMQDMIQAHL
DVYQELTPYS