Gene Amuc_0941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0941 
Symbol 
ID6274224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1121219 
End bp1122349 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content58% 
IMG OID642612995 
Productglycosyl transferase family 2 
Protein accessionYP_001877554 
Protein GI187735442 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones71 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCTCC CCTCACCCCT CATTTCCGTC ATCGTCCCCA TCTACAACAT GGAGCCTCTT 
CTTCCGCGCT GTCTGGACAG CCTGGCGGCA CAAACCCTCC GCGACCTGGA AATCATCTGC
GTGGACGACG GTTCAACCGA CGGGTCCGGA GGCATCGTCC GGAAATATGC TTCCGGAGAC
AGTCGTTTCC GCCTGATTAC ACAGGAAAAT TCCGGCAGGG CGGAAGCCCG TAACGCGGGA
ATCAGGGCAG CCGCGGCGCC TTACCTGGGT TTTGCGGATC CGGATGACTA CGTTGAGCCG
GACATGTACG AACGGCTCTA CCGGCTCGCG GAAGAAAGCG GGGCGGACAT GGTCCAATGT
TCCTATTCCC CCTTCCTTCC GGCTGAAAGC GGAGAATCGC GCGGAATGGC TGAGGAAAAA
CTCCTTCATA TTGAAAACAC CGCCTGCGAC GGCGTCTTTA CGGAGAAAGG AGAAATATTC
AGGCTCTTTC TGGAAGACAG GATCACCGGC GTCGTCTGGA GCAAATTGTT CAGGCGGATA
CTCCCCGGCT GTTCCGCCCC CCTGGAAGTC CGCCTGCCTT CTTCCTTCAC CAGCGGAGAA
GACACCCTTT ACGTTTCCCG GGCCATCGCC CGCTGCCGCA GCGTCGCCCT CACCTCTGAA
AAACTCTACC ACTACGGGCT GGGCGGTCCC CAATCCGTTT CTTCTCGCAA CCGGAAAGCG
GAAACCCGGC CCGCAAGTTA TTACGCCGTA TTCGAGATGC TCACCAGGGA AAAACTCCGG
GAAGGGGTTC TGGGCAGCAA CCGCACGGCC TACATGAATT ATATTGTCCC CCTGCTTTTT
CCGGACAATG AAATGCCGGC CGGGCGGCTC AGGCATTGGG CGGAACTCTG GCGGGAGGCG
GACATCACTT CCGAACACGT TTGCGGGATG CCCAGGGAAC AGAGGGCTTA TCTTGAGGCC
GCCCTGGCCG GACGGTGGAT TTACCTGCGC CTCCTTCAAT GCTTCTGGAA GTTCAAGACC
GCCAGAAGGA AAATGTTTCG GCTGAGGTTT TCCAAAAACG GAATAGACAC GCTCCAAATC
CTCGGATACA CTCTTTATTC TGCAAAGAAG CTCCCCCGAT CATCCCTGTA A
 
Protein sequence
MPLPSPLISV IVPIYNMEPL LPRCLDSLAA QTLRDLEIIC VDDGSTDGSG GIVRKYASGD 
SRFRLITQEN SGRAEARNAG IRAAAAPYLG FADPDDYVEP DMYERLYRLA EESGADMVQC
SYSPFLPAES GESRGMAEEK LLHIENTACD GVFTEKGEIF RLFLEDRITG VVWSKLFRRI
LPGCSAPLEV RLPSSFTSGE DTLYVSRAIA RCRSVALTSE KLYHYGLGGP QSVSSRNRKA
ETRPASYYAV FEMLTREKLR EGVLGSNRTA YMNYIVPLLF PDNEMPAGRL RHWAELWREA
DITSEHVCGM PREQRAYLEA ALAGRWIYLR LLQCFWKFKT ARRKMFRLRF SKNGIDTLQI
LGYTLYSAKK LPRSSL