Gene Amuc_1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1869 
Symbol 
ID6275706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2271629 
End bp2272927 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content55% 
IMG OID642613930 
Productglycosyl transferase group 1 
Protein accessionYP_001878464 
Protein GI187736352 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value0.740215 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAA TTCGAGTTCT TACATTAGGA TGGGAATTCC CGCCCCTGGT CAACGGGGGG 
TTGGGCATTG CCTGCCTGGG TCTTTCCAAG GCGCTGGCAA AAAAAGTGGA TTTAAGGGTG
ATCGTTCCCA AGGCCGACCC TTCCGTCCTT TTTGACGGAT TCCAGCTCAC CGGTCTCAAT
AACGTCTCCT ATCGGGAAGT GGAGCAAGTG GACCGGAAAT ATTCCTATGA CAGCTTTGCC
CTGGTGGAGC GCGCGCCCAT TGAACTGGAC CCCTACACCA CCGTGGAAGG GGGATCCGGC
GTGGTGCAGT TCACCAAGGA AGGCCGCATC ACCTTCTCCA AAACGCATGA AGCCGACCTT
CAGCTGTTCA GGAACAAGGA AGACCTTTAC GCCGGAGACC TGGCCCTGAA AGTCATTCAA
TTCTCAAAAA TAGCGGTAAA AGTCGCCCTT CAGCAGGATT TTGACATTAT CCACGCCCAT
GACTGGATGA CCTATCTGGC TGGCGTGGAA GTGAAAAAAG CCACGGGCAA GCCCCTGGTG
GTGCATCTGC ACGCTTCCCA GTTTGACCGT GCCGGAGCGG ATGCCCGCGG CTGGATTTAC
GACATTGAAA AATTCGGCAT GGAACAGGCG GATGCCGTTA TCCCGGTCAG TAAATACACG
GGAACCATCG CCAGCGGGCA CTATGCCATC GACCCCCATA AGATATTTCC CATTCACAAC
GGAGCGGATC CGGTCAAAGT CTTCAAAGGG AAGAAAAAAT TCCCGGAAAA ACTCGTCCTC
TTCCTGGGCC GCCTGACGGC TCAGAAAGGC CCGGGATTCT TCCTTCAGAT TGCCGCCAAG
GTTCTGGAAC AGACGGACGA CGTACGCTTC GTCATGGCCG GTACGGGAGA AAAGCTCCGC
CAGTTGATCG AATCCGGAGC CTTCAAGGGC GTGGGCGACA AATTCCACTT CACCGGCTTC
CTGAACAAGG ACAAAGTCAA TGAACTCCTC TCCATCACGG ACATCTACTG CATGCCTTCC
GTATCGGAGC CCTTCGGCCT TTCTGCGCTG GAGGCGGCCC AATTCAACAT TCCCGCCGTG
ATTTCCAAGC AGTCCGGCGT GGCGGAAGTC ATGAAGGGAG CCCTGAAAGC GGATTTCTGG
GACGTCAACA AGATGGCGGA ACATATCGTC CATCTCTGCA CGGATGAGGA ATTGTACCGG
AAAGTAGTGG AACAGAGCAC GGAGGACATC AAGGCCTCCA CCTGGGATGC CGCCGCAGAC
AAGGTTATCC GGGTCTATGA ACATGTGCTG AACCGCTAA
 
Protein sequence
MSTIRVLTLG WEFPPLVNGG LGIACLGLSK ALAKKVDLRV IVPKADPSVL FDGFQLTGLN 
NVSYREVEQV DRKYSYDSFA LVERAPIELD PYTTVEGGSG VVQFTKEGRI TFSKTHEADL
QLFRNKEDLY AGDLALKVIQ FSKIAVKVAL QQDFDIIHAH DWMTYLAGVE VKKATGKPLV
VHLHASQFDR AGADARGWIY DIEKFGMEQA DAVIPVSKYT GTIASGHYAI DPHKIFPIHN
GADPVKVFKG KKKFPEKLVL FLGRLTAQKG PGFFLQIAAK VLEQTDDVRF VMAGTGEKLR
QLIESGAFKG VGDKFHFTGF LNKDKVNELL SITDIYCMPS VSEPFGLSAL EAAQFNIPAV
ISKQSGVAEV MKGALKADFW DVNKMAEHIV HLCTDEELYR KVVEQSTEDI KASTWDAAAD
KVIRVYEHVL NR