Gene Amuc_2089 details

Gene Information

Locus tag: Amuc_2089
Symbol:
ID: 6275814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2540032
End bp: 2541171
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 63%
IMG OID: 642614151
Product: glycosyl transferase group 1
Protein accession: YP_001878679
Protein GI: 187736567
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
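
The start, end, gene length and protein length fields above are consistent with each other if the coordinates are read as 1-based and inclusive, and if the CDS ends in a stop codon that is not counted as a residue. Both points are assumptions about how this record was built, not something the page states. A minimal sketch of that arithmetic, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2540032, 2541171)
print(length)                     # 1140
print(protein_length_aa(length))  # 379
```

Both printed values match the Gene Length and Protein Length fields of this record.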


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.44458
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCCCA TATTACAAAC CATCGCGTCC CTTGACTCCC GTTCGGGCGG CACCAGCACC 
TGCACTTATG ACCTGGTCAA AGCCCTGAAC GCTTCCGGCA TGCCCACGGA CATCCTCACC
CTCCAGCCCG GCTCCCCACA AGAACGGATG GTGGGAGAAG ACAGCTTTAT CCATGCCTGC
CCGTTTGACG CCCGCACCCC TCTCGCCGTC TCCCGGAACA TACGCCGCTT TCTGGCCGGC
TCCCGGTACC GCCTCTACCA CACCAACGGC CTGTGGCTGG ACGTCAACCA CGCCACATGC
GCGCACGCGC GCAAAACGGA TGCCCCCTGC GTCGTTTCCC TGCACGGCAT GCTGTACCCC
CAGGCGCTGG AGCGCGGCGG CTGGAAAAAG AAACTCATGC TCGCCCTGGG ACACCGGAAA
GACATCTCCG GAGCCGCCTG CGTCCACGTC ACCTGCGAAA AGGAAATGGA ATACTACAGG
GACATGGGCT TTTCCAACCC CGTGGCCGTC ATCCCCAATC CGGTGCGGAT CCCGGAATAT
CTTGCGGATA TCAGGCGCCC CGGGCACGAG GGCTTCCGGG CCGGATTCCT GGGCAGACTC
CATCCCATCA AAAATCTGGA GGCCCTGATC ACGGCCTGGG GTCAGCTGCG CCTCCCGAAC
GCGGAGCTTC TGCTCATCGG TGACGGAGAC CCGGAATACA AGGCCCGGCT GGAAGAACTG
GTCCGGGAGG AAAACATTTC CAATATCTCC TTCACGGGCT TTGTTTCCGG AAGGCGGAAA
TATGAAATGC TCTCTTCCCT GGACGTCCTG TGCGCCCCCA GCCACCAGGA AAACTTTGGA
ATGAGCATTG CGGAGGCCCT GCTGGCGGGA ACGCCCGTCA TCGCCAGCCG GGGAACGCCG
TGGGAAGCGC TGAACACCCG CCGGTGCGGC TGGTGGTGCG GCAACGATTC CTCTTCCCTG
GCCGCAGCCC TGGAAAACGC CTTCAACCTC TCCCCGCAGG AAAGGCTCGC CATGGGAGAC
CGCGGACGCT CCCTCGTCAT GGAAACCTGC GCCGCCCCCC ACGCGGCCGA CCGCATGAAA
CGCCTGTACC GGTATCTGCT GGGGCAGGAA GCCAAACCGG AATTTGTCTA TCTCCCATGA
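
The GC content field in this record is a single percentage for the gene. One plausible way to derive such a figure from a sequence laid out in blocks of ten bases, as above, is sketched here. The helper name `gc_fraction` is hypothetical, and the sketch assumes whitespace is only layout and that only G and C count toward the numerator. The sample is just the first two blocks of the gene, not the full sequence.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence, used only as a small sample.
sample = "ATGGCTCCCA TATTACAAAC"
print(f"{gc_fraction(sample):.0%}")  # 40%
```

Passing the full gene sequence to the same function gives the gene-wide value to compare against the GC content field.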
 
Protein sequence
MAPILQTIAS LDSRSGGTST CTYDLVKALN ASGMPTDILT LQPGSPQERM VGEDSFIHAC 
PFDARTPLAV SRNIRRFLAG SRYRLYHTNG LWLDVNHATC AHARKTDAPC VVSLHGMLYP
QALERGGWKK KLMLALGHRK DISGAACVHV TCEKEMEYYR DMGFSNPVAV IPNPVRIPEY
LADIRRPGHE GFRAGFLGRL HPIKNLEALI TAWGQLRLPN AELLLIGDGD PEYKARLEEL
VREENISNIS FTGFVSGRRK YEMLSSLDVL CAPSHQENFG MSIAEALLAG TPVIASRGTP
WEALNTRRCG WWCGNDSSSL AAALENAFNL SPQERLAMGD RGRSLVMETC AAPHAADRMK
RLYRYLLGQE AKPEFVYLP
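
The protein sequence corresponds to the gene sequence read under translation table 11. A minimal sketch of that translation is given below, assuming the standard codon-to-amino-acid assignments (which table 11 shares with the standard code) and stopping at the first stop codon. The function name `translate` is hypothetical, and the sketch does not handle alternative start codons, which table 11 permits but which do not matter here because the gene begins with ATG.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: amino_acid
    for (first, second, third), amino_acid in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES),
        AMINO_ACIDS,
    )
}


def translate(sequence: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGCTCCCA TATTACAAAC CATCGCGTCC"))  # MAPILQTIAS
```

The ten residues printed match the first block of the protein sequence in this record.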