Gene Amuc_0539 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0539 
Symbol 
ID6275094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp634325 
End bp637066 
Gene Length2742 bp 
Protein Length913 aa 
Translation table11 
GC content58% 
IMG OID642612589 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_001877158 
Protein GI187735046 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.160142 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAG TCACTACCAT TCTCATAACG GCGGCATGTC TGGCCGGCTC TTTTTCCCTT 
TATGCCGATG CGGTTTCCGA TCAGGAGGAG GCCGCTTCAT CTACGGGAGC CTATTCCGTG
GCAGGGGCTC TTCGCCTGCC GGAAAACGTA ACGAAAAGGG ACGTTTACGG CATGAATGTG
GGCTGGAAAC TGTTTAAGGG AGAGAAGGAG CCGGAGGAAG CGGCCAGCCC GGATGTTGAC
GATTCCTCCT GGGAGTCCGT CAACCTCCCC AACGGCATTG AGCTTCTTCC GGAAGAGGCC
AGCGGCTGTT CCAACTACCA GGGGCCGGTA TGGTACCGGA AAACATTTAT TCCTCCCTCC
CGGCTGGAGG GGAGGAGAAA CACCCTGTAT TTTGAGGGCA TCATGGGAAA GAGCGAGGTC
TGGGTGAATG GGGAGAAGGC CGCGGAGCAT TTCGGCGGAT ATCTTCCCGT GATTGTTAAT
CTGGACAAGT GGCTGAAACC GGGACAAAAG AATGTCATCG CCGTGAAGGC GGACAATTCC
AACGACGCTT CCTATCCCCC CGGCAAACCC CAGGAGGGCC TGGATTTTTC CTATTTTGGC
GGTATTTACC GGGATGTGTA CCTCATTTCC ACCGGACCCG TGTATATCAC GGACCCGAAC
GAGGCCGGAA CTGTGGCCGG GGGGGGAGTT TTTTTCCGGA CGGAATTTCT TGATCCCCGT
ACCCGCAAGG GCAAGGTGGG GGTGAAGGTT CAGGTGGCCA ACCAGACAGA CAAGGAACGG
AAGGTGCGTG TGCAGGCGGT GATGACGGAC CCCAAGGGGG TGGATCCTGC CGGGGAGACG
GTTCCGCTGA CTATTCCTCC ACATTCCACG GGTGAAGTGG ATATTCCTCT TGTGCTTTCC
AACGTGAAAC CGTGGTCTCC GGACAATCCC GATTTGTACA CGTTGAGCGT GGAAGTATGG
GACGCTGCTG GAGGGGACCA TTCCAAAAAT CCGGCCCATT TGCTGGACAA CCGTTCCATA
AGGGTGGGTG TCCGGACGGT GGAAATTACG GAAAAGGGGC TGGTGCTCAA TGGATCCCTG
TTTCCTGAAA AGCTGATTGG CGGCAACAGG CACCAGGATT TTGCCCGGCT TGGGAATGCA
GTGCCCAACA ACCTGCAATG GCAGGATGCC GTGAAGTTGA GGAAGGCAGG CATGCGCGTG
ATCCGGAGCG CCCATTATCC GCAGGATCCG GCTTTCATGG ACGCCTGCGA CCGTCTGGGC
CTTTTCGTCA TCGTCGCCAC GCCGGGCTGG CAGTTCTGGG GCAGCGGCCC TTTTGCGGAT
CGGGTGTATG ACGATATCCG CCAGATGGTG CGCCGGGACA GGAACCATCC CTCCGTGATG
ATGTGGGAAC CTATTTTGAA TGAAACTCAT TACCCGGCGG ATTTCGCCAA AAAGGCCCGC
GACCTGGTGC ATGAGGAATA TCCGTACAAG GGATGCTACA CCGCTTGCGA CGCGGTGGCT
CAGGGCAGCC AGCATTATGA AGTTCTGTAT GCCCACCCGG TTACGGGGGA CAAACACTGG
TCCATCAAGG AACGGAAGAA CAGCAAACCG TATTTTACCC GTGAATTCGG TGATAACGTG
GATACCTGGT CCGCCCACAA CTCCACCTCC CGTGTAGCCC GTCACTGGGG TGAAGCTCCC
ATGATGGTGC AGGCCCTTCA CTACCTCAAG ACCTCGTTCC CATATACGAC GTACGATACG
CTGAACGCCG CTCCGGCCTA TCATTTCGGC GGCTGCCTGT GGCATCCTTT CGACCATCAG
CGCGGTTACC ATCCGGATCC CTTCTACGGC GGTATTCTGG ATGCTTTCAG GCAGCCCAAA
ACCTCCTATT ACGCATTTAT GTCCCAGCGT CCCCAGAAGA CGCGCAACGG GTTGGGTTCC
GGCCCTGTGG TGCACATCGC CAATGAATGC ACTCCCTTTT CTCCGGAAGA CGTGACGGTG
TTTTCCAATT GTGATTCCGT TCGCCTGAGT GTCAACGGAG GGCCCCCGGT GGAAAAGAGG
GTGGCGTCCT GTCCCGGAGG CCTCAAGCGC GTGCCCGTCG TGTTCCCTAA AGCCTGGGAT
TTCATGGAAA ACAAGAAGCT TGCCCGAGCA GGCAAGGAGG GAGCGGTGAA ACTGGTGGCG
GAAGGCCTGA TAGGCGGCAA GGTGGTGACC AGGCATGAAG TCCGCCCGGC CCGGAGGGCG
GAGAAAATCC GGTTGACTCT GGACCGGGAA GATGGAGTGG ATTTGTACGC CAACGGTTCC
GATGTGTTTG CCGTGGTGGC GGAGGTTACG GATGGCCGCG GAACAGTGAA GCGCCTGAAT
GATGAGGAGA TTGTCTTTTC CGTGGAAGGC CCCGCCGAGC TGCTGACGGA TTCCCCGGAC
GGCACGCTGA CCCAGTCCGT TAAATGGGGT TCCGCTCCGG CCCTGGTGCG CCTGGGGGCA
ACACCCGGCA CGGTGACGGT GAGAGCCTCC GTGAAGCATC CCGGTTCTCA AAAGCCCGTT
TCCGGCGTCA TAAGGTTTGA CACGAAGGCT CCCGGGTTGA AAATGCTCTT TACGGAAGAC
GCCGCAGGCA GCGCGAAGAA GGTTGCCGGT ACGCCGTCTT CCTCCGAGGC TCCTGCCTCC
GAACGGGAGA AATCCCTTCA GATGGAGCTG GAAAAAGTGC GCCATGAACT GAACAGGTTG
CGCAATGAAA AAGTGAGCGC CCAGCAAACC CACTTCGAAT AA
 
Protein sequence
MMKVTTILIT AACLAGSFSL YADAVSDQEE AASSTGAYSV AGALRLPENV TKRDVYGMNV 
GWKLFKGEKE PEEAASPDVD DSSWESVNLP NGIELLPEEA SGCSNYQGPV WYRKTFIPPS
RLEGRRNTLY FEGIMGKSEV WVNGEKAAEH FGGYLPVIVN LDKWLKPGQK NVIAVKADNS
NDASYPPGKP QEGLDFSYFG GIYRDVYLIS TGPVYITDPN EAGTVAGGGV FFRTEFLDPR
TRKGKVGVKV QVANQTDKER KVRVQAVMTD PKGVDPAGET VPLTIPPHST GEVDIPLVLS
NVKPWSPDNP DLYTLSVEVW DAAGGDHSKN PAHLLDNRSI RVGVRTVEIT EKGLVLNGSL
FPEKLIGGNR HQDFARLGNA VPNNLQWQDA VKLRKAGMRV IRSAHYPQDP AFMDACDRLG
LFVIVATPGW QFWGSGPFAD RVYDDIRQMV RRDRNHPSVM MWEPILNETH YPADFAKKAR
DLVHEEYPYK GCYTACDAVA QGSQHYEVLY AHPVTGDKHW SIKERKNSKP YFTREFGDNV
DTWSAHNSTS RVARHWGEAP MMVQALHYLK TSFPYTTYDT LNAAPAYHFG GCLWHPFDHQ
RGYHPDPFYG GILDAFRQPK TSYYAFMSQR PQKTRNGLGS GPVVHIANEC TPFSPEDVTV
FSNCDSVRLS VNGGPPVEKR VASCPGGLKR VPVVFPKAWD FMENKKLARA GKEGAVKLVA
EGLIGGKVVT RHEVRPARRA EKIRLTLDRE DGVDLYANGS DVFAVVAEVT DGRGTVKRLN
DEEIVFSVEG PAELLTDSPD GTLTQSVKWG SAPALVRLGA TPGTVTVRAS VKHPGSQKPV
SGVIRFDTKA PGLKMLFTED AAGSAKKVAG TPSSSEAPAS EREKSLQMEL EKVRHELNRL
RNEKVSAQQT HFE