Gene Amuc_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2108 
Symbol 
ID6274499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2568207 
End bp2569166 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content53% 
IMG OID642614170 
Productglycoside hydrolase family 16 
Protein accessionYP_001878698 
Protein GI187736586 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2273] Beta-glucanase/Beta-glucan synthetase 
TIGRFAM ID[TIGR02595] PEP-CTERM putative exosortase interaction domain 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.369089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATCA GGACAACGCT CCTTTTTCCT ATTCTTTTTC TCTCACTTTC AGGATCCTCC 
GTCATGGCGG CAACCCCTTG GGTTTCTGAC AGGAACTGGG AACTCGTTTT TGAAGACAAT
TTTGACGGCT CATCGCTGAA CGCACACAAC TGGAGCCGCA TTGATTACGT AGGCTATAAT
GCCCCGGACT GGCGCAAGTA CCAATCCCGG GACGAAAGCC TTGTGGAATT CCGGGAAAAG
GACGGCAACT CCGCCATGAC CCTGTGGGGA AAATACGGGG ACTACACCAC CCAAACCAAC
CAGACTGCCC CAGCCAGGAC ATACGCCTGC GGAGGGGTAT ATTCCCTGAA AACCTTCTCC
TTCCAATATG GATACGTAGA AGTCCGCGCC AGATTCGACT GTGTGCAGGG CGTCTGGCCG
GCCATCTGGA TGATGCCCAA ATCCGACAGC ATCGGCTGGC CTGTCGGAGG GGAAATTGAC
ATCATGGAAC ACCTGAATTA CGAAGGCCGT GTTTACCAGA CAATCCACTG GTCGCAAAAC
GGCGTTCCCA ACCAGGATAA CTCCCAGGGG GTCACCCCCG GTTGGAACGA TGGTGCCGAA
AAAGCAAACT GGCATACCTA CGGGATGGAA TGGACGGAAG AAGGCATCAC CTTTTATGTG
GATGGAAAAG CAACCGGTTC ATTCAAAAAG CCCAATAACG CAAACTGGCC CTTTGACAAG
GACGGAAACG AATTCTACCT GATCATCGAC CAGCAGATTG GAGGCAGCTG GGTGGAAAAC
GCAGGAGTTA ATAAGGGAAT CGACCAAAAT ACGCTGGCCA ATTCCGGAGC CGCATTCGAC
ATCGATTATG TCAAAGTCTA TTCCTCAAGC ATCTACAACC ACCTCGTTCC GGAACCCGCT
GTGGCTTCGC TGGGCCTGTT GGGAATGGCC TTGCTGGCGG CTCGCCGCAA AAGAAACTGA
 
Protein sequence
MFIRTTLLFP ILFLSLSGSS VMAATPWVSD RNWELVFEDN FDGSSLNAHN WSRIDYVGYN 
APDWRKYQSR DESLVEFREK DGNSAMTLWG KYGDYTTQTN QTAPARTYAC GGVYSLKTFS
FQYGYVEVRA RFDCVQGVWP AIWMMPKSDS IGWPVGGEID IMEHLNYEGR VYQTIHWSQN
GVPNQDNSQG VTPGWNDGAE KANWHTYGME WTEEGITFYV DGKATGSFKK PNNANWPFDK
DGNEFYLIID QQIGGSWVEN AGVNKGIDQN TLANSGAAFD IDYVKVYSSS IYNHLVPEPA
VASLGLLGMA LLAARRKRN