Gene Amuc_0290 details

Gene Information

Locus tag: Amuc_0290
Symbol:
ID: 6275131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 347442
End bp: 350402
Gene Length: 2961 bp
Protein Length: 986 aa
Translation table: 11
GC content: 56%
IMG OID: 642612344
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_001876913
Protein GI: 187734801
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
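
The coordinate and length fields above are mutually consistent if the start and end positions are read as 1-based, inclusive coordinates and the protein length excludes the stop codon. Both readings are assumptions inferred from the listed numbers, not something the record states. A minimal Python sketch of that arithmetic (the function names are illustrative only):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(347442, 350402)
print(length)                     # 2961
print(protein_length_aa(length))  # 986
```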


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCACCA ATATTACACG CACGACCCTC TGCATTACTG CGTTCAGCAT CGCCAGCCTC 
ATGGCGGCGC CTCTGAATGC AACCAAAACG GAGAGCCTGG ATTGGAACTG GAAATTCGCC
CGTTTCGGGA AAATGCCGGA TGGCAGTACG CAACCGGAAC CGGGAAAAGC CATGGGATTC
GCCACTGCCA CCAGTGAAGA ATCCGGCAAT CCGGCGGACA ATGCCGTGGA CGGGGACAAG
TCCACCCGCT GGTGTGCCGC CAGTGGCAAA AGCGGAGAAA AAATCACCGT GGACATGGGA
CGCCCCGTAG ATGTAAAAAC CGTTAACATC CTGTGGGAAA AACAAAGCAA CCATCTTTTC
AAGCTGGAAG GCTCCGGTGA CGGAAAACGC TGGGCAACTA TTGAAGACAA AACTTCCGGG
CAAAACGACT CCAAGGAAGA CACGGTAGAA AACAAAACCG GCAAACCGCG ATACTTCCGC
ATCACCGTCA CGGGCAACAA CCAGAGCAAC TGGGCCAGCA TCCGTGAAAT CACCTTTAAA
AACGACAAGG GGGAAATTAT CCGCCCTCAG GCCGCCGCCG GAACCAGTAA GGCGGACAAT
CCCTCCAGCC CCTCTTTCAA CGACAAAAAC TGGCGTTCCT TGAACCTGCC GCACGACTGG
GGCGTGGAAG GACCCTTCCG GATGGAAATT GAAAACAGAA CCGGAAAACT CCCCTGGGTC
GGCATTGGCT GGTACCGCAA AACGCTGGAA ATCCCGGCGG ACGCCAAGGG CAACCAATTC
TATCTGGACT TTGACGGCGT TATGTCCCGC CCCAAAATTT ATGTGAACGG ACACCTGGCC
GGCGAATGGA AATACGGTTA CAGCTCCTTC CGCGTGGACA TCACGCCCTT CCTGAAATTC
GGGCAGCAAA ATACCATTGC CGTCAGAGTG GACAATCCCC CCAGCACCTC CCGATGGTAT
CCGGGCGGCG GCATCTACCG CCATGTGTGG CTCACGGAAT CCAACCCTGT GCACATCGAA
CACTGGGGCG TTTTCGTCAA AACTCCGGAA ATCACCAAAT CCGCCGCCAA GGTAGAAGTG
GACACCACGG TGAAAAACAC CACGGACAAA GCCGTCATCC CCACTGTTAC TGAAGAAATC
CTGGACGGAG GTAAAATCGT AGCCTCCACA ACCACCAAAG GGGAAGAAAT TCCCGCCGGG
GAAAAGGGCA AAATCACCAG TACGCTGACG CTCAAAAACC CCACTCTGTG GACGCTAAAC
GCTCCCCATC TGTATAAGAT GAAAACCACG GTCAGGATGG GAGACAAAGT CATAGACCAA
AAATTCACCA ACTTCGGCGT AAGAACCGTT GAATGGAAAC CCACGGGATT CTACCTTAAC
GGGGAGCGCG TGCAGCTCAA GGGCGTTTGC CAGCACCATG ACCTGGGACC GCTCGGCTCC
GCCGCCCACA CGCGAGGCTA TGAACGCCAG ATTGAAATCC TGAAGGAATT CGGCGTCAAC
TCCATCCGCA CGTCCCACAA CCCGCCTGCG CCGGAAGTGC TGGACCTGTG CGATAAAATG
GGCATCCTGG TCATTGACGA ACTTTTCGAC GTATGGCAAT GCTCCAAAGA AGGCGTCAAC
AACGAATCCT TTAACGAATG GCATGAACGG GACGTGGTTA ACCTCTGCCA CCGGGACCGA
AACCACCCCT GTGTCATTGC ATGGAGTTCG GGAAATGAAG TTCCGGAACA GGGAATGAAA
AATCTGCACC ATATCTCCCA AACCCTGACG GATCTTTTCC ACCGGGAAGA CCCCACGCGT
AAAGTGACTT CCGGCTGCAA CAACGCCAAT GCCGCACGCA ACGGCTTTGG GGACACCCTG
GATGTTTACG GCTATAACTA CAAGCCCTGG GCCTACAAGG ACTTCGCCAA GGACCGCCCC
CACCAGCCGT TCTATGGTGC GGAAACCGCC TCCTGTGTCA GCTCCCGCGG AGAATACTTC
TTCCCCGTGG ACTGGAACAA AGGCAAGGGA TTCTACCTCT ACCAGGTCAG TTCCTATGAC
CTGTACGCCC CCGGCTGGGC CAACCGTCCG GATGTGGAAT TCGCCGCTCA GGAAGACAAT
CCCAACAGCG CGGGAGAATA TGTATGGACG GGCTTTGACT ACATTGGGGA ACCCACCCCG
TACAATCTGG ACGCCACCAA CGCCCTGAAC GTGCCGGAAG GGCCGGAACG CGAAAAGCTG
ATGGCGGAAC TCAAAAAACT GGGAGACCGC GCCCCCTCCC GCAGCTCCTA CTTCGGCATC
GTGGACCTGT GCGGCTTCAA AAAGGACCGC TTCTACATCT ACCAGGCCCA CTGGAGGCCG
GATCTCAAGA TGGCGCACAT CCTGCCGCAC TGGAACTGGC CGGAACGCAA GGGGCAGGTA
ACGCCCGTGC ATGTCTACAC CAGCGGGGAT GAAGCGGAAC TCTTCCTGAA TGGGAAATCC
CAGGGCGTCC GCAAAAAGGG CACCGGGGAA AAGGACCGCT ACCGCCTCGT GTGGGAAGAC
GTTAAATACA CGCCCGGCAC CCTCAAAGTA GTCGCCAAAA AGGACGGTAA AATCTGGGCT
ACGGACACGG TAACCACTAC CGGAAAACCT GCGGCGCTCA CCCTCAAGCC GGACCGCAAT
GAAATCAAGG GAGACGGCTA TGACCTGTCT TATGTCACCG TAGCCGTCCG CGACGCCCAG
GGCCGTATGG TGCCCCGAAG CAAAAACCAG CTCACCTTCA AGGTAAGCGG CCCCGCGGAC
ATCGCCGGCA TCTGCAACGG TGATCCCACG GACTTCACCA CCATGGCGAA TCCGGAAAAC
AAGAAAATCA TGAAAATCAA GGCCTTCAAT GGTCTTGCCC AGGTCATTCT GCGCTCCCGC
AAGGGAGAAT CCGGAAAAGT GACGCTCCAA GTCATCTCCA ACGGACTCAA GCCGGCTCAG
ACAACTGTGA CGGTCAAATA A
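
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. As a rough sketch, the GC content field of a record like this one could be recomputed from the pasted sequence as follows. The helper name `gc_content` is hypothetical, and how the database rounds its percentage is not stated in the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring spaces and newlines."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Usage: paste the full gene sequence (all lines) into one string.
first_line = "ATGTTCACCA ATATTACACG CACGACCCTC TGCATTACTG CGTTCAGCAT CGCCAGCCTC"
print(f"{gc_content(first_line):.1f}% GC in the first 60 bases")
```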
 
Protein sequence
MFTNITRTTL CITAFSIASL MAAPLNATKT ESLDWNWKFA RFGKMPDGST QPEPGKAMGF 
ATATSEESGN PADNAVDGDK STRWCAASGK SGEKITVDMG RPVDVKTVNI LWEKQSNHLF
KLEGSGDGKR WATIEDKTSG QNDSKEDTVE NKTGKPRYFR ITVTGNNQSN WASIREITFK
NDKGEIIRPQ AAAGTSKADN PSSPSFNDKN WRSLNLPHDW GVEGPFRMEI ENRTGKLPWV
GIGWYRKTLE IPADAKGNQF YLDFDGVMSR PKIYVNGHLA GEWKYGYSSF RVDITPFLKF
GQQNTIAVRV DNPPSTSRWY PGGGIYRHVW LTESNPVHIE HWGVFVKTPE ITKSAAKVEV
DTTVKNTTDK AVIPTVTEEI LDGGKIVAST TTKGEEIPAG EKGKITSTLT LKNPTLWTLN
APHLYKMKTT VRMGDKVIDQ KFTNFGVRTV EWKPTGFYLN GERVQLKGVC QHHDLGPLGS
AAHTRGYERQ IEILKEFGVN SIRTSHNPPA PEVLDLCDKM GILVIDELFD VWQCSKEGVN
NESFNEWHER DVVNLCHRDR NHPCVIAWSS GNEVPEQGMK NLHHISQTLT DLFHREDPTR
KVTSGCNNAN AARNGFGDTL DVYGYNYKPW AYKDFAKDRP HQPFYGAETA SCVSSRGEYF
FPVDWNKGKG FYLYQVSSYD LYAPGWANRP DVEFAAQEDN PNSAGEYVWT GFDYIGEPTP
YNLDATNALN VPEGPEREKL MAELKKLGDR APSRSSYFGI VDLCGFKKDR FYIYQAHWRP
DLKMAHILPH WNWPERKGQV TPVHVYTSGD EAELFLNGKS QGVRKKGTGE KDRYRLVWED
VKYTPGTLKV VAKKDGKIWA TDTVTTTGKP AALTLKPDRN EIKGDGYDLS YVTVAVRDAQ
GRMVPRSKNQ LTFKVSGPAD IAGICNGDPT DFTTMANPEN KKIMKIKAFN GLAQVILRSR
KGESGKVTLQ VISNGLKPAQ TTVTVK
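
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below is a minimal pure-Python illustration, not the database's own pipeline. It assumes that table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. That difference does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the block spacing and line breaks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGTTCACCA ATATTACACG CACGACCCTC"))  # MFTNITRTTL
```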