Gene Amuc_1751 details

Gene Information

Locus tag: Amuc_1751
Symbol:
ID: 6274983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2132696
End bp: 2134732
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 60%
IMG OID: 642613814
Product: glycoside hydrolase family 13 domain protein
Protein accession: YP_001878350
Protein GI: 187736238
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0296] 1,4-alpha-glucan branching enzyme
TIGRFAM ID:
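
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and only serve the illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2132696, 2134732)
print(length)                           # 2037
print(expected_protein_length(length))  # 678
```

Both values match the Gene Length and Protein Length fields of this record.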


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.913589
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAGAC CCCCGATACC CGGCTTGGTG ATGGCGGACG GATGGCTCCA GCCGTATTCC 
CGCCAGATAC GCGACCGCCA GCGTCTGTTT GACCTGAAAA TGAAGAGGAT CAATCAGCGC
GCCGGCTCTC TGGAGGAATA CGCGCGCGGA TACCGCTATT ACGGTTTCAA CAGAGACGCG
GAGACGGGGG CGTGGACATA CCGGGAATGG GCTCCCGCCG CCCGCAGGGT GTCCCTGATT
GGGGATTTTA ACGGCTGGAA CCGGGAGAGC CACCCGCTGG AGCGGAATGA GCGCGGCGTG
TGGGAAATTA CGCTGCCGCC TGATGCGCTG GCCCACGGGC AGAAGGTGAA GGTGCACGTG
GTCGGTGCGG ACGGCACGGG CAGAGATCGC ATTCCCGCAT GGATTACCAG GACCGTCCAG
GATCCGACCA CTTATGATTT TGCCGGGGAG ATCTGGATGC CGGAACACCC CTATGAATGG
CGGAATAACG GTTTTGATCC CTCCCGGGTG GAAGTTCCGT TCGTGTATGA AGCGCATGTA
GGCATGGGCG GGGAAGAAGG GCGCGTGCAT ACGTACCGCG AGTTTGCGGA CGAGGTTCTC
CCCCGGATCG CCAGGCTGGG TTACAATACC GTCCAGCTGA TGGCGATCCA GGAGCACCCC
TATTACGGTT CCTTCGGCTA CCACGTTTCT TCCTTTTTCG CCCCTTCCTC CCGTTTCGGC
GAGCCGGAGG ACCTGAAGTA CCTGATAGAC CAGGCGCACG GCCTGGGCAT CGCCGTGCTG
CTGGACGTGG TGCATTCCCA CGCCGTGAAG AACGAGGCGG AAGGGCTGAA CAATTTTGAC
GGTTCCGGAG GCATGTATTT CCTGCCCGGG GAGCGCGGCC GCCATCCGGA CTGGGATTCC
TGTTGTTTTG ATTATGGCCG GGACGAGGTG ATTGAATTCC TCCTGTCCAA TGTCCGCTGG
TGGCTGGAAG AGTTCCGTTT TGACGGCTTC CGTTTTGACG GCGTGACATC TATGCTGTAT
TTCCACCGCG GGCATGAGCC GTTCGGGGAT TTGGGCGCCT ACTTCGGCTC TTCCGTGGAT
CTGGATGCCG TGGCTTATCT GCAGCTGGCC GCCACGCTGA TTCAGCGGGT GAAGCCGGGC
GCCATAGCGA TTGCTGAGGA CATGTCCGGC ATGCCGGGAT TGTGCCGCCC GGTGGACGAA
GGGGGGATTG GTTTTTCCCA CCGTCTGGCC ATGGGCATTC CCGATTACTG GATTAAGCTG
CTCAAGGAGA AAAAGGATGA GGAATGGAGC ATGGGCGATA TGTGGCACAC GCTGACCAAC
AGGCGCTACG GCGAGCCGCA TGTGGCCTAT TGCGAGAGCC ATGACCAGGC CCTGGTGGGG
GACAAGACTC TGGCGTTCCG CCTGATGGAT GCGGAAATGT ACTGGAAAAT GGCCGTGGAC
CAGCAGAGCC TCATTATTGA CCGCGGCATG GCGCTGCACA AGATGATCCG CCTGGTGACG
TTGGCTACCG GGGGGGAAGG CTGGCTGAAT TTCATGGGCA ACGAATTCGG GCATCCGGAA
TGGATTGATT TTCCGCGCGA AGGCAACGGC TGGTCTTACG AATACTGCCG CCGCCAGTGG
TCCCTGGTGG ACAATCCTTC CCTCAGGTTC AAGTTCCTGA ATGCCTTTGA CCAGGCGATG
GTCCGCCTGG CGCAGGAAGC CCGCCTGCTG AATAATCCGC CGCCTTTCCC GCTTAATATT
GACGAGACCA ACCATGTCAT GGCATTCCAC CGCGGCGGTC TGTTGTTTGT GTTCAACTGG
TCCGGAGACA GGGCTATCAT GGATTACATG CTGCCCGTTC CCCAGAAGGG GGAATGGCGG
GTAGTGCTGG ATACGGACAA CGCCCGTTTC GGCGGTTTCG GGAGGCAGGA TGTTTCCATG
CCGCATTTTA CGGATGGGGA GGGGAATCTT TCCCTGTACC TGCTGCCGCG TACGGCCCTG
GTCCTGAAGA GGGTAGGTTC CGCCGTCATG GCCCGCCACC CGGGGCGGGA GGATTAG
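
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field of this record. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "ATGAGAAGAC CCCCGATACC CGGCTTGGTG ATGGCGGACG GATGGCTCCA GCCGTATTCC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this first line only
```

Passing the full 2037 bp sequence through the same two functions gives a figure that can be compared with the reported GC content.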
 
Protein sequence
MRRPPIPGLV MADGWLQPYS RQIRDRQRLF DLKMKRINQR AGSLEEYARG YRYYGFNRDA 
ETGAWTYREW APAARRVSLI GDFNGWNRES HPLERNERGV WEITLPPDAL AHGQKVKVHV
VGADGTGRDR IPAWITRTVQ DPTTYDFAGE IWMPEHPYEW RNNGFDPSRV EVPFVYEAHV
GMGGEEGRVH TYREFADEVL PRIARLGYNT VQLMAIQEHP YYGSFGYHVS SFFAPSSRFG
EPEDLKYLID QAHGLGIAVL LDVVHSHAVK NEAEGLNNFD GSGGMYFLPG ERGRHPDWDS
CCFDYGRDEV IEFLLSNVRW WLEEFRFDGF RFDGVTSMLY FHRGHEPFGD LGAYFGSSVD
LDAVAYLQLA ATLIQRVKPG AIAIAEDMSG MPGLCRPVDE GGIGFSHRLA MGIPDYWIKL
LKEKKDEEWS MGDMWHTLTN RRYGEPHVAY CESHDQALVG DKTLAFRLMD AEMYWKMAVD
QQSLIIDRGM ALHKMIRLVT LATGGEGWLN FMGNEFGHPE WIDFPREGNG WSYEYCRRQW
SLVDNPSLRF KFLNAFDQAM VRLAQEARLL NNPPPFPLNI DETNHVMAFH RGGLLFVFNW
SGDRAIMDYM LPVPQKGEWR VVLDTDNARF GGFGRQDVSM PHFTDGEGNL SLYLLPRTAL
VLKRVGSAVM ARHPGRED
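
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the compact amino acid string used for the standard genetic code; translation table 11 uses the same codon-to-amino-acid assignments and differs only in its set of permitted start codons, which does not matter here because the gene begins with ATG. The `translate` function is a hypothetical helper, assumes unambiguous A/C/G/T input, and stops at the first stop codon.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAGAAGAC CCCCGATACC CGGCTTGGTG"))  # MRRPPIPGLV
```

The output matches the first block of the protein sequence above. Likewise, the final three codons of the gene, GAG GAT TAG, translate to the terminal residues ED followed by the stop codon.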