Gene Amuc_1667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1667 
Symbol 
ID6274629 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2017753 
End bp2020329 
Gene Length2577 bp 
Protein Length858 aa 
Translation table11 
GC content60% 
IMG OID642613725 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_001878266 
Protein GI187736154 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.611357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.331093 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAAGA TGATGGCCTG CATGCTGGGC GCTGCCCTGT GTTTTTCCAT GGCCGGAGCG 
GAAACAGGCG TTTCCTCTGC GGAAAAAGCT TCACCCCTGT TCGGGAGGAT GATCTGGAAT
CGCGGCTGGG AGTTCCGGCT TGAGGACGAG CCCGGAAAGA GCGGCTGGCA GACGGTTCAT
CTGCCCCATA CGTTCAGTCT GCCCTACTTT ATGTCCGATT CCTTTTACAC GGGGTTCGGC
TCCTACCGCA AGAAGCTTGT CATGCCGGAG GAATGGCAGG GGAAGAAGGT GTTTCTGGAT
TGCGGCGCAG CTTTCCAGGT GGCGGAGGTG AAGGTGAATG GGAAGACTGC GGGAAGGCAT
GAAGGGGGGT ATACGGCCTT CCGGTCGGAT TTGACGCCTT TCCTGAAGCC CGGCGAGAAC
TGGGTGGAGG TACGGGTGGA TAACCGCTGG AGCCCCCGCA TTGCGCCTCG GGCGGGGGAG
CATACTTTTT CCGGAGGACT GTACCGGAAC GTGCATCTGC ACGTGTGTTC TCCCGTGTAT
TTGCCCGCGC ACGGAGTATG GGTTCAGACG CCGGAGGTGA CGGCGGCCCG CGGGAAGGTC
CGGATTTTGA CGGAAGTGAA GAATGATACC GGGACAGTGA GGAAAGTGAC GGTGCGGCAT
ACTGTGCGGG ATGAGCAAAG CGGCTGCGTG GTGCTGCGGG GAGAGAAGGG CGCGGAACTG
GGGGCCGGGG AAGATTCTCG CGTACAGGCT GACCTGCCTC CTCTGGCTTC TCCCAGGTTG
TGGAGCCCTG CGGCCCCCAA TATGTACCGT GTGACGACGG AATTGTTGGA CGAGAAGGGG
AAGGTGCTGG ACCGGGCTGA AAATCCCCTG GGTTTCCGTA CTCTGGAGCT GACGGCGGAC
CAGGGTATGC TGATTAACGG GAAACCTGTT TATCTGCAGG GGGCCAATGT GCACCAGGAC
CACGCCGGGT GGGGGGATGC CGTGACGGAT GCCGGGGCCC GCCGTGATGT ACGGCTGATG
AAGAACGCCG GGTTTAATTT CATCCGCGGC TCCCATTACC CTCATTCCCA GGCTTTTCTG
GATGCTTGCG ACCGGGAGGG CATGTGCATG CTGAATGAAG GCATTTTCTG GGGAATGGGC
GGGTTCAAGG AGCATGACAG ATACTGGAAT TGCGATGCCT ATCCTGCGGA AAAGAAGGAT
CGTATTGCGT TTGAGGAAAG CTGCATGCGC CAGGTGAGGG AGATGGTGCT TCAATTCCGC
AATCATCCTT CCATCGTCAT CTGGAGCATC AGCAATGAAC CGTTTTTTAC CAGGCATGCG
CCGGAGGCCA GGGCTTTGTG CAACAGGCTC ATTACCTTGG TGAAGGAGTT GGACCCGACG
CGCCCCGTGT GCGCCGGGGG CGGGCAGCGC GGAAATTTTG ACAAGCTGGG GGACATGGCC
GCCTTGAACG GGGACGGTTC CCATGTGAAG ACGCCCGGCA GGCCCAGCAT GGTGACGGAG
TATGGTTCCG TGAGCTGCCG GAGGCCGGGG GCGTATGCGC CGGGCTGGGG GGATATGAAG
AAGGATAAGG AAGCGGGGAT TCGCTATCCC TGGCGCGTGG GAGAGGCGGT CTGGTGCGGC
TTTGACCACG GCAGCATCTG GCCTTCCGGA GGGCGCATGG GCATTGTGGA TTACTTCCGC
ATTCCCAAGC GGGCCTGGTA CTGGTACAGG AATGCGTTGA GGAATATTCC TCCTCCGGAG
TGGCCTGTAG AGGGGACTCC GGCACAGGTG AAACTGTCCG CGGACAAGAA GGTTATCAGC
CCGGCGGACG GCACGGACGA TGTGCATGTG ACCGTGAAAG TGGCGGACGC CGCCGGAAGG
CAGATATCCA ATGCCGTGCC GGTGACGCTT ACGGTGGTAT CGGGTCCCGG GGAGTTCCCT
ACCGGCAAAA GCATTACGTT CACGCCCGGA ACGGATATTG ATTTAATTGA CGGCTGTGCG
GCGATTGAGT TTCGGTCTTA TTATGCCGGG AAGACAGTGA TCAGGGCGTC TTCTCCCGGA
TTGAAGGGGG ATAGCCTCCA AATCGTTTGC CGGAATGCTC CGGCTTACGT AGCGGGCAGG
AGCGCGGAAA CGCGGGAAAG GCCCTACAAG CGTTTTAGCG CAAAGGAGAG GGATATCCAG
CTGGCGCGGT ACGGACGGCC CGAATCCGGA GAGAAAGCCA ATTTGGCTGT ATTGAGGCCC
TGTTCCGCTT CCTCCGGATT TCAGGAGACG ATGAAGGCTT CCGACGGTGA CGATGTTTCT
GCATGGCATC CGTCCGTGGA AGACAAAGCG CCGTGGTGGC AGCTGGACAT GGAGTTTGAA
TTCCGCCTGG ACCGGGTGGA GGTGAAGGCA GCCGGAAGCT GGAAGGGGCC GGCTCCCTCG
GTTCAGGTCA GCAGGGACGG AAATAGCTGG AAGGACATTA AAGCCGTTTT GCGCAAGGAT
GGGACGGAGC TGACCGCAAT GTGTCCGGAA GGGACCGGCG CCAGGTACGT ACGCATCCGC
CTGTCTCCGG GGCAGGGAAT TGCGGAAGTG GCCGTATGGC CGGCGGATGC GTCATGA
 
Protein sequence
MRKMMACMLG AALCFSMAGA ETGVSSAEKA SPLFGRMIWN RGWEFRLEDE PGKSGWQTVH 
LPHTFSLPYF MSDSFYTGFG SYRKKLVMPE EWQGKKVFLD CGAAFQVAEV KVNGKTAGRH
EGGYTAFRSD LTPFLKPGEN WVEVRVDNRW SPRIAPRAGE HTFSGGLYRN VHLHVCSPVY
LPAHGVWVQT PEVTAARGKV RILTEVKNDT GTVRKVTVRH TVRDEQSGCV VLRGEKGAEL
GAGEDSRVQA DLPPLASPRL WSPAAPNMYR VTTELLDEKG KVLDRAENPL GFRTLELTAD
QGMLINGKPV YLQGANVHQD HAGWGDAVTD AGARRDVRLM KNAGFNFIRG SHYPHSQAFL
DACDREGMCM LNEGIFWGMG GFKEHDRYWN CDAYPAEKKD RIAFEESCMR QVREMVLQFR
NHPSIVIWSI SNEPFFTRHA PEARALCNRL ITLVKELDPT RPVCAGGGQR GNFDKLGDMA
ALNGDGSHVK TPGRPSMVTE YGSVSCRRPG AYAPGWGDMK KDKEAGIRYP WRVGEAVWCG
FDHGSIWPSG GRMGIVDYFR IPKRAWYWYR NALRNIPPPE WPVEGTPAQV KLSADKKVIS
PADGTDDVHV TVKVADAAGR QISNAVPVTL TVVSGPGEFP TGKSITFTPG TDIDLIDGCA
AIEFRSYYAG KTVIRASSPG LKGDSLQIVC RNAPAYVAGR SAETRERPYK RFSAKERDIQ
LARYGRPESG EKANLAVLRP CSASSGFQET MKASDGDDVS AWHPSVEDKA PWWQLDMEFE
FRLDRVEVKA AGSWKGPAPS VQVSRDGNSW KDIKAVLRKD GTELTAMCPE GTGARYVRIR
LSPGQGIAEV AVWPADAS