Gene Amuc_1438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1438 
Symbol 
ID6275752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1725597 
End bp1727516 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content58% 
IMG OID642613497 
ProductGlycosyl hydrolase family 98 putative carbohydrate binding module 
Protein accessionYP_001878041 
Protein GI187735929 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTTA TGTCGAAACG TTTTTTTGCC TTGTTGCTGG TCCTGGGGTC CGGGATATGG 
AGTGTTCCTG CCATGGGGAT GGATGAGGAG TCAGCTTCCA GGGCATCTGT TCCTGCCTCC
TCGGACAGGG AGGGAGCGGA GTTTACCCGT CTGCCCGTCA GCTGGACGGT TAATCCCAGG
GATGCCGCCA ATGCCCGTGC CGCCTGGAAA ACGCTGAGCG CCTATCATCG GGGCAAACCG
AAGTCTTCCA GAAAATTGCA TGTGGTTTAC GTAACGTTCA AGGACAGGCC TGCCCTTGAA
GGATACCGGG AACGGTATGA TCATATTCTG AAGAATATTC AGGCATATTA TGCGGACCAG
ATGCAGGCCA ACGGTTTCCC TCCGCTCACA TTCCAGCTGG ATTTGGATGA ACGGGGCAAG
CTGGTTATTC ATGATGCCTA TGTGGATAAA CCCATGAGCG AGATGAGCGT GCAGAGCTCC
GGCCCCGTCT CCCGGGAAGC CGCCAGGAAG GTGCTGGCTT CCAAGGGGAT TGATATTGAG
AAGGAACATG TGCTGGTCGT TTGCCAGCTT CCGGACGGCG TGGGTCCTTA TTACGGCGGC
GGCTTCAGCC ACCAGGGCAC GGGGTGGACC TGTGACCAGG AAGGGCTGGA TCCCGCCAGC
TTTCTGGATA CGGAAATGAT GCAGGGCGGC CGTTTCAAGG TGACCAGGGG AAAGAACGCC
ACCATTTACA TAGGCGGTAC GGCCCATGAA TTGGGGCATT CCTTCGGCCT TCCCCACACG
GGCGACGGAT GGAATTACCC CGACGCCGGA GCTTCCCTGA TGGGCCATGG CAATTCCACC
TACGGCGACG AGCTGCGCCA TGAAGGGAAG GGTGCCTATT TGGCGCCGAC GGATGCTTTG
AAACTGGCCA GTGTTCCCCT GTTCAACGGG GTGGAGACGG AACTTCCCGC AGACGCCTCC
TTCGGGCGTA TGCTCGGCAA GTATGTCCCG GGGTCTTTTG AACGGCTGGA GGCCATTCCC
GTTAAAGACG GATTGCGGCT GAAAGGGAGG GTTCATCTGA CGCGCCCCGC GTATGGCATT
GTTGCCCATT TGGATCCGCC CGGAGGTTCG GATTATGATT CCAATGCCGT GGGCGCTTCT
CTGGATGAAA AGGGAGAGTT TGATCTCACC ATCTGCAGGC CGGGATATAA GGGTGGTTTT
ATAGAAATGC GGGTGGCCGT ATTGAATTGC GACAGCACGC GCAGCATGAT TACTCTGCCC
GTGTGGATGG ATGCCCGGGG AACCAAGGCC CCTTCTCTGG CCCAGATCGT TTATTTCGGG
GATGTTCAGA ATCTGTGGAT ACGGGGCCGC ACGGAAGAAG CCCGGAAGGC GCTGGCGGAA
GTGGAACGCA GGCATGGTTC CCGTTCTGAA GTGAAAGAAT GGCTCCCTGT CTGGAAGCGC
GCTCTGGGGC GCCAGGAGCC CGCGCTGGAA GTTGTTCCGG CGCAGATTCC TGCGGCAACC
GCCAGTATCA GCCTGAATGA CTGCAAGCCT TCCGTAGCCC AGGTGGGGTG GGCCGTTCCG
TTGTGGGATG TCCTGTTCCC GTCGGATTTG GGGCCCGTGC CTTTCTTCCG GACAATCGGA
AGGCCGGAAC GTTTCATTCT GGCTCATGCC CCGGCTGTTT TTGCCTATGA CCTGGATGGG
CGCTGGAAAG AGTTCCGCGC TGATGTGGGG CTGCCCTTCG GCTCCCGCGG CAGCGTCAAA
TTCAGCGTGT ATTTGGACGG CAGGAAGCTT ATGGAATCCC CTGTGCTCAA GGACGGGGAC
TCCATTCCCG TGAAGACGGC GGTAGAGGGC GGCAGGAAGC TGGAAATCCG CGTGGATGAC
GCCGGAGACG GCAATGCCGC CGATTGGGGC ATTATAGCCA ACGGCATGCT GACCCGGTAA
 
Protein sequence
MNVMSKRFFA LLLVLGSGIW SVPAMGMDEE SASRASVPAS SDREGAEFTR LPVSWTVNPR 
DAANARAAWK TLSAYHRGKP KSSRKLHVVY VTFKDRPALE GYRERYDHIL KNIQAYYADQ
MQANGFPPLT FQLDLDERGK LVIHDAYVDK PMSEMSVQSS GPVSREAARK VLASKGIDIE
KEHVLVVCQL PDGVGPYYGG GFSHQGTGWT CDQEGLDPAS FLDTEMMQGG RFKVTRGKNA
TIYIGGTAHE LGHSFGLPHT GDGWNYPDAG ASLMGHGNST YGDELRHEGK GAYLAPTDAL
KLASVPLFNG VETELPADAS FGRMLGKYVP GSFERLEAIP VKDGLRLKGR VHLTRPAYGI
VAHLDPPGGS DYDSNAVGAS LDEKGEFDLT ICRPGYKGGF IEMRVAVLNC DSTRSMITLP
VWMDARGTKA PSLAQIVYFG DVQNLWIRGR TEEARKALAE VERRHGSRSE VKEWLPVWKR
ALGRQEPALE VVPAQIPAAT ASISLNDCKP SVAQVGWAVP LWDVLFPSDL GPVPFFRTIG
RPERFILAHA PAVFAYDLDG RWKEFRADVG LPFGSRGSVK FSVYLDGRKL MESPVLKDGD
SIPVKTAVEG GRKLEIRVDD AGDGNAADWG IIANGMLTR