Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1438 |
Symbol | |
ID | 6275752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1725597 |
End bp | 1727516 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613497 |
Product | Glycosyl hydrolase family 98 putative carbohydrate binding module |
Protein accession | YP_001878041 |
Protein GI | 187735929 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTTA TGTCGAAACG TTTTTTTGCC TTGTTGCTGG TCCTGGGGTC CGGGATATGG AGTGTTCCTG CCATGGGGAT GGATGAGGAG TCAGCTTCCA GGGCATCTGT TCCTGCCTCC TCGGACAGGG AGGGAGCGGA GTTTACCCGT CTGCCCGTCA GCTGGACGGT TAATCCCAGG GATGCCGCCA ATGCCCGTGC CGCCTGGAAA ACGCTGAGCG CCTATCATCG GGGCAAACCG AAGTCTTCCA GAAAATTGCA TGTGGTTTAC GTAACGTTCA AGGACAGGCC TGCCCTTGAA GGATACCGGG AACGGTATGA TCATATTCTG AAGAATATTC AGGCATATTA TGCGGACCAG ATGCAGGCCA ACGGTTTCCC TCCGCTCACA TTCCAGCTGG ATTTGGATGA ACGGGGCAAG CTGGTTATTC ATGATGCCTA TGTGGATAAA CCCATGAGCG AGATGAGCGT GCAGAGCTCC GGCCCCGTCT CCCGGGAAGC CGCCAGGAAG GTGCTGGCTT CCAAGGGGAT TGATATTGAG AAGGAACATG TGCTGGTCGT TTGCCAGCTT CCGGACGGCG TGGGTCCTTA TTACGGCGGC GGCTTCAGCC ACCAGGGCAC GGGGTGGACC TGTGACCAGG AAGGGCTGGA TCCCGCCAGC TTTCTGGATA CGGAAATGAT GCAGGGCGGC CGTTTCAAGG TGACCAGGGG AAAGAACGCC ACCATTTACA TAGGCGGTAC GGCCCATGAA TTGGGGCATT CCTTCGGCCT TCCCCACACG GGCGACGGAT GGAATTACCC CGACGCCGGA GCTTCCCTGA TGGGCCATGG CAATTCCACC TACGGCGACG AGCTGCGCCA TGAAGGGAAG GGTGCCTATT TGGCGCCGAC GGATGCTTTG AAACTGGCCA GTGTTCCCCT GTTCAACGGG GTGGAGACGG AACTTCCCGC AGACGCCTCC TTCGGGCGTA TGCTCGGCAA GTATGTCCCG GGGTCTTTTG AACGGCTGGA GGCCATTCCC GTTAAAGACG GATTGCGGCT GAAAGGGAGG GTTCATCTGA CGCGCCCCGC GTATGGCATT GTTGCCCATT TGGATCCGCC CGGAGGTTCG GATTATGATT CCAATGCCGT GGGCGCTTCT CTGGATGAAA AGGGAGAGTT TGATCTCACC ATCTGCAGGC CGGGATATAA GGGTGGTTTT ATAGAAATGC GGGTGGCCGT ATTGAATTGC GACAGCACGC GCAGCATGAT TACTCTGCCC GTGTGGATGG ATGCCCGGGG AACCAAGGCC CCTTCTCTGG CCCAGATCGT TTATTTCGGG GATGTTCAGA ATCTGTGGAT ACGGGGCCGC ACGGAAGAAG CCCGGAAGGC GCTGGCGGAA GTGGAACGCA GGCATGGTTC CCGTTCTGAA GTGAAAGAAT GGCTCCCTGT CTGGAAGCGC GCTCTGGGGC GCCAGGAGCC CGCGCTGGAA GTTGTTCCGG CGCAGATTCC TGCGGCAACC GCCAGTATCA GCCTGAATGA CTGCAAGCCT TCCGTAGCCC AGGTGGGGTG GGCCGTTCCG TTGTGGGATG TCCTGTTCCC GTCGGATTTG GGGCCCGTGC CTTTCTTCCG GACAATCGGA AGGCCGGAAC GTTTCATTCT GGCTCATGCC CCGGCTGTTT TTGCCTATGA CCTGGATGGG CGCTGGAAAG AGTTCCGCGC TGATGTGGGG CTGCCCTTCG GCTCCCGCGG CAGCGTCAAA TTCAGCGTGT ATTTGGACGG CAGGAAGCTT ATGGAATCCC CTGTGCTCAA GGACGGGGAC TCCATTCCCG TGAAGACGGC GGTAGAGGGC GGCAGGAAGC TGGAAATCCG CGTGGATGAC GCCGGAGACG GCAATGCCGC CGATTGGGGC ATTATAGCCA ACGGCATGCT GACCCGGTAA
|
Protein sequence | MNVMSKRFFA LLLVLGSGIW SVPAMGMDEE SASRASVPAS SDREGAEFTR LPVSWTVNPR DAANARAAWK TLSAYHRGKP KSSRKLHVVY VTFKDRPALE GYRERYDHIL KNIQAYYADQ MQANGFPPLT FQLDLDERGK LVIHDAYVDK PMSEMSVQSS GPVSREAARK VLASKGIDIE KEHVLVVCQL PDGVGPYYGG GFSHQGTGWT CDQEGLDPAS FLDTEMMQGG RFKVTRGKNA TIYIGGTAHE LGHSFGLPHT GDGWNYPDAG ASLMGHGNST YGDELRHEGK GAYLAPTDAL KLASVPLFNG VETELPADAS FGRMLGKYVP GSFERLEAIP VKDGLRLKGR VHLTRPAYGI VAHLDPPGGS DYDSNAVGAS LDEKGEFDLT ICRPGYKGGF IEMRVAVLNC DSTRSMITLP VWMDARGTKA PSLAQIVYFG DVQNLWIRGR TEEARKALAE VERRHGSRSE VKEWLPVWKR ALGRQEPALE VVPAQIPAAT ASISLNDCKP SVAQVGWAVP LWDVLFPSDL GPVPFFRTIG RPERFILAHA PAVFAYDLDG RWKEFRADVG LPFGSRGSVK FSVYLDGRKL MESPVLKDGD SIPVKTAVEG GRKLEIRVDD AGDGNAADWG IIANGMLTR
|
| |