Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1187 |
Symbol | |
ID | 6273828 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1423063 |
End bp | 1424664 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613238 |
Product | Alpha-galactosidase |
Protein accession | YP_001877793 |
Protein GI | 187735681 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3345] Alpha-galactosidase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000788687 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTGT ACCATTTTTT ACTCCCTGCC GTTGTCAGCG CTGCCGTATC TGCGTCATTT GGGGCAGAGT TCCCTAATCC CTATCCTGCG CCCGCTCCCG GTGTCCGCCT GACTCCAGAG ATTCCGCTTT CACCCTCCAT TAATGGCGCC CGTATCGTCG GGGCTACCCC CGGTTCCCGC ATGCTGTTCC AGGTTCCCGT CTCCGGGGAG CGGCCCATGA AAATTCAGGC AACAGGGCTG CCCCCAGGCC TGAAGATGGA TTCGCGCGGA TTGATTTCGG GTACCGCTCC GTCCGGGAAG AGGGAATACA AGGTAAATAT CCAGGCTTCC AACAGGCATG GAAAGGACAT GAAGGAGCTG ATTCTGAAGG TGGGGGACGA ATTGTGCCTG ACTCCGCCCA TGGGCTGGAG CAGCTGGTAT TCCTACAGTG AGGCCGTAGG GGAGGATAAT GTGCTGAAGA CGGCACGGCT TTTTGTGGAA CGGGGTCTGG TCAATCATGG CTGGGCCTAT ATCAACATTG ACGACTGCTG GCAGGGCAGG CGCGGAGGGA AGTATGGCGC CATTCAACCC AATAAGCGTT TTCCTGACAT GAAGGCCATG TGCGACGCTA TTCACGCCAT GGGCATGAAA GCGGGCATTT ATTCCACGCC TTGGATGGGA ACGTATGCCG GTTTTATCGG AGGGAGCGCG CCCAACGCTA AGCCGGACTA CGGGGAAATG GCCATTCCGG AAAAGGAGCG CAAGCAGGAG GATCAAATCT TTGGAAGTTA TCCGGGAGTT CATCGCAGAA AGGCGGATCA TGTGGGAGCC GTCTGGCTGT TTGACCGTGA CGCTAAACAA TGGGCGGATT GGGGGTTCGA TTATGTGAAA GTGGATTGGA ATCCCAACGA TGTGTCTACG ACAAAGCGCA TCCGCAAGGC GCTGGACGAG TCCGGGAGGG ATATCGTGCT CAGCCTGTCC AATGCCGCCC CGTACGAACA TGTGGAAGAG CTGGGCAAGC TGGCGAATTT ATGGCGGACG ACGGGGGATA TCCAGGATCA CTGGGGCAGC GTCAGCGGCA TCGGTTTTTC CCAGGAACGC TGGCAGAAGC ATATGCGCCC GGGACATTGG AATGATCCGG ACATCCTCCA GATCGGGAAG CTGGGCAAAC CCAACCAGCC CAACACCACG TTTGTCCAGA CGCGGCTGAC TCCGGATGAA CAGTACACCC ATGTGACCCT GTGGTGCCTG CTGTCCGCTC CGCTCATCGT CTCCTGTGAT TTGGAGCATA TTGATTCGTT TACGATGGGA CTGCTTACCA ATGATGAGGT GATAGCGGTG GATCAGGATC CGGCTGCCCG TCCCGCCCGC AAAGCGTGGC ACCAGGGGAA TTTCCAGGTG TGGATGAAGG AGTTGTCCGA CGGTTCCGTG GCGGCTGGCT TTTTCAATAC CGGGAAGGAG AAAGGAATTT TGAAGGTGAA TCTGAAGGAG CTGGGGCTTT CCGGAGCGTA TGAGGCAAGG GACCTCTGGA AACGCGCTGA CCAGGGGACC GTACAGGGAG ATATGGCGGT AGAATTGAAC GGGCATGGAG CATCCATGTT CCGGTTCAGC AAAAAGAAGT AA
|
Protein sequence | MRLYHFLLPA VVSAAVSASF GAEFPNPYPA PAPGVRLTPE IPLSPSINGA RIVGATPGSR MLFQVPVSGE RPMKIQATGL PPGLKMDSRG LISGTAPSGK REYKVNIQAS NRHGKDMKEL ILKVGDELCL TPPMGWSSWY SYSEAVGEDN VLKTARLFVE RGLVNHGWAY INIDDCWQGR RGGKYGAIQP NKRFPDMKAM CDAIHAMGMK AGIYSTPWMG TYAGFIGGSA PNAKPDYGEM AIPEKERKQE DQIFGSYPGV HRRKADHVGA VWLFDRDAKQ WADWGFDYVK VDWNPNDVST TKRIRKALDE SGRDIVLSLS NAAPYEHVEE LGKLANLWRT TGDIQDHWGS VSGIGFSQER WQKHMRPGHW NDPDILQIGK LGKPNQPNTT FVQTRLTPDE QYTHVTLWCL LSAPLIVSCD LEHIDSFTMG LLTNDEVIAV DQDPAARPAR KAWHQGNFQV WMKELSDGSV AAGFFNTGKE KGILKVNLKE LGLSGAYEAR DLWKRADQGT VQGDMAVELN GHGASMFRFS KKK
|
| |