Gene Information |
Locus tag | Amuc_0771 |
Symbol | |
ID | 6274413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 906706 |
End bp | 908637 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642612822 |
Product | Beta-galactosidase |
Protein accession | YP_001877387 |
Protein GI | 187735275 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
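The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, not part of the original record. It assumes the coordinates are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the final codon is a stop codon which is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the Gene Information fields for Amuc_0771.
start_bp, end_bp = 906706, 908637

length = gene_length_bp(start_bp, end_bp)
protein = protein_length_aa(length)

print(length, protein)  # 1932 643
```

Under these assumptions the computed values agree with the recorded Gene Length (1932 bp) and Protein Length (643 aa).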
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAAGC AATATGGTTT CAGTTGGTCT GCCGCACTGA TGGCCGTCGG AGTGGGAGCA TTGGCATGGG CTGGCCCTGA GGCTGTCCAG AATGTACAGA AACCCGCTCT TTCCGGAGGA CCCCCCGTTG TGTTCGGGTT CGGCGGGGAA GGGAACCAGG AGTTCATGCT GAACGGGAAA CCCTTCCAGA TCCGCGGCGC GGAGATGCAT CCCCAGCGCA TTCCCCGGGA ATACTGGAGG CACCGCATCA GGACCGCCAA GGCTATGGGG CTGAATACTA TTGCATTTTA TGTGTTCTGG AATGACCATG AGCAGCCGGA CGGCAGCTTT GACTTTAAGA CGGGCAATCG GGATCTGGAA GGGTTCCTCA AGTTATGCCA GGAGGAAGGA ATGTGGGTTT TATTCCGCCC CGGCCCCTAT GCATGCGGGG AATGGGATCT GGGTGGGCTG CCTCATTATT TGCTGAAGGA TCCCAAGGCC AAGTTGAGAA CTACGGAAGA CGCCAAATTC ATGAAGGCGC AGACGCGTTA TCTGGAGGCC GTGGCTCGTG TGGCGGAGCC TTTTTTAGCC AAAAACGGGG GCCCCATTCT GATGACCCAG CTTGAAAACG AATACGGGAG CTACCAGCGT AAAGACCGCA AGTATATGGA ATGGCTGAAG GCGTTCTGGA GCAGGAAGGG TTTTGGCCCC TTTTACACCT CCGACGGCGC GGGAGAACAT TTTCTGAAAG GCGTGGTGCT TCCAGGCGTG GCCGTAGGGC TGGATCCGGG GCTGAACGAC GGCCATTGGG CAGTGGCTAA TAAATGCAAT CCGGGAGTTC CCGTTTTTTC CTCGGAAACA TATCCGGGTT GGCTGCGGCA CTGGGGGGAG GGGAATTGGG CTCCCACCCC TGGAGTGGTC AACCACGTCC GCTGGTTTAT GGACAAGGGG CGTTCCTTCA GCTTGTTCGT TTTCCACGGA GGCACCAATT TCGGATTCTC GGCCGGAGCC AACAACGGAG GGCCGGGAAA ATACCAGCCG GACCTGACGA GTTATGATTA CGGTTCTCCC GTGGATGAGC AGGGGCGGAT GAATGAATAT TATGCCCAGA TGAGGGAAAT CATTTTGAAA AAGTTGCCTC CCGAAGCCGC TGTGCCGGAA CCTCCCGCAG ACATTCCGGC CATGGAAATT CCGGAGTTCA CGCCCGCAGT GCATGCCGGC CTTTGGGAGA ACCTGCCCAA GCCTTTCCGG TCCAAGTTCC CGCAGCCTCC CTATTTTGAA CAATGGAACC AGAACCAGGG TATTGCCGTT TACAGAACGG CCGTTCCGTC AGGACCGCCT GAAACGCTGG AATTTACCAA TGTCAATGAC TATGCCCAGG TGTATCTGGA TGGAGAGCTG GTCGGCACGC TGGATCGGCG GCTGGGGCAG AAGAGCGTGA AACTGCCGGA GCGCAGGAAG CCGGGGACGC TGGAAGTTCT GGTAGAGGCC ATGGGACATA TTAATTTTCA TATCAGCATG GAGAGTGACC GCAAGGGGAT TTACGGTCCT GTGAAGCTGG GAACGCGGGA GTTAAAGAAC TGGACGGTGA GGGCGCTTCC CCTGAAAGCT GATTCCATTG TGCGGGCTCC CAAAGGAAAG GGGCCTTCCC AGAAACGGGA AGGGGCGCAT TTCCGGGCCG TTGTAAATAT TGAAGAGCCT CAGGACACGT TTCTGGATAT GTCCCGCTAT GTCAAGGGGT ATGTATGGGT GAACGGAATC AACGTGGGGC GCTATTGGAA TGTGGGACCT CAGTTAAGGC TGTATGTCCC GGCCCCATTC 
CTGAAAAAAG GGGAGAATGT GATTGATATT CTGGACCTGC ACGAAAAGGA GCCCAAGCCT GTCCGCGGCA TGAAGGAACG CAACAAGGAA CCCGGAAAGA TAAATACCAA AAACCTGGAC AACCAGTGGT AA
|
Protein sequence | MMKQYGFSWS AALMAVGVGA LAWAGPEAVQ NVQKPALSGG PPVVFGFGGE GNQEFMLNGK PFQIRGAEMH PQRIPREYWR HRIRTAKAMG LNTIAFYVFW NDHEQPDGSF DFKTGNRDLE GFLKLCQEEG MWVLFRPGPY ACGEWDLGGL PHYLLKDPKA KLRTTEDAKF MKAQTRYLEA VARVAEPFLA KNGGPILMTQ LENEYGSYQR KDRKYMEWLK AFWSRKGFGP FYTSDGAGEH FLKGVVLPGV AVGLDPGLND GHWAVANKCN PGVPVFSSET YPGWLRHWGE GNWAPTPGVV NHVRWFMDKG RSFSLFVFHG GTNFGFSAGA NNGGPGKYQP DLTSYDYGSP VDEQGRMNEY YAQMREIILK KLPPEAAVPE PPADIPAMEI PEFTPAVHAG LWENLPKPFR SKFPQPPYFE QWNQNQGIAV YRTAVPSGPP ETLEFTNVND YAQVYLDGEL VGTLDRRLGQ KSVKLPERRK PGTLEVLVEA MGHINFHISM ESDRKGIYGP VKLGTRELKN WTVRALPLKA DSIVRAPKGK GPSQKREGAH FRAVVNIEEP QDTFLDMSRY VKGYVWVNGI NVGRYWNVGP QLRLYVPAPF LKKGENVIDI LDLHEKEPKP VRGMKERNKE PGKINTKNLD NQW
|
| |