Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0969 |
Symbol | |
ID | 6274185 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1156686 |
End bp | 1157864 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642613023 |
Product | galactokinase |
Protein accession | YP_001877582 |
Protein GI | 187735470 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | [TIGR00131] galactokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.003784 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 0.261209 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTAG TTCAGCGAGA AATCTCTAAA GAAACGGTGA CTCCGTATTT CATCGAGTAT TTCGGTCAGG CGCCTACTCA TGTGGCAGCG GCACCGGGAC GTGTGAACCT TATTGGTGAG CACACGGATT ATAATAACGG TTTTGTGATG CCTATGGCGC TTGATAACCA TTGTGTTGTG GCTGTGGCTC CCTCCCCCGT GGGCAAACAC CGCTTTTGCG GTTCCCTGGG TGACCAGATC CATGAAATTG CAGTGGAAGA CGCCTTGGTT CCCGGCGAAC CGTTCTGGTC CAATTATGTC CGCGGCGTTT TGGCCAACCT GCACAGGCGC GGCATAGAAA TCGGGCCTGT GGATATGCTG ATTGACAGCA ATGTGCCCCG CGGCGGCGGC CTCTCCTCCA GTGCCGCTCT TGAAGTTGCC GTCTGTACGG CGCTCGCCGC TTTTGCCGGC GTTGAAATAG ATCCCAAGGA AGTAGCCCTC ATTGGGCAGG CCGTGGAACA TGAATTCGTG AACGTTCCCT GCGGCATCAT GGACCAGTTT ATTTCCGCCA ACGGCAAGAA GGGCATGGCT CTCAAGCTGG ATTGCGCCAC GCTGGAATAT GAGCTGGTTC CGATGAACAA TGAATCCGTC TCCGTGCTGG TTCTGGACAG CGCTGTGAAG CATTCCCTGG CGGACGGAGC TTATGGACAG CGCCGCAAGC AGTGTGAGGA AGCTTCTTCC ATCATGGGCG TACCCTCCCT GCGGGAAGCT ACGCTGGAGC TGCTGGAATC CTTCAGGGAA CAGCTTGGCG ATGTGCGCTA TCGCCGCGCC CGCCACGTCA TTGGAGAAAA TGCGCGCGTG AACGCTTTTG CGAACGCCCT TGCCCGCGGC GATTGGGATG AGGCCGGCGT AGCCATGCGC GGCAGCCATG CTTCCCTGCG GGACGACTAT GAAGTTTCCT GTGCTGAGGT GGATACCCTT GTTTCACTTT GTGACCGCAT TCCCTCCGCA TCCTCCATTT ACGGCGCGCG CATGACGGGC GGCGGGTTTG GCGGATGCAT TGTGGCCCTG GTGAAGACGG AGGATGTGGA AAAGGTGGCC CAGGAGCTTC TGGACGGCTA CTGCCAGGAA ACGGGCATTG AAACTACGTA TCTTGTAACC CGTGCCGGAG AAGGCGCCCG TGTTTTGTAC CAAGCTTAA
|
Protein sequence | MDLVQREISK ETVTPYFIEY FGQAPTHVAA APGRVNLIGE HTDYNNGFVM PMALDNHCVV AVAPSPVGKH RFCGSLGDQI HEIAVEDALV PGEPFWSNYV RGVLANLHRR GIEIGPVDML IDSNVPRGGG LSSSAALEVA VCTALAAFAG VEIDPKEVAL IGQAVEHEFV NVPCGIMDQF ISANGKKGMA LKLDCATLEY ELVPMNNESV SVLVLDSAVK HSLADGAYGQ RRKQCEEASS IMGVPSLREA TLELLESFRE QLGDVRYRRA RHVIGENARV NAFANALARG DWDEAGVAMR GSHASLRDDY EVSCAEVDTL VSLCDRIPSA SSIYGARMTG GGFGGCIVAL VKTEDVEKVA QELLDGYCQE TGIETTYLVT RAGEGARVLY QA
|
| |