Gene Amuc_0969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0969 
Symbol 
ID6274185 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1156686 
End bp1157864 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content57% 
IMG OID642613023 
Productgalactokinase 
Protein accessionYP_001877582 
Protein GI187735470 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.003784 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value0.261209 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTAG TTCAGCGAGA AATCTCTAAA GAAACGGTGA CTCCGTATTT CATCGAGTAT 
TTCGGTCAGG CGCCTACTCA TGTGGCAGCG GCACCGGGAC GTGTGAACCT TATTGGTGAG
CACACGGATT ATAATAACGG TTTTGTGATG CCTATGGCGC TTGATAACCA TTGTGTTGTG
GCTGTGGCTC CCTCCCCCGT GGGCAAACAC CGCTTTTGCG GTTCCCTGGG TGACCAGATC
CATGAAATTG CAGTGGAAGA CGCCTTGGTT CCCGGCGAAC CGTTCTGGTC CAATTATGTC
CGCGGCGTTT TGGCCAACCT GCACAGGCGC GGCATAGAAA TCGGGCCTGT GGATATGCTG
ATTGACAGCA ATGTGCCCCG CGGCGGCGGC CTCTCCTCCA GTGCCGCTCT TGAAGTTGCC
GTCTGTACGG CGCTCGCCGC TTTTGCCGGC GTTGAAATAG ATCCCAAGGA AGTAGCCCTC
ATTGGGCAGG CCGTGGAACA TGAATTCGTG AACGTTCCCT GCGGCATCAT GGACCAGTTT
ATTTCCGCCA ACGGCAAGAA GGGCATGGCT CTCAAGCTGG ATTGCGCCAC GCTGGAATAT
GAGCTGGTTC CGATGAACAA TGAATCCGTC TCCGTGCTGG TTCTGGACAG CGCTGTGAAG
CATTCCCTGG CGGACGGAGC TTATGGACAG CGCCGCAAGC AGTGTGAGGA AGCTTCTTCC
ATCATGGGCG TACCCTCCCT GCGGGAAGCT ACGCTGGAGC TGCTGGAATC CTTCAGGGAA
CAGCTTGGCG ATGTGCGCTA TCGCCGCGCC CGCCACGTCA TTGGAGAAAA TGCGCGCGTG
AACGCTTTTG CGAACGCCCT TGCCCGCGGC GATTGGGATG AGGCCGGCGT AGCCATGCGC
GGCAGCCATG CTTCCCTGCG GGACGACTAT GAAGTTTCCT GTGCTGAGGT GGATACCCTT
GTTTCACTTT GTGACCGCAT TCCCTCCGCA TCCTCCATTT ACGGCGCGCG CATGACGGGC
GGCGGGTTTG GCGGATGCAT TGTGGCCCTG GTGAAGACGG AGGATGTGGA AAAGGTGGCC
CAGGAGCTTC TGGACGGCTA CTGCCAGGAA ACGGGCATTG AAACTACGTA TCTTGTAACC
CGTGCCGGAG AAGGCGCCCG TGTTTTGTAC CAAGCTTAA
 
Protein sequence
MDLVQREISK ETVTPYFIEY FGQAPTHVAA APGRVNLIGE HTDYNNGFVM PMALDNHCVV 
AVAPSPVGKH RFCGSLGDQI HEIAVEDALV PGEPFWSNYV RGVLANLHRR GIEIGPVDML
IDSNVPRGGG LSSSAALEVA VCTALAAFAG VEIDPKEVAL IGQAVEHEFV NVPCGIMDQF
ISANGKKGMA LKLDCATLEY ELVPMNNESV SVLVLDSAVK HSLADGAYGQ RRKQCEEASS
IMGVPSLREA TLELLESFRE QLGDVRYRRA RHVIGENARV NAFANALARG DWDEAGVAMR
GSHASLRDDY EVSCAEVDTL VSLCDRIPSA SSIYGARMTG GGFGGCIVAL VKTEDVEKVA
QELLDGYCQE TGIETTYLVT RAGEGARVLY QA