Gene Amuc_0097 details


Gene Information

Locus tag: Amuc_0097
Symbol:
ID: 6274974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 123193
End bp: 124152
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 59%
IMG OID: 642612142
Product: ROK family protein
Protein accession: YP_001876723
Protein GI: 187734611
COG category: [G] Carbohydrate transport and metabolism; [K] Transcription
COG ID: [COG1940] Transcriptional regulator/sugar kinase
TIGRFAM ID: [TIGR00744] ROK family protein (putative glucokinase)
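
The coordinate and length fields in this record agree with each other if the coordinates are read as 1-based and inclusive. That convention is an assumption; the record does not state it. A minimal Python sketch of the check, with function names that are purely illustrative and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(123193, 124152)
print(length, protein_length_aa(length))  # 960 319
```

Under those assumptions the values reproduce the listed Gene Length (960 bp) and Protein Length (319 aa).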


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCTT CAACCATTCC CTCCATCGGC ATTGACTTCG GAGGCACCTC CATCAAGATG 
GGGGTCGTCA AGGGCGCGGA AGTAATCGCC CACGCTCCTT CAATAGCCAC CCAGGAATAC
GGCAATCCCG ATCAACTGAT TGAAGCCATA GCCCAGTTCG TGAATATGCT GCGGCTGAAT
CACCCGGAAG TGCAGGCCAT CGGCATGGGA ATGCCGGGGT TCGTCAATTT TTACCAGGGG
ACCGTTTATA CGCTTACCAA CGTGCCCGGT TGGAACAACG TTCCGGTAAA GGACATGCTC
CAGGCTGCCT GCGGACTGCC CGTATACGTG GAAAACGACG CCAACTGCAT GGCTTATGCG
GAATGGAAGC TGGGCGCCGG AAAAGGAAAA AGGCATCTGG TCTGCCTGAC CCTCGGCACC
GGCGTTGGCA GCGGCCTGAT CGTGAACGGA GAACTTCTGC GCGGGGCTAC CTGCTCCGCC
GGGGAGCTGG GGCAGACGAG CATCGACTAC CGGGGGCGTC TGGGCCATTA CGGAAACCGC
GGCTCCCTGG AAGATTATGT AGGCAACCGG GAAATAGCTG CGGATGCACG CACGCTGTAT
GCCAGCCACG GCATTGACAA GGCCATTGTG GACTGCAACC CCATCTCTTT GGAACGGGCC
GCATTGGCTG GTGATGAAGT AGCCGAGCAA GTATGGCGGG ACCTGGCCGT AAAACTCTCC
TGCGCCCTGA TGAATTGCTG CTATCTCCTG AACCCGGAAG CCATCATCAT CGGCGGGGGC
GTGGCCAAGG CCAGAACCCT GCTTTTCCAG CCCCTTCAGG AAATCATGAA AACCCAGCTC
GCCGCCCCTC TGGTGGAATA CCTTGAAATC CTTCCCGCCC AGTTCGGTAC GGAGGCGGGC
ATCCTGGGGG CCGCCCATCT GGCTCTCAAC ACCCACTTCG GAGAAACATT CCGGGCCTGA
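
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any length or composition check. The sketch below is one way to do that and to compute a GC percentage. The helper names are hypothetical, and only the first row of the sequence is used as sample input, so its GC value need not match the 59% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as printed (six blocks of ten bases).
first_row = "ATGACAGCTT CAACCATTCC CTCCATCGGC ATTGACTTCG GAGGCACCTC CATCAAGATG"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Passing the full 16-row sequence to `gc_percent` in the same way gives a figure that can be compared with the GC content field.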
 
Protein sequence
MTASTIPSIG IDFGGTSIKM GVVKGAEVIA HAPSIATQEY GNPDQLIEAI AQFVNMLRLN 
HPEVQAIGMG MPGFVNFYQG TVYTLTNVPG WNNVPVKDML QAACGLPVYV ENDANCMAYA
EWKLGAGKGK RHLVCLTLGT GVGSGLIVNG ELLRGATCSA GELGQTSIDY RGRLGHYGNR
GSLEDYVGNR EIAADARTLY ASHGIDKAIV DCNPISLERA ALAGDEVAEQ VWRDLAVKLS
CALMNCCYLL NPEAIIIGGG VAKARTLLFQ PLQEIMKTQL AAPLVEYLEI LPAQFGTEAG
ILGAAHLALN THFGETFRA
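
The record lists translation table 11, whose codon-to-amino-acid assignments are the same as those of the standard genetic code (the two tables differ in permitted start codons). As a rough sketch of how the protein sequence relates to the gene sequence, the following Python builds that codon table and translates a fragment. The `translate` helper is illustrative only; it stops at the first stop codon and does not handle ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # KeyError on non-ACGT input
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGACAGCTT CAACCATTCC CTCCATCGGC"))  # MTASTIPSIG
```

The output matches the first ten residues of the protein sequence, and the final four codons of the gene (`TTC CGG GCC TGA`) translate to the closing residues `FRA` followed by a stop.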