Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_0097 |
Symbol | |
ID | 6274974 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 123193 |
End bp | 124152 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642612142 |
Product | ROK family protein |
Protein accession | YP_001876723 |
Protein GI | 187734611 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCTT CAACCATTCC CTCCATCGGC ATTGACTTCG GAGGCACCTC CATCAAGATG GGGGTCGTCA AGGGCGCGGA AGTAATCGCC CACGCTCCTT CAATAGCCAC CCAGGAATAC GGCAATCCCG ATCAACTGAT TGAAGCCATA GCCCAGTTCG TGAATATGCT GCGGCTGAAT CACCCGGAAG TGCAGGCCAT CGGCATGGGA ATGCCGGGGT TCGTCAATTT TTACCAGGGG ACCGTTTATA CGCTTACCAA CGTGCCCGGT TGGAACAACG TTCCGGTAAA GGACATGCTC CAGGCTGCCT GCGGACTGCC CGTATACGTG GAAAACGACG CCAACTGCAT GGCTTATGCG GAATGGAAGC TGGGCGCCGG AAAAGGAAAA AGGCATCTGG TCTGCCTGAC CCTCGGCACC GGCGTTGGCA GCGGCCTGAT CGTGAACGGA GAACTTCTGC GCGGGGCTAC CTGCTCCGCC GGGGAGCTGG GGCAGACGAG CATCGACTAC CGGGGGCGTC TGGGCCATTA CGGAAACCGC GGCTCCCTGG AAGATTATGT AGGCAACCGG GAAATAGCTG CGGATGCACG CACGCTGTAT GCCAGCCACG GCATTGACAA GGCCATTGTG GACTGCAACC CCATCTCTTT GGAACGGGCC GCATTGGCTG GTGATGAAGT AGCCGAGCAA GTATGGCGGG ACCTGGCCGT AAAACTCTCC TGCGCCCTGA TGAATTGCTG CTATCTCCTG AACCCGGAAG CCATCATCAT CGGCGGGGGC GTGGCCAAGG CCAGAACCCT GCTTTTCCAG CCCCTTCAGG AAATCATGAA AACCCAGCTC GCCGCCCCTC TGGTGGAATA CCTTGAAATC CTTCCCGCCC AGTTCGGTAC GGAGGCGGGC ATCCTGGGGG CCGCCCATCT GGCTCTCAAC ACCCACTTCG GAGAAACATT CCGGGCCTGA
|
Protein sequence | MTASTIPSIG IDFGGTSIKM GVVKGAEVIA HAPSIATQEY GNPDQLIEAI AQFVNMLRLN HPEVQAIGMG MPGFVNFYQG TVYTLTNVPG WNNVPVKDML QAACGLPVYV ENDANCMAYA EWKLGAGKGK RHLVCLTLGT GVGSGLIVNG ELLRGATCSA GELGQTSIDY RGRLGHYGNR GSLEDYVGNR EIAADARTLY ASHGIDKAIV DCNPISLERA ALAGDEVAEQ VWRDLAVKLS CALMNCCYLL NPEAIIIGGG VAKARTLLFQ PLQEIMKTQL AAPLVEYLEI LPAQFGTEAG ILGAAHLALN THFGETFRA
|
| |