Gene Information |
Locus tag | Amuc_1094 |
Symbol | |
ID | 6274008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1305302 |
End bp | 1306300 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642613145 |
Product | ROK family protein |
Protein accession | YP_001877701 |
Protein GI | 187735589 |
COG category | [G] Carbohydrate transport and metabolism [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | [TIGR00744] ROK family protein (putative glucokinase) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000124678 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.000000000129961 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCTTTT CAGAACCCTG TGCCTTGGCC GTTGATTTCG GCGGCACGAG CATCAAAATG GGCGTAACGG CGGGGGATCG TATTCTGGCG ACGGCCGACC GCATCCCTAC TGCCATGTTC GAAAGCCCGC AGGCAATCAT TGATGCCATG ATTGCGTCCG CCCGCACCCT GCGCGGACAA TTCCCCTCCG CCTGTGTGAT GGGCATGGGA ATGCCGGGAT GGTGTGATTA CCAGCGGGGA GTGCTTTACC AGCTTACCAA TGTGAGGGTC TGGGATAGGG AAATTCCGGT GAAAGAGATG ATGGAGCAGG CCCTGGGCCT CCCCGTCGTG CTGGATAATG ACGCCAACTG CATGGCTTAT GCGGAATGGA AGCTTGGCGC CGGGCGCGGC ATGTCCAGCC TGGTGTGCCT GACGATGGGA ACGGGGATAG GCGGGGGAAT CGTGGTGCAT GACCGCATGC TGCGGGGAAG GCGGCTTTCC GCTGCGGAAC TGGGCCAGAC CAGCATTCAT TACCAGGGGA AAACGGGACC GTTCGGCAGC CGGGGAGCCA TTGAGGAATA CATCGGCAAC AACGAACTGG CGGCGGAGGC GGTTAAACGG TATGCCGGGG CGGGAATCAT CAAGACGGTG GATGAATGCA CGCCCAGGCA TCTGGACGAG GCTGCCCGGT CCGGATGTCC TATAGCCCTT CAATTATGGG AAGATACGGC GGAAATGCTG GGCTGCCTGA TCATGAACCT GATGTATACG CTGGTGCCGG ACGCCTTCAT CATCGGGGGC GGTGTGGCCA AGGCAGGGGA TTTGTTGATG AAGCCGCTGC TGGAGAACCT CAGGAAACAG TTGTTTCCTC TCCTGATGGA GGATTTGAAA ATTCTGCCTG CCAGATTTGG AGCGGAGGCG GGGTTGCTGG GAGCGGGAGC CATGGCGATG GATGAATTCA TGGGGCTGGG GATTTTGGAA CGGTTTAAGA ACCAGAAATC AACGCAGACT TTTTGTTAG
|
Protein sequence | MSFSEPCALA VDFGGTSIKM GVTAGDRILA TADRIPTAMF ESPQAIIDAM IASARTLRGQ FPSACVMGMG MPGWCDYQRG VLYQLTNVRV WDREIPVKEM MEQALGLPVV LDNDANCMAY AEWKLGAGRG MSSLVCLTMG TGIGGGIVVH DRMLRGRRLS AAELGQTSIH YQGKTGPFGS RGAIEEYIGN NELAAEAVKR YAGAGIIKTV DECTPRHLDE AARSGCPIAL QLWEDTAEML GCLIMNLMYT LVPDAFIIGG GVAKAGDLLM KPLLENLRKQ LFPLLMEDLK ILPARFGAEA GLLGAGAMAM DEFMGLGILE RFKNQKSTQT FC
|
| |
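The coordinate, length, and GC fields in this record are related by simple arithmetic, which the following minimal sketch illustrates. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (the gene sequence above ends in TAG). The function names `gene_length`, `protein_length`, and `gc_percent` are hypothetical helpers introduced for illustration; they are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces between 10-base groups are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the Gene Information table above.
length = gene_length(1305302, 1306300)
print(length, protein_length(length))  # prints "999 332"
```

Passing the Gene sequence field to `gc_percent` gives a figure that can be compared with the reported GC content of 58%; the space-separated 10-base groups can be passed as-is.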