Gene Information |
Locus tag | Mpe_A2082 |
Symbol | |
ID | 4783661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2228058 |
End bp | 2229014 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640090650 |
Product | N-acetylglucosamine kinase |
Protein accession | YP_001021273 |
Protein GI | 124267269 |
COG category | [G] Carbohydrate transport and metabolism; [K] Transcription |
COG ID | [COG1940] Transcriptional regulator/sugar kinase |
TIGRFAM ID | |
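The coordinate and length fields above are mutually consistent, which can be checked with a short calculation. The following is a minimal Python sketch and not part of the original record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed for Mpe_A2082.
length_bp = gene_length(2228058, 2229014)
print(length_bp, protein_length(length_bp))  # 957 318
```

Both values agree with the Gene Length (957 bp) and Protein Length (318 aa) fields in the record.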
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.87532 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCCT TCGCGGCCGA CCTACCGACC ATGAGCCCCA GCCCCCTGCC CGCCCTCGGC ATCGACCTCG GCGGCACCAA GATCGAAGCC ATGCTGCTCG ACGACACCGG CGCCACGCGC TGGCGCGAGC GCATCCCGAC ACCGCCCGAC GATTACCGGG CCGCGCTGGC CGCCATCGGC GGCCTGGTCG AGCAGGCCCG TACGGCCGCC GGGAGCGCCA TCAGCGTCGG CATCGGCACA CCGGGCACGC GGCGCGCCGA CGGCGCGATG AAGAATGCCA ACTCCACCTG CCTCAACGGC CAGCCGCTGC AACGCGATCT GGAGGCGCTG CTGGGCCAGC CGATCGCCCT GGCCAACGAC GCCAACTGCC TGGCGCTGTC CGAAGCCACC GACGGGGCCG GCGCCGGCGC GGCGGTGGTG TTCGCGGTGA TCCTCGGCAC CGGTTGCGGC GGGGGCGTGG CGGTGCATGG CCGGGTCCTG CAGGGCCCGA ACGGCCTGGC CGGGGAATGG GGCCACAACC CCCTGCCCTG GGCACGCGAC GACGAACGCC CCGGGCCGGC CTGCTACTGC GGCACCGCAG GGTGCATCGA GGCCTGGCTC AGCGGTCCGG CCGTGGCCGC CGACCACCGA CGCCACGGTG GCGCGGCCAT CGACGCCGTC GCGATTGCGC AAGGCGCCCT GGCCGGCGAT GCGGCCTGCC AGGCCAGCCT CGACCGCCAT GCGCTGCGCG TGGCGCGGGC GCTGGCGTCA GTGGTCAACC TGCTCGACCC GGACGTCATC GTGTTCGGTG GTGGCGCCTC GCGTCTGCCG GGACTCATCG AGCGCCTGCC GAGCCTGTGG ACACCCTGGG TGTTCGGCGC CCGCCACGAC CCGCCGGTGC GGACGCGGCT CGCGCTCTCG CAGCATGGCG ACGCCTCGGG CGTGCGCGGG GCGGCCTGGC TCGGACGCGC GCTGTGA
|
Protein sequence | MTAFAADLPT MSPSPLPALG IDLGGTKIEA MLLDDTGATR WRERIPTPPD DYRAALAAIG GLVEQARTAA GSAISVGIGT PGTRRADGAM KNANSTCLNG QPLQRDLEAL LGQPIALAND ANCLALSEAT DGAGAGAAVV FAVILGTGCG GGVAVHGRVL QGPNGLAGEW GHNPLPWARD DERPGPACYC GTAGCIEAWL SGPAVAADHR RHGGAAIDAV AIAQGALAGD AACQASLDRH ALRVARALAS VVNLLDPDVI VFGGGASRLP GLIERLPSLW TPWVFGARHD PPVRTRLALS QHGDASGVRG AAWLGRAL
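The sequences above are displayed in space-separated blocks of 10, so they need to be cleaned before any computation. The following minimal Python sketch, which is not part of the original record, strips the spacing, computes a GC fraction, and translates a coding sequence. The function names are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares; table 11 differs in the set of permitted start codons, and this sketch does not handle alternative starts.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first, second, third base
# each cycling over T, C, A, G). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed in the record.
prefix = "ATGACGGCCT TCGCGGCCGA CCTACCGACC"
print(translate(prefix))  # MTAFAADLPT
```

The translated prefix matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be cross-checked in the same way.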
|
| |