Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_0008 |
Symbol | |
ID | 4269539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 10399 |
End bp | 11340 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 638124735 |
Product | glucokinase |
Protein accession | YP_740857 |
Protein GI | 114319174 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0837] Glucokinase |
TIGRFAM ID | [TIGR00749] glucokinase, proteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCTT ACCTCCTGGC CGACATCGGC GGCACCCATA CCCGTATCGC CCTCGCGACG CCGGGCGGGG AACCGCAGCA AAGGCACCGC TACCGTAACA GCGAGCTTGG CGATCCCCTT TCCGGTCTGC AGCATTTTCT CGCTCAGGTC GCACCAGCCC GCCCCACGAC CCTGGCGATC GCCGTGGCCG GCCCGGTCCA GGGCGGGCGT GTTCAGTTGA CGAACCGAAG CTGGATGCTG CACGACGGCA GCCTGGCCCG CGCCCTCGGG CTCGATGCCG TGCACCTCTA CAATGACTTT CAGGCGCTCG CCCGTGCCCT CCCCCTGCTC TGCGCTTCCT CGGTGCGGCC GTTGGCGCCG GGTGTTGCCG AGCCGGGGGC GCCGCGGGCG GTTCTGGGTC CCGGTACCGG CTTGGGGGTG GCTGCCGCCG TGCCCTGCCC GGCGGGTTGG TCGGCGCTCG CCTGTGAAGG GGGGCATGTC ACCCTGGCAC CGGGCGATGT GGCGGAGAGC ACCCTGATCG ATCGCCTGCG GCAGCAGTTG GATCACGTGT CGGCGGAGGC GGTACTGTGC GGTGCCGGGC TCTGCCGGCT GCACGCCGTG CTGCATGGGG CGCCCTGTGA CGATCCGAAG GCGATCACCG AGGCGGGGCG CGCCGGCGAT CCCCGGGCGA CTGAGACCAT CCAGCGGTTC TTCAGCCTGC TTGGCGGGTT TGCCGGCAAC CTGGCGTTGA CCCTCGGCGC CCGCGGCGGT CTCTACCTTG CCGGAGGGAT GCTGCCAGCG CTGTGGCAAC CGATGCAGGA GTCCGCCTTC CTCGAGCGCT TCCGGGCCAA GGGACGGTTT CGCGATTACC TCACGGCCAT TCCGGTCCTG CTCATCCGCG ACCCGGAGGG TGCCACCCTC CTCGGCCTGC GCGCCCTGCT CGATGACGCC GGGGGCGGTT GA
|
Protein sequence | MSSYLLADIG GTHTRIALAT PGGEPQQRHR YRNSELGDPL SGLQHFLAQV APARPTTLAI AVAGPVQGGR VQLTNRSWML HDGSLARALG LDAVHLYNDF QALARALPLL CASSVRPLAP GVAEPGAPRA VLGPGTGLGV AAAVPCPAGW SALACEGGHV TLAPGDVAES TLIDRLRQQL DHVSAEAVLC GAGLCRLHAV LHGAPCDDPK AITEAGRAGD PRATETIQRF FSLLGGFAGN LALTLGARGG LYLAGGMLPA LWQPMQESAF LERFRAKGRF RDYLTAIPVL LIRDPEGATL LGLRALLDDA GGG
|
| |