Gene Information |
Locus tag | GM21_2219 |
Symbol | |
ID | 8137556 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2589106 |
End bp | 2590089 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644869833 |
Product | glucokinase |
Protein accession | YP_003022027 |
Protein GI | 253700838 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0837] Glucokinase |
TIGRFAM ID | [TIGR00749] glucokinase, proteobacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0000000124092 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCATAC TTGCTGGAGA CGTCGGCGGG ACCTCGACCC GGCTAGCCTA CTTCGAGTAC GCTGCAACCG GGCTCGTGGT GCTGGCCGAG GGACGCTACC AAAGCCAGGA ACACAGCAGT CTCTCCGACA TCGTGCGCCG CTTTGCCGCC CAATACCGCT TCGACGCCGA CAGGGCCTGC TTCGGCATAG CCGGACCGGT CATCGACGGG CGGGTACGGA CCCCGAACCT CCCCTGGAAC ATCGACGGCA GCGAACTTGC CGCAGCCTTA GGTCTAGACC AGGTGCGCCT GATCAACGAT CTCGAGGCCA ACACCTACGG CATCGCGGAA CTGAAGGCGC AGGACCTGCT GACGCTCAAC CCGGGAGCGG CGGACCCCAC AGGCACCATA GCCGTGGTTT CCGCCGGCAC CGGGCTTGGG GAATCGCTTG CCTACTGGGA CGGCTCCGCC CACAGACCGC TTCCCAGCGA GGCGGGGCAT GCCGACTTCG CGGCGCGAAA CGATCTCGAG GCCGATCTTT TGCTCTACCT CCAGGGGAAG CATGGCCGGG TCAGTTACGA GCGCGTCCTG TCGGGACCGG GGCTCCTCGA TATCTACCGG TTTCTCAGGG ACAGGCATTA CTTCCAGGAG GATGAAGCGA TCATTGCCGC CATGAACGCG GGAGACGCCC CCGCGGTCAT CACCCGCGCC GCAATGGCCG GAACTTGCCC GATGTGCAGC AAGGCTCTCG ATATCTTCAT CACTGTGTAC GGTGCCGAAG CCGGGAATGC AGCTCTCAGG TTTCTCGCCA CAGGCGGAGT CTATCTCGGC GGGGGAATCG CGCCCAAGAT CCTGGACAAG CTGCGCGGGG CTTCCTTCAT CGTAGCCTTC ACGGCAAAGG GGCGTCTCAG CTCTTTGGTG CAAACAATTC CGGTGCACGT CATATTAAAT GAGAGAACCG CACTGCTAGG TGCGGGTAGG GCTGCTTCAA TCTCATCCAG TTGA
|
Protein sequence | MVILAGDVGG TSTRLAYFEY AATGLVVLAE GRYQSQEHSS LSDIVRRFAA QYRFDADRAC FGIAGPVIDG RVRTPNLPWN IDGSELAAAL GLDQVRLIND LEANTYGIAE LKAQDLLTLN PGAADPTGTI AVVSAGTGLG ESLAYWDGSA HRPLPSEAGH ADFAARNDLE ADLLLYLQGK HGRVSYERVL SGPGLLDIYR FLRDRHYFQE DEAIIAAMNA GDAPAVITRA AMAGTCPMCS KALDIFITVY GAEAGNAALR FLATGGVYLG GGIAPKILDK LRGASFIVAF TAKGRLSSLV QTIPVHVILN ERTALLGAGR AASISSS
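
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record states that the gene is not spliced. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
start_bp, end_bp = 2589106, 2590089

gene_len = gene_length_bp(start_bp, end_bp)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 984 327
```

Both results agree with the Gene Length (984 bp) and Protein Length (327 aa) fields listed in the record.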