Gene GM21_2219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2219 
Symbol 
ID8137556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2589106 
End bp2590089 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content63% 
IMG OID644869833 
Productglucokinase 
Protein accessionYP_003022027 
Protein GI253700838 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0837] Glucokinase 
TIGRFAM ID[TIGR00749] glucokinase, proteobacterial type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0000000124092 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCATAC TTGCTGGAGA CGTCGGCGGG ACCTCGACCC GGCTAGCCTA CTTCGAGTAC 
GCTGCAACCG GGCTCGTGGT GCTGGCCGAG GGACGCTACC AAAGCCAGGA ACACAGCAGT
CTCTCCGACA TCGTGCGCCG CTTTGCCGCC CAATACCGCT TCGACGCCGA CAGGGCCTGC
TTCGGCATAG CCGGACCGGT CATCGACGGG CGGGTACGGA CCCCGAACCT CCCCTGGAAC
ATCGACGGCA GCGAACTTGC CGCAGCCTTA GGTCTAGACC AGGTGCGCCT GATCAACGAT
CTCGAGGCCA ACACCTACGG CATCGCGGAA CTGAAGGCGC AGGACCTGCT GACGCTCAAC
CCGGGAGCGG CGGACCCCAC AGGCACCATA GCCGTGGTTT CCGCCGGCAC CGGGCTTGGG
GAATCGCTTG CCTACTGGGA CGGCTCCGCC CACAGACCGC TTCCCAGCGA GGCGGGGCAT
GCCGACTTCG CGGCGCGAAA CGATCTCGAG GCCGATCTTT TGCTCTACCT CCAGGGGAAG
CATGGCCGGG TCAGTTACGA GCGCGTCCTG TCGGGACCGG GGCTCCTCGA TATCTACCGG
TTTCTCAGGG ACAGGCATTA CTTCCAGGAG GATGAAGCGA TCATTGCCGC CATGAACGCG
GGAGACGCCC CCGCGGTCAT CACCCGCGCC GCAATGGCCG GAACTTGCCC GATGTGCAGC
AAGGCTCTCG ATATCTTCAT CACTGTGTAC GGTGCCGAAG CCGGGAATGC AGCTCTCAGG
TTTCTCGCCA CAGGCGGAGT CTATCTCGGC GGGGGAATCG CGCCCAAGAT CCTGGACAAG
CTGCGCGGGG CTTCCTTCAT CGTAGCCTTC ACGGCAAAGG GGCGTCTCAG CTCTTTGGTG
CAAACAATTC CGGTGCACGT CATATTAAAT GAGAGAACCG CACTGCTAGG TGCGGGTAGG
GCTGCTTCAA TCTCATCCAG TTGA
 
Protein sequence
MVILAGDVGG TSTRLAYFEY AATGLVVLAE GRYQSQEHSS LSDIVRRFAA QYRFDADRAC 
FGIAGPVIDG RVRTPNLPWN IDGSELAAAL GLDQVRLIND LEANTYGIAE LKAQDLLTLN
PGAADPTGTI AVVSAGTGLG ESLAYWDGSA HRPLPSEAGH ADFAARNDLE ADLLLYLQGK
HGRVSYERVL SGPGLLDIYR FLRDRHYFQE DEAIIAAMNA GDAPAVITRA AMAGTCPMCS
KALDIFITVY GAEAGNAALR FLATGGVYLG GGIAPKILDK LRGASFIVAF TAKGRLSSLV
QTIPVHVILN ERTALLGAGR AASISSS