Gene GM21_3022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3022 
Symbol 
ID8138368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3509331 
End bp3510893 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content70% 
IMG OID644870623 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_003022809 
Protein GI253701620 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones120 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTGG TCACGGGACA GGTGATGCAG TCGATGGACA AAAGGGCCAT CGGGGAGTTC 
GGGGTCCCGG GGGTCGCTCT CATGGAGCGC GCCGGCACCG CGTGCGCCGA CGCCATCTCG
GCTCGCTTCG GAGACGGCGT CGGGAAAAAA GCGGTCATCG TCGCCGGCAA GGGTAACAAC
GGCGGCGACG GCTTCGTCAT CTCCAGGGTC CTCTCGCAAA AGGGGTGGGA GGCGCCGGTG
CTGCTTTTGG CCGCGCCGGG GTCTGTGACC GGGGACGCCG GCGCCAACCT GGAGCGCCTG
GACCCCGGCG TCGTCACCAC ACTCCCGCAG GGGCTTTCCG GGCAGGAGCA TCTCCTTAAA
AGCGCCACCG TCGTCGTCGA CGCGCTCCTT GGCACCGGCA TCAACAGCGA GGTGGGGGGG
ATCTACGGGG AGGCCATCGA CATCGTCAAC GCCTGCGGGG TCCCGGTGGT CGCGGTCGAC
ATCCCCTCCG GTATCGACGC CGGCAGCGGC CGGGTGCTGG GGCGGGCGGT GCGCGCCGAA
CTCACGGTGA GCTTTGCCCT GGCGAAGCTT GGGAACGTGC TTTACCCGGG GGCGGAGCTG
GGCGGGCGCC TGGCGGTGGC CGATATCGGC ATGCCGCAGG CGGTGATCGA CGAGGCGCCG
GGATGCGAGT ATGTGGACGA AGAGGGCGCG CGGGGGATGA TCGTTGAGCG GAGCCCCCTC
GCCCACAAGG GGAGCAACGG GCATTGCCTG GTCATCGCCG GGAGCGCCGG GAAGACCGGA
GCGGCGGCCA TGGCCGCGCA AAGCGCCTTG AGGGGCGGGG CTGGGCTGGT GAGCCTCGCG
GTTCCGGCGG CGCTCAACCC CGTGCTGGAA TCGAAGACGA CGGAGGCGAT GACCATTCCC
GTCGGGCCGG AGGATAAGGG GTACTTCCTG GCGGGGGCGC TCGACGAGTT GCTGTCGGTC
GCCAAGGGGA AGGATGCTGT GGCGCTCGGG CCTGGGCTTG GGACCGCCCC CTCCACGGTC
TACCTGGTGC ACTCGCTCCT CGCCCTCCTG GAGGCGCCGC TTGTCATCGA CGCCGACGGG
CTGAACGCGG TGGCGGCGGC CCCGGAGCTT CTCTTGCGCC GACGGGGGAG GGTGACGCTG
TTGACGCCGC ACCCGGGGGA GATGGCCAGG CTTACGGGTC TCTCCATCCA AGAGGTAGAG
GCGGACCGCA TCGGTTGCTC CCGCGACTTC GCCGCGCGCT TCCAGGTCTA CCTCGTCTTG
AAGGGGGCCC GCAGCATCGT GGCGGCACCC GACGGCGGCG TCAGTATCAA CGGCAGCGGC
AACCCCGGCA TGGCGACCGG GGGGATGGGG GACGTCCTGA CCGGCGTCAT CGCCGCCCTT
TTGGGGCAGG GGTACCACCC CTTTGACGCC GCCCGCTTAG GGACTTTCCT CCACGGCTAC
GCGGCGGATT TACTGGTTGA GGAATTGGGG ACGCGCGGCA TGGTCGCAAC CGACGTGCAG
GAGGCCCTCC CCAGGGCGAT GCGCCGGCTT ACGGCCGCGG GCCGCGGCGA GGCCGGAATA
TAG
 
Protein sequence
MKVVTGQVMQ SMDKRAIGEF GVPGVALMER AGTACADAIS ARFGDGVGKK AVIVAGKGNN 
GGDGFVISRV LSQKGWEAPV LLLAAPGSVT GDAGANLERL DPGVVTTLPQ GLSGQEHLLK
SATVVVDALL GTGINSEVGG IYGEAIDIVN ACGVPVVAVD IPSGIDAGSG RVLGRAVRAE
LTVSFALAKL GNVLYPGAEL GGRLAVADIG MPQAVIDEAP GCEYVDEEGA RGMIVERSPL
AHKGSNGHCL VIAGSAGKTG AAAMAAQSAL RGGAGLVSLA VPAALNPVLE SKTTEAMTIP
VGPEDKGYFL AGALDELLSV AKGKDAVALG PGLGTAPSTV YLVHSLLALL EAPLVIDADG
LNAVAAAPEL LLRRRGRVTL LTPHPGEMAR LTGLSIQEVE ADRIGCSRDF AARFQVYLVL
KGARSIVAAP DGGVSINGSG NPGMATGGMG DVLTGVIAAL LGQGYHPFDA ARLGTFLHGY
AADLLVEELG TRGMVATDVQ EALPRAMRRL TAAGRGEAGI