Gene GM21_4110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4110 
Symbol 
ID8139484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4693579 
End bp4694649 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID644871725 
Productgalactokinase 
Protein accessionYP_003023883 
Protein GI253702694 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0153] Galactokinase 
TIGRFAM ID[TIGR00131] galactokinase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones98 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCTA CCGAGTTTGA GAAGACCTTC GAGGCCCCCT GCGAAGCGAC CGCCCGCGCA 
CCCGGCCGGG TCAACCTCCT GGGGGAGCAT ACCGACTACA ACGACGGCTT CGTCCTCCCC
ATCGCGGTCC CGCTGGAGAC CACGGTGGAG CTGGCCAAAA GCCGCGACGG CCGGAACCAC
TACTATGCGG AGGAGCTGCA GGAAAGGGCG TGGTCGGAGA CGGGAGGCGC GGTCCCCAGC
GGCTTCGCCG CCTACCTGCA CGGCTGCCTC GCGCTTTTGC GCCTCTCCGG GCACCACGTG
GACCCGGTTT CGGTGCGGGT CACCTCCCAG GTGCCCATGG GGAGCGGACT CTCCTCCAGC
GCCGCGCTCG AGGTCGCCTT CCTGCGCGGG ATGCGGGAGC TGTTCCGCCT CGACCTGGAC
GACGTCGAGA TCGCGCTCAT GGCCCAGCAG GCCGAGATCC GCTACGCCGG GGTCAACTGC
GGCATCATGG ACCAGATGGC CGCGAGCCTC GCCGATTCCA CCCACATGCT CTTCATCGAC
ACCCGGTCGC TGGAGCGAAA GCTCCTCCCG CTCCCCCCGC GCTCGGAGCT CCTGGTGATC
GACTGCGGGG TCCCGCGAAA GCTCGGCGAG AGCATGTACA ACCTGCGCCG CCAGGAGTGC
GAGGAGGCTG CGGAGCTTCT GGGGGTGGGT TCGCTGCGGG ACCTCTCGGA CCTGAACCAA
CTGATCAAGC TGCCGCGCAA CCTGGCGCGG CGCGCCCGGC ACGTGCTGAC CGAGAACGAG
CGGGTGTTGG AGGCGGTCAA GGGGGTGCAC GGCTGCCGCT TCGGGGAGTT GATGAACGCC
TCGCACAAGA GCCTCAGGGA CGACTTCCAG GTCTCCATAC CCGAACTGGA CCTTTTGGCC
AGGCTGCTGC AGGAACAGGT CGACGTGTAC GGAGCGCGGC TCACCGGGGC CGGCTTCGGA
GGGGCCTGCG TGGCGCTGGT GCGCGAGGGG AAGGCGGCGG AGGTAGCGTC GAACGTCCTG
GCGCTCTACC GCGAGCAAGG GGAGCAGGGG AAGCTATTGG TGCCGCAGTA G
 
Protein sequence
MPATEFEKTF EAPCEATARA PGRVNLLGEH TDYNDGFVLP IAVPLETTVE LAKSRDGRNH 
YYAEELQERA WSETGGAVPS GFAAYLHGCL ALLRLSGHHV DPVSVRVTSQ VPMGSGLSSS
AALEVAFLRG MRELFRLDLD DVEIALMAQQ AEIRYAGVNC GIMDQMAASL ADSTHMLFID
TRSLERKLLP LPPRSELLVI DCGVPRKLGE SMYNLRRQEC EEAAELLGVG SLRDLSDLNQ
LIKLPRNLAR RARHVLTENE RVLEAVKGVH GCRFGELMNA SHKSLRDDFQ VSIPELDLLA
RLLQEQVDVY GARLTGAGFG GACVALVREG KAAEVASNVL ALYREQGEQG KLLVPQ