Gene Rleg_4443 details


Gene Information

Locus tag: Rleg_4443
Symbol: glk
ID: 8015210
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4576206
End bp: 4577231
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 65%
IMG OID: 644827018
Product: glucokinase
Protein accession: YP_002978220
Protein GI: 241207124
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0837] Glucokinase
TIGRFAM ID: [TIGR00749] glucokinase, proteobacterial type
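
The coordinate and length fields of this record can be checked against each other with a little arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 4576206
end_bp = 4577231

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the result is 1026 bp and 341 aa, which matches the Gene Length and Protein Length fields.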


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.862688
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCAAAAC CGAACAACAG CATCGCTCCG CAGCCTTTCC CGATCCTGAT CGGCGATATC 
GGCGGCACGA ATGCCCGCTT CTCCATCCTG ACCGATGCCT ATGCCGAGCC GAAGCAGTTT
CCGAACGTGC GCACGGCGGA TTTCGCCACG ATCGACGAAG CGATCCAGCA AGGCGTGCTC
GACAAGACCG CCGTGCAGCC GCGCTCGGCG ATCCTCGCCG TCGCCGGCCC GATCAACGAC
GACGAGATCC CGCTGACCAA TTGCGACTGG GTGGTGCGGC CGAAGACGAT GATCGAGGGC
CTCGGCATGG AGGATGTGCT CGTCGTCAAC GATTTCGAGG CGCAGGCGCT GGCAATCGCC
GCGCTTTCGG ATGAAAACCG CGAACGCATC GGCGACGCCA CCGGCGACAT GATCGCCTCC
CGCGTCGTGC TCGGACCAGG CACCGGCCTC GGCGTCGGCG GGCTTGTGCA TGCCCAGCAC
AGCTGGATCC CGGTTCCCGG CGAAGGCGGC CATGTCGATC TCGGGCCGCG CAGCAAGCGC
GATTATGAAA TCTTCCCGCA TATCGAGACG ATCGAAGGCC GCGTTTCGGC CGAGCAGATC
CTCTGCGGGC GCGGCCTCGT CAACCTCTAC CATGCCATCT GCGTTGTCGA CGGCATCCAG
CCGACGATGA AAGATCCCGC CGACATCACC TCGCATGCGC TTGCCGGCAG CGACAAGGCA
GCCGTAGAGA CCGTCTCGCT GTTTGCCACC TATCTCGGCC GCGTGGCGGG CGACATGGCG
ATGGTGTTCA TGGCGCGCGG CGGCGTCTAT CTGTCCGGCG GCATCTCGCA GAAGATCATC
CCGGCGCTGA AGAAGCCGGA ATTCCGCATC GCCTTCGAGG ACAAGGCGCC GCATACGGCG
CTGCTTCGCA CCATCCCGAC CTATGTGGTG ACGCATCCGC TGGCAGCGCT TGCCGGGCTT
TCCTCCTATG CGCGGATGCC GGCAAATTTC GGCGTCTCGA CCGAAGGCCG CCGCTGGCGG
CGCTAG
 
Protein sequence
MPKPNNSIAP QPFPILIGDI GGTNARFSIL TDAYAEPKQF PNVRTADFAT IDEAIQQGVL 
DKTAVQPRSA ILAVAGPIND DEIPLTNCDW VVRPKTMIEG LGMEDVLVVN DFEAQALAIA
ALSDENRERI GDATGDMIAS RVVLGPGTGL GVGGLVHAQH SWIPVPGEGG HVDLGPRSKR
DYEIFPHIET IEGRVSAEQI LCGRGLVNLY HAICVVDGIQ PTMKDPADIT SHALAGSDKA
AVETVSLFAT YLGRVAGDMA MVFMARGGVY LSGGISQKII PALKKPEFRI AFEDKAPHTA
LLRTIPTYVV THPLAALAGL SSYARMPANF GVSTEGRRWR R
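
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal Python sketch of that translation follows. It assumes the standard codon assignments, since table 11 maps codons to amino acids in the same way as the standard code and differs in the start codons it permits. The helper names `clean`, `translate` and `gc_percent` are illustrative and not part of the database.

```python
from itertools import product

# Codon table in TCAG order; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks that group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as printed in the record.
print(translate("ATGCCAAAAC CGAACAACAG CATCGCTCCG"))  # prints MPKPNNSIAP
```

The ten residues printed are the first ten of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value to compare with the GC content field.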