Gene Rleg2_5597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5597 
Symbol 
ID6978691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1242575 
End bp1243555 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content58% 
IMG OID643394695 
ProductGHMP kinase 
Protein accessionYP_002279513 
Protein GI209547595 
COG category[R] General function prediction only 
COG ID[COG2605] Predicted kinase related to galactokinase and mevalonate kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATCGTTT CATCCGCACC TTTTCGGGTC AGTTTCGCCG GCGGCGGCTC GGACATCGCG 
TCCTATTACC GCCGTCAGGC GGGCGCGGTT TTGTCCTGTG CGATAGCCAA ATACAGCTTC
GTGATCGTGC ACAACTATTT CAATGAAAAC AAATATCATC TGAAATATAC GCGCACCGAA
CTTGCCGAGA CGCTTGAGGA GATTGCGCAT CCGCTGCTGC GCGAGGCACT TCGGATGCAT
AGGGTCGAGC CCGGCATCGA AGTCGCTTCC GTCGCCGACA TTCCCTCCGG AACGGGGTTG
GGATCATCCA GCTCCTTTTC CGTGGCACTG ATCAACGCAC TCTATGCTCA CAGGTCGCGT
TTTGCCTCCA AGGACCAGCT GGCGGAAGAA GCCTGCAAAC TCGAAATCGA TATCCTCAAG
GAGCCGATCG GCAAACAGGA CCAATATGCA GCCGCGCATG GCGGTTTGAA CTTCATCGAG
TTCAATTCCA ACGGCAGCGT GAATGTTCAG CCCGTCGTCC TGAGCTCTGA AAAGATGGCG
GAGCTTGAGA GCAACATCCT GTTGTTTTTC ACCGGAAGCC AGCGCGATAC GCGCTCTGTG
CTGTCGACGC AGGTGCAGGC CATGGAGGCG GACGAGGAAA AATTCCGGAC CGTCGAGCGC
ATGGTGCAAC TGGCTTACGA AATGCGCGAC ATCCTGATGA GCGGGGATCT TGGCGCCTTC
GGCGAAGCGC TGCATCGCGG ATGGATGATG AAAAGATCGC TGACCTCGAA GATCACCAAC
AGCGCGATCG ACGAGTTTTA CGATGCGGCC CGCGCTGCCG GCGCCATCGG CGGCAAGCTC
GCGGGTGCTG GCGGAGGGGG CTTCCTCGTC CTCTACTGCC CGAAGGACCG ACAAGAAAAA
GTGCGCCGGG CGCTGTCGCA GCTCAAGGAA ATCGAGTTTC GCTTCGACTG GAGCGGCGCG
CGCATCGCCT TTGCACAATA G
 
Protein sequence
MIVSSAPFRV SFAGGGSDIA SYYRRQAGAV LSCAIAKYSF VIVHNYFNEN KYHLKYTRTE 
LAETLEEIAH PLLREALRMH RVEPGIEVAS VADIPSGTGL GSSSSFSVAL INALYAHRSR
FASKDQLAEE ACKLEIDILK EPIGKQDQYA AAHGGLNFIE FNSNGSVNVQ PVVLSSEKMA
ELESNILLFF TGSQRDTRSV LSTQVQAMEA DEEKFRTVER MVQLAYEMRD ILMSGDLGAF
GEALHRGWMM KRSLTSKITN SAIDEFYDAA RAAGAIGGKL AGAGGGGFLV LYCPKDRQEK
VRRALSQLKE IEFRFDWSGA RIAFAQ