Gene Rleg2_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0521 
Symbol 
ID6979237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp534806 
End bp535702 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content67% 
IMG OID643395233 
Product4-diphosphocytidyl-2-C-methyl-D-erythritol kinase 
Protein accessionYP_002280044 
Protein GI209548127 
COG category[I] Lipid transport and metabolism 
COG ID[COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase 
TIGRFAM ID[TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.601271 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.317281 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAAG CGGGCCTGGC CGAGGCTTTC GGCGTCACCG AAGAGGCGCG CGCAAAGATC 
AATCTCGCTT TGCATGTGAC AGGCCAGCGG GCAGATGGCT ATCATCTGCT CGACATGCTG
GTGACCTTTG CCGATTGCGG CGACCGGCTG GGCTTCCTGC CTGCCCAGAC CGACGCCTTC
ACCCTGTCGG GTCGCTTCGG CGAGATGCTG GCCGGCGACG GCGGCACCAA TCTGGTGCTG
CGGGCGCGCG ATCTCCTGCG CGAGCAGTTC GGCGCCCTCG CCTTCCCCGT CCATATCCAC
CTGCAAAAGA ACCTGCCTGT TGCCTCCGGC ATCGGCGGCG GCTCGGCCGA TGCGGCCGCG
GCGCTGCGCG GGCTGATGCG GCTCTGGGGC ATGAGCCTGC CGGTGGAGGC GCTTGCCAGT
CTGGCGCTGA AGCTCGGCGC CGACGTGCCG ATGTGCCTTG AAAGCCGGCC GCTAATTGCC
CGCGGTATCG GCGAGGAGAT CGAGGCGGTG CCGGATCTGC CGGCCTTTGC CATGGTGCTC
GCCAATCCGC TGAAGGGTGT GTCGACGCCT GAGGTGTTCC GCCGGCTGAC GACAAAGAAC
AATTCGGCCC TGAGCCTCGC ACCCGGTCTG TCCGGGAGTG CCGGCTGGCT GGCAGTAATC
GATGCCGCCC GCAATGACCT GGAACCGCCG GCGCGTCAGC TGGTGCCCGA GATTGCGGTG
ATCTCGGCGA TGCTGCAGGC CCGCGGCGCG CTTTTGACGC GGATGTCCGG CTCCGGCGCT
ACCTGTTTCG GGATCTTTGC GAGCATGGCT GAGGCGCAAG ACGCGGCGGC AGCCCTTCAC
GGCGAGCGGC CCGACTGGTA TTTCCAGGCG ACGGAAACGG TTTCGGGAGG CATGTGA
 
Protein sequence
MPEAGLAEAF GVTEEARAKI NLALHVTGQR ADGYHLLDML VTFADCGDRL GFLPAQTDAF 
TLSGRFGEML AGDGGTNLVL RARDLLREQF GALAFPVHIH LQKNLPVASG IGGGSADAAA
ALRGLMRLWG MSLPVEALAS LALKLGADVP MCLESRPLIA RGIGEEIEAV PDLPAFAMVL
ANPLKGVSTP EVFRRLTTKN NSALSLAPGL SGSAGWLAVI DAARNDLEPP ARQLVPEIAV
ISAMLQARGA LLTRMSGSGA TCFGIFASMA EAQDAAAALH GERPDWYFQA TETVSGGM