Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_0521 |
Symbol | |
ID | 6979237 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 534806 |
End bp | 535702 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643395233 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_002280044 |
Protein GI | 209548127 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.601271 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.317281 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAG CGGGCCTGGC CGAGGCTTTC GGCGTCACCG AAGAGGCGCG CGCAAAGATC AATCTCGCTT TGCATGTGAC AGGCCAGCGG GCAGATGGCT ATCATCTGCT CGACATGCTG GTGACCTTTG CCGATTGCGG CGACCGGCTG GGCTTCCTGC CTGCCCAGAC CGACGCCTTC ACCCTGTCGG GTCGCTTCGG CGAGATGCTG GCCGGCGACG GCGGCACCAA TCTGGTGCTG CGGGCGCGCG ATCTCCTGCG CGAGCAGTTC GGCGCCCTCG CCTTCCCCGT CCATATCCAC CTGCAAAAGA ACCTGCCTGT TGCCTCCGGC ATCGGCGGCG GCTCGGCCGA TGCGGCCGCG GCGCTGCGCG GGCTGATGCG GCTCTGGGGC ATGAGCCTGC CGGTGGAGGC GCTTGCCAGT CTGGCGCTGA AGCTCGGCGC CGACGTGCCG ATGTGCCTTG AAAGCCGGCC GCTAATTGCC CGCGGTATCG GCGAGGAGAT CGAGGCGGTG CCGGATCTGC CGGCCTTTGC CATGGTGCTC GCCAATCCGC TGAAGGGTGT GTCGACGCCT GAGGTGTTCC GCCGGCTGAC GACAAAGAAC AATTCGGCCC TGAGCCTCGC ACCCGGTCTG TCCGGGAGTG CCGGCTGGCT GGCAGTAATC GATGCCGCCC GCAATGACCT GGAACCGCCG GCGCGTCAGC TGGTGCCCGA GATTGCGGTG ATCTCGGCGA TGCTGCAGGC CCGCGGCGCG CTTTTGACGC GGATGTCCGG CTCCGGCGCT ACCTGTTTCG GGATCTTTGC GAGCATGGCT GAGGCGCAAG ACGCGGCGGC AGCCCTTCAC GGCGAGCGGC CCGACTGGTA TTTCCAGGCG ACGGAAACGG TTTCGGGAGG CATGTGA
|
Protein sequence | MPEAGLAEAF GVTEEARAKI NLALHVTGQR ADGYHLLDML VTFADCGDRL GFLPAQTDAF TLSGRFGEML AGDGGTNLVL RARDLLREQF GALAFPVHIH LQKNLPVASG IGGGSADAAA ALRGLMRLWG MSLPVEALAS LALKLGADVP MCLESRPLIA RGIGEEIEAV PDLPAFAMVL ANPLKGVSTP EVFRRLTTKN NSALSLAPGL SGSAGWLAVI DAARNDLEPP ARQLVPEIAV ISAMLQARGA LLTRMSGSGA TCFGIFASMA EAQDAAAALH GERPDWYFQA TETVSGGM
|
| |