Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_0565 |
Symbol | |
ID | 8011753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 586881 |
End bp | 587792 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644823155 |
Product | 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase |
Protein accession | YP_002974408 |
Protein GI | 241203312 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase |
TIGRFAM ID | [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGTCGT CCGAAAACCG CTCACACTTT TCGGCATCAT GCTCTAGCTT TTCCGAGGAA GCACGCGCGA AAATCAACCT CGCCCTGCAT GTGACGGGCC AACGGCCGGA TGGCTATCAT CTGCTCGATA TGCTGGTGAC CTTCGCAGAT CACGGCGATC GGCTGGACTT CATGCGGTCG CCGACCGACG CATTGACCCT GTCCGGCCGT TTTGGCGAAA CGCTTGCCGG CGACGGCGGC ACCAACCTGG TGCTGAAGGC CCGCGACCTT CTGCGTGAGG TGGTCGGTCC CCTCGCCTTT CCCGTCCGCA TCCATTTGGA AAAGAACCTG CCGATCGCCT CCGGCATCGG CGGCGGCTCG GCCGATGCGG CCGCGACGCT GCGCGGGCTG ATGCGGCTCT GGGGCACGAC GTTATCTGCG GAGACGCTCG CAGCATTGGC TCTGAAGCTC GGCGCCGACG TGCCGATGTG CCTGGAGAGC CGGCCGCTGA TTGCCCGCGG CATCGGCGAA AAGATCGAAC CGGTGCCTGA TCTGCCGGCC TTTGCCATGG TGCTTGCCAA TCCGCTGAAG GGCGTCGCGA CGCCTGAGGT TTTCCGCCGG CTGGCGACAA AGAACAATCC GGCCCTGAGC CTGGCTTTGA GCGGGTCTCA GGCTGCCGAC TGGCTCGCGG CGATTGCTGC TGCCCGCAAC GACCTGGAGC CGCCGGCGCG CGGACTCGTG CCCGAGATTG CGGCGATCTC GGCGATGCTG CAGGCTCGCG GCGCCCTGTT GACCCGCATG TCCGGTTCCG GCGCCACCTG TTTCGGCATC TTTGCGAGCA TGACTGCTGC CGAAGGTGCG GCGGCAGCTC TTCACGACAA GCGCCCTGAC TGGTATTTCC AGGCGACAGA AACGGTTTCG GGAGGCGCGT GA
|
Protein sequence | MMSSENRSHF SASCSSFSEE ARAKINLALH VTGQRPDGYH LLDMLVTFAD HGDRLDFMRS PTDALTLSGR FGETLAGDGG TNLVLKARDL LREVVGPLAF PVRIHLEKNL PIASGIGGGS ADAAATLRGL MRLWGTTLSA ETLAALALKL GADVPMCLES RPLIARGIGE KIEPVPDLPA FAMVLANPLK GVATPEVFRR LATKNNPALS LALSGSQAAD WLAAIAAARN DLEPPARGLV PEIAAISAML QARGALLTRM SGSGATCFGI FASMTAAEGA AAALHDKRPD WYFQATETVS GGA
|
| |