Gene Rleg_1809 details

Gene Information

Locus tag: Rleg_1809
Symbol: ispDF
ID: 8012867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 1799934
End bp: 1801151
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 62%
IMG OID: 644824400
Product: bifunctional 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase/2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase protein
Protein accession: YP_002975633
Protein GI: 241204537
COG category: [I] Lipid transport and metabolism
COG ID: [COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase
TIGRFAM ID: [TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
            [TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase
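
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the stop codon (the record does not state either convention; the function names are illustrative only):

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
length = gene_length_bp(1799934, 1801151)
print(length)                     # 1218
print(protein_length_aa(length))  # 405
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.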


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.274238
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00324888
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCTGCAAA TGCCTTCTAA GCAACCGATA TCGGCTGGAA TTGTCATCGT GGCCGCCGGC 
CGCGGCGAGC GCGCAGGATC ATCCAAGGAA GGCCCCAAGC AGTATCGCAT GATCGGCGGC
AAGCCGGTTA TCGTGCATAC GCTTGAAAAC TTCATGACAT GGGAAGCGGC AACTGAGATC
GTCGTCGTCA TCCATCCCGA TGACGAGGCG CTGTTTGCGA GAGCCTTCCG CCACATCATC
TCGGCAACAC CGATCGAAAC GGTGCATGGG GGTGCGACTA GGCAGCAATC CGTGCTTGCC
GGCCTCAGAT ACCTCAAGGA CAAGCAAATC AGCCATGTGC TGATCCACGA TGCCGTTCGG
CCGTTCTTCG ACCATGATCT CCTGGACCGC ATTGCCGAGA GCCTTAATGC CGGCGCGCAG
GCGGTCCTGC CGGCGATCCC GGTCACCGAT ACGCTGAAAC GCGCCGACAA CACAGGCACG
GTGCTGACGA CGGTTTCACG CGAGCATCTT TACGCTGCGC AGACGCCGCA ATCCTTCGCC
TTCGAAACGA TCCTCGAGGC TCATGAGAAG GCGGCTTCAA GCGGGCGAAG CGATTTCACC
GACGATGCCT CGATCGCCGA ATGGCTGGGC ATTCCGGTGA CGATCGTCGA GGGCACGGCC
GACAACGTCA AGCTGACGGT CAAGAACGAT ATCGCCATGG CCGACGACAA GCTGTCGGCT
CCGCTGCTTC CGGACGTGCG CACCGGCAAC GGCTACGACG TGCACCAGCT CGTAGTCGGC
GATGGCGTGA CACTCTGCGG CGTGTTCATT CCGCATGACC AGAAGCTCAA AGGACACTCC
GATGCCGACG TCGCGCTGCA TGCGCTGACG GACGCGCTGC TTGCCACGTG CGGCGCCGGC
GATATCGGCG ATCATTTCCC ACCGTCCGAT CCGCAATGGA AGGGGGCAGC TTCGCGGATC
TTCATCGAGC ATGCTGCCCG GATCGTGCGC GAGCGCGGCG GCACGATCAT GAATGCCGAC
GTCTCGCTGA TCGCCGAGGC GCCGAAGGTC GGCCCGCATC GCGACGCCAT GCGGGCGAAA
CTATCGGACT ATCTCGGCAT TGACATCGAG CGTTGCTCGG TCAAGGCGAC GACCAACGAG
ACGATCGGCT TCGTCGGCCG GCGCGAAGGC ATCGCGGCGA TTGCGACGGC GACCGTCGTC
TATCGCGGGA GGAAATGA
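
The sequence above is printed in blocks of 10 bases, so the spacing has to be removed before any computation. As a rough sketch of how a GC content figure like the one in the Gene Information section could be recomputed, assuming GC content means the fraction of G and C bases over the full gene length (the helper names are hypothetical):

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here as a short excerpt;
# pass the complete sequence to get the gene-wide value.
excerpt = "ATGCTGCAAA TGCCTTCTAA GCAACCGATA TCGGCTGGAA TTGTCATCGT GGCCGCCGGC"
print(len(clean_sequence(excerpt)))
print(f"{gc_fraction(excerpt):.1%}")
```

The value for a 60-base excerpt will generally differ from the gene-wide figure; only the full sequence is comparable with the reported percentage.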
 
Protein sequence
MLQMPSKQPI SAGIVIVAAG RGERAGSSKE GPKQYRMIGG KPVIVHTLEN FMTWEAATEI 
VVVIHPDDEA LFARAFRHII SATPIETVHG GATRQQSVLA GLRYLKDKQI SHVLIHDAVR
PFFDHDLLDR IAESLNAGAQ AVLPAIPVTD TLKRADNTGT VLTTVSREHL YAAQTPQSFA
FETILEAHEK AASSGRSDFT DDASIAEWLG IPVTIVEGTA DNVKLTVKND IAMADDKLSA
PLLPDVRTGN GYDVHQLVVG DGVTLCGVFI PHDQKLKGHS DADVALHALT DALLATCGAG
DIGDHFPPSD PQWKGAASRI FIEHAARIVR ERGGTIMNAD VSLIAEAPKV GPHRDAMRAK
LSDYLGIDIE RCSVKATTNE TIGFVGRREG IAAIATATVV YRGRK
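
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (table 11 differs mainly in which codons may act as start codons, which does not matter here because the gene starts with ATG). The table layout and function name are illustrative, not taken from the source:

```python
# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
print(translate("ATGCTGCAAA TGCCTTCTAA GCAACCGATA TCGGCTGGAA TTGTCATCGT GGCCGCCGGC"))
# MLQMPSKQPISAGIVIVAAG
print(translate("TATCGCGGGA GGAAATGA"))
# YRGRK
```

Both outputs match the corresponding ends of the protein sequence listed above; the final TGA codon is the stop and contributes no residue.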