Gene RPD_2587 details

Gene Information

Locus tag: RPD_2587
Symbol:
ID: 4023083
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2898580
End bp: 2899776
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 69%
IMG OID: 637962784
Product: 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase
Protein accession: YP_569717
Protein GI: 91977058
COG category: [I] Lipid transport and metabolism
COG ID: [COG0245] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
        [COG1211] 4-diphosphocytidyl-2-methyl-D-erithritol synthase
TIGRFAM ID: [TIGR00151] 2C-methyl-D-erythritol 2,4-cyclodiphosphate synthase
            [TIGR00453] 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase
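
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 2898580
end_bp = 2899776

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1197 398
```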


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.439438
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00531739
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCGAATC CCCCCCGCAC TGCCGCCATC ATCGTCGCCG CCGGCCGCGG CCTCCGCGCA 
GGCGCGGGCG GCCCCAAGCA ATACCGAACG CTGGCCGGCC GACCGGTTAT CGCACGTGCT
TTGCAGCCGT TTTGCACTCA CCCGGAGGTG TTTGCGGTCC AGCCGGTGAC TAACCCGGAC
GACACCGCGA TCTTCAACGA CGCCGTGACA GGCCTGAACT TCCGTCCCGC GGTCGGCGGC
GGCGCGACCC GACAGGGCTC GGTTCGCGCC GGGCTGGAGG CGCTGGCGGA GTTGAACCCT
GACATCGTGC TGATCCACGA CGCAGCGCGT CCCTTTGTCA CGCCCGATCT GATCTCGCGC
GCGATCGTCG CCGCAGGTCA GACCGGCGCC GCGCTGCCGG TCGTCGCGAT CAACGACACC
GTCAAGCAGA TCAACGCCGA AGGCTGCGTC GAGGCGACGC CGGACCGCGC GCGGTTGCGA
ATCGCGCAGA CCCCGCAGGC GTTCCGCTTC GACGTCATCC TCGATGCGCA TCGCCGCGCC
GCGCGCGACG GCCGCGACGA TTTCACCGAC GACGCCGCGA TCGCCGAGTG GGCAGGATTG
ACGGTCTCAA CCTTTGAAGG CGATGCTGCC AACATGAAAC TGACGACGCC GGACGATTTC
ATCCGCGAAG AGAGTCGCCT CACCGCGCTG CTCGGCGACA TCCGCACGGG CACCGGCTAC
GACGTTCACG CCTTCGGCGA CGGCGATCAC GTCTGGCTGT GCGGGCTGAA GGTGCCGCAC
AATCGCGGCT TCCTTGCGCA TTCCGATGGT GACGTCGGCT TGCACGCGCT GGTCGACGCG
ATCCTCGGCG CGCTGGCGGA CGGCGACATC GGCTCGCACT TCCCGCCGAC CGATCCGCAA
TGGAAGGGCG CGGCCTCCGA CAAGTTCTTG AAGTACGCGG TCGAGCGTGT CGCCGCGCGC
GGCGGCCGCA TCGCCAATCT CGAAGTGACG ATGATCTGCG AACGGCCGAA GATCGGCCCG
CTGCGCGAGG CGATGCGCGC CAGGATCGCC GAGATCACCG GGCTTCCCGT GTCGCGCATC
GCGGTCAAGG CCACCACCAG CGAAAGGCTC GGCTTCACCG GGCGCGAGGA AGGCATCGCC
GCCACCGCGT CCGCCACCAT CCGGCTGCCC TGGGGCGCCG AAGGACTGGC CGGCTGA
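
The GC content reported for this gene is the share of G and C bases in the sequence above. A minimal sketch of that calculation follows, assuming the sequence is pasted in as text with the same space-separated blocks of ten bases; `gc_percent` is a hypothetical helper, not a function of the source database.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# Tiny illustrative inputs; paste the full gene sequence to check the record.
print(gc_percent("ATGC"))       # 50.0
print(gc_percent("GGCC GGCC"))  # 100.0
```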
 
Protein sequence
MPNPPRTAAI IVAAGRGLRA GAGGPKQYRT LAGRPVIARA LQPFCTHPEV FAVQPVTNPD 
DTAIFNDAVT GLNFRPAVGG GATRQGSVRA GLEALAELNP DIVLIHDAAR PFVTPDLISR
AIVAAGQTGA ALPVVAINDT VKQINAEGCV EATPDRARLR IAQTPQAFRF DVILDAHRRA
ARDGRDDFTD DAAIAEWAGL TVSTFEGDAA NMKLTTPDDF IREESRLTAL LGDIRTGTGY
DVHAFGDGDH VWLCGLKVPH NRGFLAHSDG DVGLHALVDA ILGALADGDI GSHFPPTDPQ
WKGAASDKFL KYAVERVAAR GGRIANLEVT MICERPKIGP LREAMRARIA EITGLPVSRI
AVKATTSERL GFTGREEGIA ATASATIRLP WGAEGLAG
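
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this on the first 30 bases of the gene. It assumes the usual compact codon layout (bases ordered T, C, A, G) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which start codons are permitted. `translate` is a hypothetical helper and does not handle alternative start codons or ambiguous bases.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCCGAATC CCCCCCGCAC TGCCGCCATC"))  # MPNPPRTAAI
```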