Gene RPC_4356 details


Gene Information

Locus tag: RPC_4356
Symbol:
ID: 3970833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4856491
End bp: 4857411
Gene Length: 921 bp
Protein Length: 306 aa
Translation table: 11
GC content: 70%
IMG OID: 637927465
Product: 4-diphosphocytidyl-2-C-methyl-D-erythritol kinase
Protein accession: YP_534198
Protein GI: 90425828
COG category: [I] Lipid transport and metabolism
COG ID: [COG1947] 4-diphosphocytidyl-2C-methyl-D-erythritol 2-phosphate synthase
TIGRFAM ID: [TIGR00154] 4-diphosphocytidyl-2C-methyl-D-erythritol kinase
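
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not translated. The function name `cds_lengths` is hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4856491
END_BP = 4857411


def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = cds_lengths(START_BP, END_BP)
print(gene_len, protein_len)  # 921 306, matching the record
```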


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.915644
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0109051
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGGG CGGCTGAGTT GAGCGACGTG ATGTCGGTGC AGGCGCTGCA CGATGAGGCG 
CGTGCCAAGG TCAATCTGAC GCTGCGGGTG CTGGGCCGCC GCGTCGACGG CTATCACGAG
TTGGAAAGCG TGGTGGCGTT CGCCGACTGC GCCGACCGGC TGACGCTGCA AGCCGGATCC
GAACTCAGCC TCACCGCCAC CGGGCCGCGC GTCCAGGAAT GCGGCGACAA TGCCGACAAC
CTGGTGATCA AGGCGGCGCG GCTGCTCGGA GAACGCGTCG CCGATCTGCG CACGGGAAGC
TTCGCGCTCG ACAAGCAGCT GCCGATCGCC GCCGGCATCG GCGGCGGGTC GGCGGATGCC
GCGGCGGCGT TGCGGCTGTT GGCCCGCGCC AACGATCTGG CGCTGGACGA TCCCCGACTG
ATCGATGCCG CGCGAAAGAC CGGCGCCGAC GTGCCGGTGT GCCTAGCCTC AAAATCCTGC
ATCATGACCG GGATCGGCGA AACCCTGCTG CCGCTGGCGC TGCCGCGGCT GCCGGTGGTG
ATGGTCAATC CGCGCGTCGC GGTCGCCACC AAGGACGTGT TCGCGGCGCT CGGGCTGCGC
AGCGGTCAGT TGCGGGTCGG CGTCACCGAC GTCGTCACCG CGCCGAAATG GCCGGACCAG
GCCGCACCGC TCGATGCCTG GATCGCGGTG CTCGCCGCCG GCATCAACGA TCTCGAAGCG
CCGGCGAAGA AGCTGCAGCC GGTGATCGGC GAGGTTTTGA AGCTGCTCGG CAAGGCCCGG
GGCGCGCGGC TGGCGCGGAT GTCGGGGTCG GGTGCCACCT GTTTTGCGAT CTTTGCCGAC
GCCGCCGCAG CCGAGGCCGC GGCGCAAAGC GTCAGCGCAG CGCATCCCGA CTGGTGGGTG
CACGCCGGGA CGCTGGGCTG A
 
Protein sequence
MARAAELSDV MSVQALHDEA RAKVNLTLRV LGRRVDGYHE LESVVAFADC ADRLTLQAGS 
ELSLTATGPR VQECGDNADN LVIKAARLLG ERVADLRTGS FALDKQLPIA AGIGGGSADA
AAALRLLARA NDLALDDPRL IDAARKTGAD VPVCLASKSC IMTGIGETLL PLALPRLPVV
MVNPRVAVAT KDVFAALGLR SGQLRVGVTD VVTAPKWPDQ AAPLDAWIAV LAAGINDLEA
PAKKLQPVIG EVLKLLGKAR GARLARMSGS GATCFAIFAD AAAAEAAAQS VSAAHPDWWV
HAGTLG
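
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not taken from the source database. It assumes the standard codon assignments, which NCBI translation tables 1 and 11 share (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first 60 bp line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Amino-acid assignments in TCAG order, shared by NCBI tables 1 and 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGGCAAGGG CGGCTGAGTT GAGCGACGTG ATGTCGGTGC AGGCGCTGCA CGATGAGGCG"
print(translate(first_line))  # MARAAELSDVMSVQALHDEA
print(f"GC fraction of this fragment: {gc_fraction(first_line):.2f}")
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full 921 bp gene sequence to `gc_fraction` would give the value to compare with the GC content field.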