Gene RPD_0317 details


Gene Information

Locus tag: RPD_0317
Symbol: ispG
ID: 4020776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 365780
End bp: 367105
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 67%
IMG OID: 637960495
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_567456
Protein GI: 91974797
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
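
The length fields above can be cross-checked against the coordinates. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 365780
end_bp = 367105
gene_length_bp = 1326
protein_length_aa = 441

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_protein_length = codons - 1

print(span, codons, remainder, expected_protein_length)
# prints: 1326 442 0 441
```

Under those assumptions the coordinates, the gene length, and the protein length in the record agree with each other.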


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACAAA AACCTGATAT TCCGGACGCC ATGAACAAGC TCGAAAATCC GCTGCAGAAC 
GACGTCGCCG GCCCCGCGCC GCGCCGCCAA ACCACCCAGG TCATGGTCGG CGATGTCGCC
GTCGGCGGCG GTGCGCCGAT CGTCGTGCAA TCGATGACCA ACACCGACAC CGCGGATGTC
GAAGGCACCA TCAAGCAGGT CGCCGCACTC GCCCGCGCCG GTTCCGAAAT GGTCCGGATC
ACCGTCGACC GCGAAGAGGC CGCCGCAGCC GTGCCGCACA TCCGCGACGG CCTCCGCAAG
CTCGGCCTGA CCACGCCGCT GATCGGCGAT TTCCATTACA TCGGCCACAA GCTGCTCGCC
GACTATCCGG CCTGCGCCGA GGCGCTCGAC AAATACCGGA TCAATCCGGG CAATGTCGGC
TTCAAGAACA AGCGCGACAG CCAGTTCACC GACATCGTCG AGATTGCGAT CAAGAACAAC
AAGGCGGTCC GGATAGGCGC CAATTGGGGC TCGCTCGACC AGGAGCTGCT GACCAAGCTG
ATGGACGAGA ACGCCGCCTC CGCCCAGCCG CGCGACGTCC GCGCGGTCAC CCGCGAGGCG
ATGGTGCAGT CGGCGCTGCT GTCGGCGGCG CGCGCCGAGG AGATCGGCCT GCCGAAGACC
AAGATGATCC TGTCGGCCAA GGTCTCGGCG GTGCAGGATC TGATCGCGGT GTATCAGGAC
CTCGCGTCGC GCTCGGACTA CGCCATCCAT CTCGGCCTGA CCGAGGCCGG CATGGGCTCG
AAGGGCATCG TCGCCTCCTC GGCCGCGCTC GGCATCCTGC TGCAGCAGGG CATCGGCGAC
ACCATCCGGA TTTCGCTCAC GCCGGAGCCC GGCGGCGATC GCACGCTGGA AGTTCAGGTC
GCGCAGGAAC TGCTGCAGAC GATGGGCTTC CGCACCTTCG TGCCGCTGGT CGCGGCTTGC
CCGGGCTGCG GCCGCACCAC CTCGACGACG TTCCAGGAAC TGGCCCGCTC GATCCAGGAT
TTCATCCGGC TGGAAATGCC CAGCTGGAAG ACCCGCTATC CTGGCGTCGA GAACCTCAAC
GTCGCGGTGA TGGGCTGCAT CGTCAACGGC CCGGGCGAGA GCAAGCACGC CAATATCGGC
ATCTCGCTGC CCGGCACCGG CGAAAGCCCG GCCGCCCCGG TGTTCGTCGA CGGCCAGAAG
TTCCGCACTC TGCGCGGCCC GTCGATCGCC ACCGATTTCA AGGCGCTGGT GATCGACTAT
ATCGAGCAGC GCTACGGCGC GGGCACCAAG CCCGGCGCGC CGCAAATGGT GCCGGCGGCG
GAGTAA
 
Protein sequence
MSQKPDIPDA MNKLENPLQN DVAGPAPRRQ TTQVMVGDVA VGGGAPIVVQ SMTNTDTADV 
EGTIKQVAAL ARAGSEMVRI TVDREEAAAA VPHIRDGLRK LGLTTPLIGD FHYIGHKLLA
DYPACAEALD KYRINPGNVG FKNKRDSQFT DIVEIAIKNN KAVRIGANWG SLDQELLTKL
MDENAASAQP RDVRAVTREA MVQSALLSAA RAEEIGLPKT KMILSAKVSA VQDLIAVYQD
LASRSDYAIH LGLTEAGMGS KGIVASSAAL GILLQQGIGD TIRISLTPEP GGDRTLEVQV
AQELLQTMGF RTFVPLVAAC PGCGRTTSTT FQELARSIQD FIRLEMPSWK TRYPGVENLN
VAVMGCIVNG PGESKHANIG ISLPGTGESP AAPVFVDGQK FRTLRGPSIA TDFKALVIDY
IEQRYGAGTK PGAPQMVPAA E
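
The sequences above are printed in blocks of ten characters, so the spaces and line breaks need to be removed before any analysis. The following is a minimal sketch of that cleanup, a GC-fraction calculation, and a translation of the first and last rows of the gene sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is written out by hand. It uses the standard amino-acid assignments, which NCBI translation table 11 shares with table 1 (the two differ only in which start codons they allow).

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGTCACAAA AACCTGATAT TCCGGACGCC ATGAACAAGC TCGAAAATCC GCTGCAGAAC"
last_row = "GAGTAA"

print(translate(clean_sequence(first_row)))  # prints: MSQKPDIPDAMNKLENPLQN
print(translate(clean_sequence(last_row)))   # prints: E*
```

The translated first row matches the first two blocks of the protein sequence, and the final codon is a stop. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.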