Gene Rpal_0520 details


Gene Information

Locus tag: Rpal_0520
Symbol: ispG
ID: 6408169
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 567708
End bp: 569003
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 66%
IMG OID: 642710432
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_001989555
Protein GI: 192288950
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
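
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon, neither of which the record states explicitly. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 567708
END_BP = 569003
GENE_LENGTH_BP = 1296
PROTEIN_LENGTH_AA = 431


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under these assumptions the four fields are mutually consistent: 569003 - 567708 + 1 = 1296 bp, and 1296 / 3 - 1 = 431 aa.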


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACAAGC TCGAAAATCC GCTGCGAGAC GACGTCGCCG GCCCCGCGCC GCGGCACCAA 
ACCACCCAGG TCATGGTCGG CGATGTGGCC GTCGGCGGCG GTGCCCCGAT CGTCGTTCAG
TCGATGACCA ATACCGACAC CGCGGATGTC GAGGGCACCA TCAAGCAGAT CGCCGCGCTG
GCCCGGGCCG GTTCGGAGAT GGTCCGGATC ACCGTCGATC GCGAGGAAGC GGCCGCCGCC
GTCCCGCACA TCCGCGACGG CATCCGCAAG CTAGGCCTGA CCACGCCGAT CATCGGCGAC
TTCCATTACA TCGGCCACAA GCTGCTCGCC GAATACCCGG CGTGCGCCGA GGCGCTCGAC
AAGTACCGGA TCAATCCGGG CAATGTCGGC TTCAAGAACA AGCGTGACAC GCAGTTCGCC
GACATCGTCG AGATCGCAAT CAAGAACAAT AAGGCGGTCC GCATCGGCGC CAATTGGGGT
TCGCTCGACC AGGAGCTGCT CACCAAGCTG ATGGACGAGA ACGCTGCGTC GGCCAATCCG
CGCGACGTCC GCGCCGTCAC CCGCGAGGCG ATGGTCCAGT CGGCGCTGCT GTCGGCCGCG
CGCGCCGAAG AGATCGGCTT GCCGAAGAAC AAGATGATCC TGTCGGCCAA GGTCTCGGCG
GTGCAGGACC TGATCGCCGT GTACCAGGAT CTCGCCTCGC GCTCCGATTA CGCGATCCAC
CTCGGCCTCA CCGAGGCTGG CATGGGCTCG AAGGGCATCG TCGCATCGTC CGCGGCGCTC
GGCATCCTGC TGCAGCAGGG CATCGGTGAC ACCATTCGGA TTTCGCTGAC CCCCGAGCCG
GGCGGTGACC GCACCCGCGA GGTTCAGGTT GGGCAGGAAC TGCTGCAGAC CATGGGCTTC
CGCACCTTCG TGCCGCTGGT TGCGGCCTGC CCGGGCTGCG GCCGCACCAC CTCGACGACG
TTCCAGGAGC TGGCGCGCTC GATCCAGGAT TTCATCCGCG ACGAGATGCC GGAGTGGCGC
AGCCGCTATC CGGGCGTCGA GAATCTCAAC GTTGCGGTGA TGGGCTGCAT CGTCAACGGC
CCGGGCGAAA GCAAGCACGC CAATATCGGC ATTTCGCTGC CCGGCACCGG CGAAACCCCG
GCGGCGCCGG TGTTCGTCGA CGGCGAGAAA TTCCGTACCC TGCGCGGCGA GAATATCGCG
GCCGACTTCA AGGCGCTGGT GATCGACTAC ATCGAGCAGC GCTACGGCGC GACGCCGAAG
CCCGGTGCCG CCCAGATGGT ACCGGCGGCG GAGTAA
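
The GC content field reported for this gene can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as shown here, with spaces between 10-base groups and line breaks between rows. `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(block):
    """Remove the spaces and line breaks used to group the bases."""
    return "".join(block.split()).upper()


def gc_percent(block):
    """Percentage of G and C bases in a (possibly whitespace-grouped) sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGAACAAGC TCGAAAATCC GCTGCGAGAC GACGTCGCCG GCCCCGCGCC GCGGCACCAA"
print(round(gc_percent(first_row), 1))
```

Passing the full 1296 bp sequence instead of the first row would give the gene-wide value to compare with the reported 66%.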
 
Protein sequence
MNKLENPLRD DVAGPAPRHQ TTQVMVGDVA VGGGAPIVVQ SMTNTDTADV EGTIKQIAAL 
ARAGSEMVRI TVDREEAAAA VPHIRDGIRK LGLTTPIIGD FHYIGHKLLA EYPACAEALD
KYRINPGNVG FKNKRDTQFA DIVEIAIKNN KAVRIGANWG SLDQELLTKL MDENAASANP
RDVRAVTREA MVQSALLSAA RAEEIGLPKN KMILSAKVSA VQDLIAVYQD LASRSDYAIH
LGLTEAGMGS KGIVASSAAL GILLQQGIGD TIRISLTPEP GGDRTREVQV GQELLQTMGF
RTFVPLVAAC PGCGRTTSTT FQELARSIQD FIRDEMPEWR SRYPGVENLN VAVMGCIVNG
PGESKHANIG ISLPGTGETP AAPVFVDGEK FRTLRGENIA ADFKALVIDY IEQRYGATPK
PGAAQMVPAA E
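
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a plain Python codon table. It assumes that translation table 11 assigns the same amino acids to internal codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because the gene starts with ATG. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(block):
    """Translate a whitespace-grouped coding sequence up to the first stop codon."""
    seq = "".join(block.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("ATGAACAAGC TCGAAAATCC GCTGCGAGAC"))  # MNKLENPLRD
```

The printed peptide matches the first ten residues of the protein sequence above. The sketch raises a `KeyError` on ambiguous bases such as N, which do not occur in this record.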