Gene RPD_4030 details


Gene Information

Locus tag: RPD_4030
Symbol: ispH
ID: 4024547
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 4478502
End bp: 4479467
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 66%
IMG OID: 637964233
Product: 4-hydroxy-3-methylbut-2-enyl diphosphate reductase
Protein accession: YP_571150
Protein GI: 91978491
COG category: [I] Lipid transport and metabolism
              [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0761] Penicillin tolerance protein
TIGRFAM ID: [TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming)
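
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(4478502, 4479467)
print(length)                     # 966
print(protein_length_aa(length))  # 321
```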


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.953611
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGGCCA AACCGCCTCT GAAGATCGTC CTCTGCTCGC CGCGCGGGTT CTGCGCCGGC 
GTGGTGCGGG CAATCGACAC CGTCGAGCGT GCGCTGACGC TGTATGGCGC TCCGGTCTAT
GTCCGGCACG AAATCGTCCA TAACAAATAC GTGGTCGACA ACCTCCGCGC CAAGGGCGCG
ATTTTCGTCG AGGAACTCGG GGAAATCCCG GACACCAAGG CGCCGGTTGT CTTCTCCGCC
CACGGCGTTC CGAAGTCGGT CCCCGAGGAC GCGACGGCGA GGAATTTCTT CTCGGTCGAC
GCCACCTGCC CGCTGGTCAC CAAGGTGCAT CGCGAGGCGG CGATCCATTT CAAGCGCGGT
CGCGAGATTC TGCTGATCGG CCATTCCCAT CATCCGGAAG TGGTCGGCAC GCTCGGCCAG
CTGCCGGTCG GCGCGGTCAC CCTGATCGAG ACCGCCGCCG ATGCTTCGAG CTTCACCCCG
AAGAACCCGG ACAACCTCGC TTTCGTGACG CAGACCACGC TGTCGATCGA CGACACCGCC
GGAATCGTCG CGGTGCTGCG CCAGCGCTTC CCGAACATTT CCGGGCCGCA CAAGGAAGAC
ATCTGCTACG CCACCACCAA CCGTCAGGCG GCGGTGAAGA AGGTGGCGCC GGTGGTCGAC
GCGATGATCG TGGTCGGCGC GCCGAATTCC TCGAATTCGC AGCGTCTGCG CGAGGTCGCC
GAGCGCGAAG GCTGCCGGGT CGCGGTGCTG GCGCAGCGCG CCTCCGACAT CGACTGGTCG
CTGTTCGAGG GCATCTCCAG CCTCGGCCTC ACGGCCGGCG CCTCGGCCCC CGAGGTGATC
GTCGAGGAGA TCATGGGCGC CTTCGCCGAG CGCTTCGACC TCAGCGTCGA AACGGTATCG
GCTGCGGAAG AGAACGAGTT TTTCCCGCTC CCCCGCGTGC TCCGCCCCGA CGCGGCTGCG
GAGTAG
 
Protein sequence
MLAKPPLKIV LCSPRGFCAG VVRAIDTVER ALTLYGAPVY VRHEIVHNKY VVDNLRAKGA 
IFVEELGEIP DTKAPVVFSA HGVPKSVPED ATARNFFSVD ATCPLVTKVH REAAIHFKRG
REILLIGHSH HPEVVGTLGQ LPVGAVTLIE TAADASSFTP KNPDNLAFVT QTTLSIDDTA
GIVAVLRQRF PNISGPHKED ICYATTNRQA AVKKVAPVVD AMIVVGAPNS SNSQRLREVA
EREGCRVAVL AQRASDIDWS LFEGISSLGL TAGASAPEVI VEEIMGAFAE RFDLSVETVS
AAEENEFFPL PRVLRPDAAA E
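
The relationship between the two sequence blocks can be spot-checked with a minimal sketch. It strips the grouping spaces, translates codons with the standard amino-acid assignments (which translation table 11 shares with the standard code; table 11's alternative start codons are not handled here, and this gene starts with ATG), and computes a GC fraction. The helper names are hypothetical, and only the first and last fragments of the gene sequence are used.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three groups and the final six bases of the gene sequence above.
print(translate("ATGCTGGCCA AACCGCCTCT GAAGATCGTC"))  # MLAKPPLKIV
print(translate("GAGTAG"))                            # E
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the record.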