Gene RPB_1340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1340 
SymbolispH 
ID3907848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1526621 
End bp1527586 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content68% 
IMG OID637883234 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase 
Protein accessionYP_484961 
Protein GI86748465 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.495158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.45404 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCCA AACCGCCTCT GAAGATCGTC CTTTGCTCGC CGCGCGGCTT CTGCGCCGGC 
GTGGTGCGGG CGATCGACAC CGTGGAGCGT GCGCTGGCGC TGTACGGCGC CCCGGTCTAC
GTCCGCCACG AGATCGTCCA CAACAAATAC GTGGTCGACA GCCTGCGCGC CAAGGGCGCG
ATTTTCGTCG AGGAACTGGG CGAGATTCCG GACACCCGGG CCCCGGTAGT GTTCTCCGCC
CATGGCGTGC CGAAATCGGT TCCGGAAGAT GCGCTGCAGC GGAATTTTTT CTCGGTCGAC
GCCACCTGTC CGCTGGTCAC CAAGGTGCAT CGCGAGGCGG CGATCCATTT CAAGCGCGGC
CGCGAGATCC TGCTGATCGG GCATTCGCAT CACCCGGAAG TGGTCGGCAC GCTGGGCCAG
CTTCCCCCCG GCGCGGTGAC ACTGATCGAG ACCGCCGCCG ACGCCCAGAG CTTCACCCCG
AAGAACCCGG ACAATCTCGC TTTCGTGACC CAGACCACGC TGTCGATCGA CGACACCGCC
GGCATCGTGG CGATCCTGCG CCAGCGCTTC CCGAACATCT CCGGGCCGCA CAAGGAAGAC
ATCTGCTACG CCACCACCAA CCGCCAGGCG GCGGTGAAGA AGGTGGCGCC GGTGGTCGAT
GCGATGATCG TGGTCGGTGC GCCGAATTCG TCGAATTCGC AGCGGCTGCG CGAGGTCGCC
GAGCGCGAGG GCTGCCGGGT CTCGGTGCTG GCGCAGCGCG CCGCCGATAT CGACTGGGCG
CAATTCGAGG GCATCACCAG CCTCGGCCTC ACCGCCGGCG CCTCCGCCCC GGAGGTGATC
GTCGAGGAGA TCATGGGCGC CTTCGCCGAG CGCTTCGAGC TCAGCGTCGA GACGGTGTCC
GCGGCGGAGG AGAACGAGTT CTTCCCGCTG CCGCGCGTGC TGCGCCCGGA CGCCGCCGCC
GAATAG
 
Protein sequence
MLAKPPLKIV LCSPRGFCAG VVRAIDTVER ALALYGAPVY VRHEIVHNKY VVDSLRAKGA 
IFVEELGEIP DTRAPVVFSA HGVPKSVPED ALQRNFFSVD ATCPLVTKVH REAAIHFKRG
REILLIGHSH HPEVVGTLGQ LPPGAVTLIE TAADAQSFTP KNPDNLAFVT QTTLSIDDTA
GIVAILRQRF PNISGPHKED ICYATTNRQA AVKKVAPVVD AMIVVGAPNS SNSQRLREVA
EREGCRVSVL AQRAADIDWA QFEGITSLGL TAGASAPEVI VEEIMGAFAE RFELSVETVS
AAEENEFFPL PRVLRPDAAA E