Gene RPB_1148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1148 
Symbol 
ID3909236 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1320460 
End bp1321479 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content68% 
IMG OID637883042 
Productalcohol dehydrogenase 
Protein accessionYP_484769 
Protein GI86748273 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.712603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGCGA TGGTCCTGCG GGAACACGGG GGGCTCGACA AGCTCACCTT CGATCCGAAT 
TTCCCCGATC CCGACATCGG ACCCGGCGAC GTGCTGTTGC GCGTGCGCGC GACCTCGCTG
AACTATCACG ACATCTTCAC CCGTCGCGGC ATGCCGGGCA TCAAGATCCC GTTGCCGGTG
ATCATGGGGC TCGATGTCGC AGGCGAGATC GTCGTGGTCG GCGACGGCGT CGAGGGCTGG
AAGGCCGGCG ACCGCGTGCT GGTCGATCCG CTCAACCGCG TCGAAGGCGG GCTGATGGGC
GAGACCATGA ATGGCGGCCT CGCCGAATTG TGCAAGGCGC GCGCGCATCA GCTCGTGCGT
ATCCCCGATA ATGTGAGCTT CGAACAGGCC GCGGCGCTGC CGGTCGCCTA CGGCACCGCG
CACCGCATGA TGACCACCAA CGGGCAGGTC AAGGCGGGCG AGAAGGTGCT GATCCTCGGC
GCCTCCGGCG GCGTCGGCGT GTGCTGCGTG CAGCTCGCCA AGATCGCCGG CGCTTACGTG
ATCGCCTGCG CCGGCTCCGC CGAGAAAGGC GAGCGCCTGA AGCAGCTCGG CGCCGACGAG
GTCATCCTCT ACACGCAGGA AGACTTCATG CAGGTGGTGC GGCAACGCCA TGGCCGGCCG
GCGCGGGTCG GCGGCACAGG CTCGGAGAAC GGCGGCGTCG ACGTCGTGGT GAATTTCACG
GGCGGCGACA CCTGGGTGAA GTCGCTGCGC ACGCTCAAGC TCGGCGGTCG CATCCTGACC
TGCGGCGCCA CCGCGGGCTA CGATCCGGCC GAGGATCTGC GCGTGATCTG GACGTTCGAG
TTGCAGGTCC GCGGCTCCAA CGGCTGGGAG CGCGACGACA TCGAGAAGCT GTTCGGGCTG
CTGTCGTCGG GACGGCTCAC CGCCAAGGTC GACAAGGCGT TTCCGCTGCA GCAGGCCGCG
GATGCACTGG CGATGCTGGA AGACCGCACC GTGTTCGGAA AAGTCGTGGT GACGCCATGA
 
Protein sequence
MRAMVLREHG GLDKLTFDPN FPDPDIGPGD VLLRVRATSL NYHDIFTRRG MPGIKIPLPV 
IMGLDVAGEI VVVGDGVEGW KAGDRVLVDP LNRVEGGLMG ETMNGGLAEL CKARAHQLVR
IPDNVSFEQA AALPVAYGTA HRMMTTNGQV KAGEKVLILG ASGGVGVCCV QLAKIAGAYV
IACAGSAEKG ERLKQLGADE VILYTQEDFM QVVRQRHGRP ARVGGTGSEN GGVDVVVNFT
GGDTWVKSLR TLKLGGRILT CGATAGYDPA EDLRVIWTFE LQVRGSNGWE RDDIEKLFGL
LSSGRLTAKV DKAFPLQQAA DALAMLEDRT VFGKVVVTP