Gene RPB_2322 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2322 
Symbol 
ID3908953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2670208 
End bp2671221 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content69% 
IMG OID637884219 
Productalcohol dehydrogenase 
Protein accessionYP_485938 
Protein GI86749442 
COG category[C] Energy production and conversion
[R] General function prediction only 
COG ID[COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases 
TIGRFAM ID[TIGR02817] zinc-binding alcohol dehydrogenase family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0672997 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.160147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCCG TAGGCTACGC CAAATCCCTC CCGATCGACG ACGCCGACGC GCTGATCGAT 
GTCGACGTCG CCAGGCCGGA GCCAGGCCCG CGCGACCTGC GCGTCGCCGT GAAGGCGATC
TCGGTCAATC CGGTCGACTT CAAGGTCCGC AAGCGCGCCG CGCCGCCGGC CGGCCAGATC
AAGATTCTGG GCTACGACGC CGCCGGCGTG GTCGAGTCCG TGGGCAGCGA GGTGACGCTG
TTCCGGCCCG GCGACGAGGT GTTCTACGCC GGTTCGATCG GCCGCCAGGG CACCAATGCC
GAATTGCATC TGGTCGACGA GCGCATCGTC GGCCGCAAGC CGACGACACT GTCGTTCGCG
CAGGCGGCGG CGCTGCCGCT GACCTCGATT ACCGCGTGGG AACTGCTGTT CGACCGGCTC
GGCGTGCGGC CCGGCAAGGC GCACGACCCG CGCACGCTGC TGATCACCGG CGGCGCCGGC
GGCGTCGGCT CGATCCTGAT CCAGCTGGCG CGCCGGCTCA CCGCGCTGAC CGTGGTGGCG
ACCGCGTCGC GGCCGCAGAC GCAGGCCTGG TGCCGCGAGC TCGGCGCCGA TGCGGTGATC
GATCACAGCC GGGCGATGCA GCCGCAGATC GACGCCTTGA AGCTGCCGCC GGTGGCGCTG
ATCGCCAGCC TCACCAACAC CGATCAGCAT TTCCCGGCGC TGGTCGAGAT CCTGGCGCCC
CAGGGCAAGG TGGCGCTGAT CGACGACCCG GCGACGCTGA ACCCGATGCT GCTGAAGCCG
AAATCGGCGT CGCTGCATTG GGAGGCGATG TTCGCGCGCT CGACCTACAC CACGCCCGAC
ATGATCGCGC AGCACGACCT GTTGAACGAA GTCGCAGACC TGATCGACGC CGGCGTGCTG
CGCACCACGC TCGACCAGAC CTTCGGGACG ATCAATGCCG CGAATTTGAG ACGCGCCCAC
GCGCTGCTGG AGAGCGGCAA ATCGGTCGGC AAGATCGTGC TGGAGGGGTG GTAG
 
Protein sequence
MKAVGYAKSL PIDDADALID VDVARPEPGP RDLRVAVKAI SVNPVDFKVR KRAAPPAGQI 
KILGYDAAGV VESVGSEVTL FRPGDEVFYA GSIGRQGTNA ELHLVDERIV GRKPTTLSFA
QAAALPLTSI TAWELLFDRL GVRPGKAHDP RTLLITGGAG GVGSILIQLA RRLTALTVVA
TASRPQTQAW CRELGADAVI DHSRAMQPQI DALKLPPVAL IASLTNTDQH FPALVEILAP
QGKVALIDDP ATLNPMLLKP KSASLHWEAM FARSTYTTPD MIAQHDLLNE VADLIDAGVL
RTTLDQTFGT INAANLRRAH ALLESGKSVG KIVLEGW