Gene RPB_4494 details


Gene Information

Locus tag: RPB_4494
Symbol:
ID: 3912310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5081241
End bp: 5082239
Gene Length: 999 bp
Protein Length: 332 aa
Translation table: 11
GC content: 68%
IMG OID: 637886397
Product: alcohol dehydrogenase
Protein accession: YP_488088
Protein GI: 86751592
COG category: [C] Energy production and conversion; [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID: [TIGR02824] putative NAD(P)H quinone oxidoreductase, PIG3 family
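
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the record does not state either convention explicitly. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5081241
END_BP = 5082239


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final
    codon is a stop codon that is not part of the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 999 332
```

Under these assumptions the computed values agree with the reported Gene Length (999 bp) and Protein Length (332 aa).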


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.218287
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.353288
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACCGG TCCCTTCGCA AATGACCGTG ATCGGGATCA GCAAGCCCGG CGGACCCGAG 
GTGCTGCTGC CGGAAACGCG CGCCGTGCCC ACGCCCGGCC CCGGTGAAAT CCTGGTCAAG
GTCGCGGCGG CCGGGGTCAA CCGTCCCGAT GTGGCACAGC GCTCCGGCAG CTATCCGCCG
CCGCCCGGCG CCAGCGACCT GCCCGGCCTG GAGATCGCCG GCGAGGTGGT GGCGATCGGC
GACGGCGCGA CCCGGCACAA GCTCGGCGAC AAGGTGATGT CGCTGGTCGC CGGCGGCGGC
TATGCCGAAT ATTGCATCGC GCAGGATGCC CAGGCGATGA CGGTGCCGGC CGGCTTCTCG
ATGATCGAGG CCGGCGCCAC GCCGGAGACG CTGATGACGG TGTGGCACAA CGTGTTCGAG
CGCGGCGCGC TGCAGCCCGG CGAAACCCTG CTGGTGCATG GCGGCTCGTC CGGCATCGGC
ACGATGGCGA TCCAGCTCGC GGTCGCATTT GGCGCCAAGG TGATCGCCAC CGTCGGCGCG
CCCGACAAGG CCGAGGCCTG CCTGAAACTC GGCGCCACCC GGGCGATCAA CTACAAGACC
GAGGATTTCG TCGCAGCCGT CAAGGAAACG ACCAGCGGCA AGGGCGCCGA CGTGATCCTC
GACATGGTCG GCGGCGACTA CATCGAGCGC AATTACGATG CCGCCGCAAT GGACGGCCGG
ATCGTCCAGA TTGCGTTCCT CGGCGGCGCC AAGACCACGG TGAATTTCAC CAAGCTGATG
ATCAAGCGGC TGCACCACAC CGGTTCGACG CTGCGGCCGC GCAGCAATGC CGACAAGGCC
GCGATGGTCG CCGCGATCGA GACCAGGGTG ATGCCGCTGC TCGCCGAAGG CCGGATCAAG
CCGCTGATCG ACGGCACCTT CGCGTTGCAA GACGCCGCCG CCGCCCACCG GCGGATGGAG
ACCAGCCAAC ATATTGGCAA AATTGTGTTG ACGGTCTGA
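
The gene sequence is listed in blocks of 10 bases separated by spaces, so it has to be joined before any calculation. The following is a minimal sketch of how a GC percentage like the one in the GC content field could be computed from such a listing. The helper names are hypothetical, and only the first line of the listing is used here for brevity; the whole listing would be passed in the same way.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing by dropping spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing, copied from this record.
first_line = "ATGGAACCGG TCCCTTCGCA AATGACCGTG ATCGGGATCA GCAAGCCCGG CGGACCCGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```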
 
Protein sequence
MEPVPSQMTV IGISKPGGPE VLLPETRAVP TPGPGEILVK VAAAGVNRPD VAQRSGSYPP 
PPGASDLPGL EIAGEVVAIG DGATRHKLGD KVMSLVAGGG YAEYCIAQDA QAMTVPAGFS
MIEAGATPET LMTVWHNVFE RGALQPGETL LVHGGSSGIG TMAIQLAVAF GAKVIATVGA
PDKAEACLKL GATRAINYKT EDFVAAVKET TSGKGADVIL DMVGGDYIER NYDAAAMDGR
IVQIAFLGGA KTTVNFTKLM IKRLHHTGST LRPRSNADKA AMVAAIETRV MPLLAEGRIK
PLIDGTFALQ DAAAAHRRME TSQHIGKIVL TV
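
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is an illustration rather than the tool that produced this record. It builds the codon table from the usual TCAG ordering; table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons, which does not matter here because this gene starts with ATG. The function name `translate` is an assumption, and the code raises KeyError on any character other than A, C, G or T.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start of the gene sequence listed in this record.
print(translate("ATGGAACCGG TCCCTTCGCA"))  # prints: MEPVPS
```

The six residues printed match the start of the protein sequence above (MEPVPS), and the last nine bases of the gene (ACGGTCTGA) translate to TV followed by a stop, matching its end.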