Gene RPB_3418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3418 
Symbol 
ID3911220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3906995 
End bp3908104 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content65% 
IMG OID637885321 
Productalcohol dehydrogenase 
Protein accessionYP_487025 
Protein GI86750529 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR02818] S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.128659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.268531 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGACCC GCGCCGCCGT CGCATTCGAG GCCAAGAGGC CGCTCGAAAT CGTCGAACTG 
GATCTCGACG GGCCGAAGGC CGGCGAAGTG CTGGTCGAGA TCAAGGCGAC CGGGATCTGC
CACACCGACG CCTATACGCT CGACGGCCTC GACTCCGAAG GCATCTTCCC CTCGATCCTC
GGCCACGAGG GCGCGGGCAT CGTGCGCGAG GTCGGCGCCG GCGTCACCTC GGTGAAGCCC
GGCGATCACG TCATTCCGCT GTACACGCCG GAGTGCCGGC AGTGCAAAAG CTGCCTGAGC
CAGAAGACCA ATCTGTGCAC CTCGATCCGC GCCACCCAGG GCAAGGGCGT GATGCCGGAC
GGCACCTCGC GCTTCAGCTA TCAGGGCAAG CCGATCTATC ATTACATGGG CTGCTCGACG
TTTTCGAACT TCACCGTGCT GCCGGAGATC GCGCTGGCGA AGATCCGCGA CGACGCGCCG
TTCGACAAGA GCTGCTACAT CGGCTGCGGC GTCACCACCG GCGTCGGCGC GGTGGTCAAC
ACCGCGAAGG TGACGCCCGG CTCCAATGTC GTGGTGTTCG GCCTCGGCGG CATCGGCCTC
AACGTCATTC AGGGCGCGCG GATGGTCGGC GCCGACAAGA TCGTCGGCGT CGACATCAAC
GACGACAAGG AGGAATGGGG CCGCCGCTTC GGCATGACGC ATTTCGTCAA TCCGAAGACA
ATCGACGGCG ACATCGTCCA GCACCTCGTC GGCCTGACCG ACGGCGGCGC CGACTACACG
TTCGACTGCA CCGGCAACAC CACTGTGATG CGCCAGGCGC TGGAAGCCTG CCACCGCGGC
TGGGGCGTCT CGGTGGTGAT CGGCGTCGCC GAAGCCGGCA AGGAAATCTC GACGCGGCCG
TTCCAGCTCG TCACCGGCCG GGTCTGGAAA GGCAGCGCCT TCGGCGGCGC CCGCGGCCGC
ACCGACGTGC CGAAAATCGT CGACTGGTAC ATGAACGGCA AGATCGAGAT CGACCCGATG
ATCACCCACG TGCTCAAGCT CGAGGAGATC AACAAGGGTT TCGAGCTGAT GCACGAGGGC
AAGTCGATCC GGTCGGTGGT GGTGTTCTAG
 
Protein sequence
MKTRAAVAFE AKRPLEIVEL DLDGPKAGEV LVEIKATGIC HTDAYTLDGL DSEGIFPSIL 
GHEGAGIVRE VGAGVTSVKP GDHVIPLYTP ECRQCKSCLS QKTNLCTSIR ATQGKGVMPD
GTSRFSYQGK PIYHYMGCST FSNFTVLPEI ALAKIRDDAP FDKSCYIGCG VTTGVGAVVN
TAKVTPGSNV VVFGLGGIGL NVIQGARMVG ADKIVGVDIN DDKEEWGRRF GMTHFVNPKT
IDGDIVQHLV GLTDGGADYT FDCTGNTTVM RQALEACHRG WGVSVVIGVA EAGKEISTRP
FQLVTGRVWK GSAFGGARGR TDVPKIVDWY MNGKIEIDPM ITHVLKLEEI NKGFELMHEG
KSIRSVVVF