Gene RPC_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0101 
Symbol 
ID3971321 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp114870 
End bp115979 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content63% 
IMG OID637923217 
Productzinc-binding alcohol dehydrogenase 
Protein accessionYP_529999 
Protein GI90421629 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR02818] S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.618314 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCC GCGCCGCAGT TGCGTTCGAG GCGAAGAAGC CGCTTGAGAT CGTCGAAGTC 
GACTTGGAAG GACCGAAGAC CGGCGAAGTG CTGGTCGAAA TCAAAGCCAC CGGGATTTGC
CACACCGACG CCTACACGCT GGACGGTTTC GACAGCGAAG GAATCTTCCC CTCGATCCTC
GGCCATGAGG GCGCCGGCAT CGTCCGCGAG GTCGGCCCCG GCGTCAGCTC GGTGAAGCCC
GGCGACCATG TGATTCCGCT GTACACGCCG GAATGCCGGC AGTGCAAAAG CTGCCTCAGC
CAGAAGACCA ATCTGTGCAC CGCGATCCGC GCCACCCAGG GCAAGGGCGT GATGCCGGAC
GGCACCTCGC GGTTTAGCTA CAAGGGCCAG CAGATCTTTC ACTACATGGG CTGCTCGACG
TTCTCGAATT TCACCGTGTT GCCGGAGATC GCGGTGGCGA AGATCCGCGA CGACGCGCCG
TTCGACAAGA GCTGCTACAT CGGCTGCGGC GTCACCACCG GGGTCGGCGC CGTGGTCAAC
ACCGCCAAGG TGACGCCCGG GTCCAACGTC GTGGTGTTCG GGCTCGGCGG CATCGGGCTC
AACGTGATCC AGGGCGCCCG GCTGGTCGGC GCCGACAAGA TCATCGGCGT CGATCTCAAT
GACGACAAGG AAGAATGGGG CCGCCGCTTC GGCATGACGC ATTTCGTCAA TCCGAAAAAA
ATCGACGGCG ACATCGTGCA GCATCTGGTC GGGCTGACCG ACGGCGGCGC CGACTACACC
TTCGACTGCA CCGGCAACAC CACGGTGATG CGCCAGGCGC TGGAAGCCTG CCATCGCGGC
TGGGGCGTCT CGGTGGTGAT CGGGGTGGCG GAGGCCGGCA AGGAGATTTC CACCCGCCCG
TTCCAATTGG TGACCGGACG GGTCTGGAAA GGCACGGCGT TCGGCGGCGC TCGCGGCCGC
ACCGACGTGC CGAAGATCGT CGACTGGTAC ATGAACGGCA AGATCGAGAT CGACCCGATG
ATCACCCACG TCCTCAAATT GGACGAGATC AACAAGGGCT TCGATCTCAT GCATGAGGGC
AAGTCGATCC GCTCGGTCGT CGTGTTCTGA
 
Protein sequence
MKTRAAVAFE AKKPLEIVEV DLEGPKTGEV LVEIKATGIC HTDAYTLDGF DSEGIFPSIL 
GHEGAGIVRE VGPGVSSVKP GDHVIPLYTP ECRQCKSCLS QKTNLCTAIR ATQGKGVMPD
GTSRFSYKGQ QIFHYMGCST FSNFTVLPEI AVAKIRDDAP FDKSCYIGCG VTTGVGAVVN
TAKVTPGSNV VVFGLGGIGL NVIQGARLVG ADKIIGVDLN DDKEEWGRRF GMTHFVNPKK
IDGDIVQHLV GLTDGGADYT FDCTGNTTVM RQALEACHRG WGVSVVIGVA EAGKEISTRP
FQLVTGRVWK GTAFGGARGR TDVPKIVDWY MNGKIEIDPM ITHVLKLDEI NKGFDLMHEG
KSIRSVVVF