Gene RPB_1871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1871 
Symbol 
ID3908066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp2134880 
End bp2135932 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content68% 
IMG OID637883765 
Productalcohol dehydrogenase 
Protein accessionYP_485490 
Protein GI86748994 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.843746 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.357791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGCT TTCGCGTCTC GGGCTTCGGC CAGCCGCTTA GCGAAGACAA CCGGCCGACG 
CCGGAATTGA CCGGCACGCA GGTGCTGCTG CGCGTCAAGG CCGCCGGCAT CTGCCACAGC
GATCTGCACA TCTGGGAGGG CGGCTACGAA CTCGGCCACG GCCGCAAGAA GCTGTCGCTG
GCCGATCGCG GCGTGGCATT GCCGCTGACG ATGGGGCACG AGACCGTCGG CGAGATCGTC
GCCGCAGGAC CCGACGCCAA GGATGCGAAG ATCGGCGATG TCGCGCTGGT GTATCCGTGG
ATCGGCTGCG GCCAATGCGC GGTGTGTCGC GAAGGCGACG AGAACATGTG CCTCAAGCCG
CGGTTCCTCG GCGTGTATTG CGACGGCGGC TATTCCGACG AACTGATCGT GCCGCATCCG
CGCTATCTGC TCAGCCTCGA CGGGCTCGAT CCGGTGACCG CGGCGCCGTA TGCGTGTTCG
GGCGTCACCA CCTACAGCGC GCTGAAGAAG CTGGAATTCG CCTTCGACGG TCCGATCGTG
ATGTTCGGCG CCGGCGGGCT CGGGCTGATG GCGCTGTCGC TGCTGAAGGC GATGGGCGGC
AAGGGCGCGA TCATGGTCGA TATCGACGCC AGGAAGCGCG AGGCGGCGGA GCAGGCCGGC
GCGATGGCGA CGGTCGACGG CGCGGCGCCC GACGCGCTGG AGCAAATCGC CAAGAAGGCC
GGCGCGCCGG TGCGTGGCGC GCTCGACCTC GTCGGCAATT CGCAGACCGC GCAACTCGGC
TTCGACTGTC TCACCAAAGG CGGCAAGCTG GTGATCGTCG GCCTGTTCGG CGGCGGCGCG
CCATGGGCGC TGCCGTTCAT CCCGATGCGC GCGATCACGA TTCAGGGCTC GTATGTCGGC
AATCTGCGCG AGACCCAGGA ACTGCTCGAT CTGGTGCGCG CCAACAAGAT CGCGCCGATT
CCGGTGACGC CGCTGCCGCT GCCCAAGGCC AACGAGGCGC TGATGGATCT GCAGAAGGGG
CGGTTGGTCG GCCGCGCGGT GCTGACGCCG TGA
 
Protein sequence
MKSFRVSGFG QPLSEDNRPT PELTGTQVLL RVKAAGICHS DLHIWEGGYE LGHGRKKLSL 
ADRGVALPLT MGHETVGEIV AAGPDAKDAK IGDVALVYPW IGCGQCAVCR EGDENMCLKP
RFLGVYCDGG YSDELIVPHP RYLLSLDGLD PVTAAPYACS GVTTYSALKK LEFAFDGPIV
MFGAGGLGLM ALSLLKAMGG KGAIMVDIDA RKREAAEQAG AMATVDGAAP DALEQIAKKA
GAPVRGALDL VGNSQTAQLG FDCLTKGGKL VIVGLFGGGA PWALPFIPMR AITIQGSYVG
NLRETQELLD LVRANKIAPI PVTPLPLPKA NEALMDLQKG RLVGRAVLTP