Gene Rpal_3639 details


Gene Information

Locus tag: Rpal_3639
Symbol:
ID: 6411315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 3899457
End bp: 3900473
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 67%
IMG OID: 642713519
Product: zinc-binding alcohol dehydrogenase family protein
Protein accession: YP_001992614
Protein GI: 192292009
COG category: [C] Energy production and conversion
              [R] General function prediction only
COG ID: [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases
TIGRFAM ID: [TIGR02817] zinc-binding alcohol dehydrogenase family protein
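
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3899457, 3900473)
print(length)                     # 1017
print(protein_length_aa(length))  # 338
```

Both values agree with the Gene Length and Protein Length fields of the record.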


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.103532
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCTG TCGGCTACTC CAAGAGCCTC CCGATCGACG ATCCGGAGGC ACTGCTCGAT 
CTTGAACTGC CCACGCCCGA ACCGGGTCCA CGCGACCTGC GGGTTTCGGT GAAGGCGATC
TCGGTCAATC CGGTCGACTT CAAGGTGCGC AAGCGCGCCG CCCCGCCCGC CGGCGAACCC
AAGATCCTCG GTTACGACGC GGCCGGCGTG GTCGAGGCGG TCGGCGCCGA GGTGACGCTG
TTCAAACCGG GCGACGAGGT GTTTTACGCC GGCTCGATCC AGCGGCCGGG CACGAACGCC
GAACAGCATC TGGTCGACGA GCGCATCGTC GGCCGCAAAC CAAAGACGCT GTCGTTCGCG
CAGGCCGCGG CGCTGCCGCT GACCTCGATC ACCGCCTGGG AATTGCTGTT CGACCGGCTC
GGCGTCGTGC CGAGCAAGGC GTTCGATCCG CGCACGCTGC TGATCGTCGG CGGCGCCGGC
GGCGTCGGCT CGATCCTGAT CCAGCTCGCG CGCCGCCTCA CCGGGCTGAC CATCATCGCC
ACGGCGTCGC GGCCGGAAAC GCAGGCATGG TGCCTCGACC TCGGCGCCCA TGCGGTGATC
GATCACAGCC ATCCGATGAA GCCGCAGGTC GAAGCGCTGA AACTTCCGCC GGTTGCGCTG
ATCGCTAGCC TCACCGGCAC CGAGGGGCAT TTCGCCGGCC TGGTCGACAT CCTGGCGCCG
CAGGGCAAGA TCGGCCTGAT CGACGATCCG GCGACACTGA ACCCGATGCT GCTGAAGCCG
AAGTCAGCGT CGCTGCACTG GGAGGCGATG TTCGCCCGCT CGTCGTATCA GACCGCCGAC
ATGATCGCGC AGCACGACCT GCTCGACGAG ATCGCCGGCC TGATCGACAC CGGCGTGCTC
CGCACCACGC TGGACAAGAC CTTCGGCACG ATCACCGCCG CCAACCTGAA ACGCGCCCAC
GCCCTGCTGG AGAGCGGCAC ATCGATCGGG AAGATCGTGC TGGAGGGATG GGAGTAG
 
Protein sequence
MKAVGYSKSL PIDDPEALLD LELPTPEPGP RDLRVSVKAI SVNPVDFKVR KRAAPPAGEP 
KILGYDAAGV VEAVGAEVTL FKPGDEVFYA GSIQRPGTNA EQHLVDERIV GRKPKTLSFA
QAAALPLTSI TAWELLFDRL GVVPSKAFDP RTLLIVGGAG GVGSILIQLA RRLTGLTIIA
TASRPETQAW CLDLGAHAVI DHSHPMKPQV EALKLPPVAL IASLTGTEGH FAGLVDILAP
QGKIGLIDDP ATLNPMLLKP KSASLHWEAM FARSSYQTAD MIAQHDLLDE IAGLIDTGVL
RTTLDKTFGT ITAANLKRAH ALLESGTSIG KIVLEGWE
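
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch for turning such a block into a contiguous string and computing its GC percentage is given below; the helper names are hypothetical, and the short example string is just the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


example = "ATGAAAGCTG TCGGCTACTC"
seq = clean_sequence(example)
print(len(seq), gc_percent(seq))
```

Applying `clean_sequence` to the whole gene sequence block and passing the result to `gc_percent` should give a value comparable to the GC content field of the record, assuming that field was computed over the same bases.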