Gene RPD_3495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3495 
Symbol 
ID4024009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3885683 
End bp3886735 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content67% 
IMG OID637963699 
Productalcohol dehydrogenase GroES-like protein 
Protein accessionYP_570619 
Protein GI91977960 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGCT TTCGCGTCTC CGGCTTCGGT CAGCCGCTCA GCGAAGACAA CCGGCCGACG 
CCGGAATTGA CCGGCACGCA GGTGCTGGTC CGCGTCAAGG CCGCCGGCAT CTGCCACAGC
GATCTGCACA TCTGGGAAGG CGGCTACGAA CTCGGCCACG GCCGCAAGAA ATTGTCGCTC
GCCGATCGCG GCGTATCGCT GCCGCTGACG ATGGGCCACG AGACCGTCGG CGAAGTGGTT
GCCGCCGGTC CGGATGCGAA GGATGCCAAG ATCGGCGACG TCGCGCTGGT CTATCCGTGG
ATCGGCTGCG GCCAATGCGA GGTCTGCCGC GCCGGTGACG AGAACATGTG CCTGAAGCCG
CGCTTCCTCG GCGTGTATTG CGACGGCGGC TATTCGGACG AACTGATCGT GCCGCATCCG
CGCTATCTGC TCAGCCTCGA AGGTCTCGAT CCGGTGACGG CCGCGCCCTA CGCCTGTTCG
GGCGTCACCA CCTACAGCGC GCTGAAGAAA CTCGAATTCG CGTTCGCCGG TCCGATCGTG
ATGTTCGGCG CCGGCGGGCT CGGCCTGATG GCGCTGTCGC TGCTCAAGGC GATGGGCGGC
AAGGGCGCGA TCATGGTCGA TATCGACGCC AGCAAGCGCG AAGCCGCCGA AAAGGCCGGC
GCGATGGCGA CGGTCGATGG CGCGGCGCCG GATGCGCTGG AGCAGATCGC CGCGAAAGCC
GGCGCTCCGG TGCGCGGTGC GCTCGATCTG GTCGGCAACG CCCAGACCGC GCAGCTCGGT
TTCGACTGTC TCACCAAGGG CGGCAAGCTG GTGATCGTCG GGCTGTTCGG CGGCGGCGCG
CCATGGGCGC TGCCGTTCAT CCCGATGCGG GCGATCACCA TCCAGGGCAG CTATGTCGGC
AATCTGCGCG AGACGCAGGA ATTGCTCGAC CTCGTGCGTA CCAAGAAGAT CGCGCCGATC
CCGGTCACGC CGCTGCCGCT GCAGAAGGCG AACGATGCGT TGATCGACCT GCAGAACGGC
AGGCTGGTCG GCCGTGCGGT GTTGACGCCG TAG
 
Protein sequence
MKSFRVSGFG QPLSEDNRPT PELTGTQVLV RVKAAGICHS DLHIWEGGYE LGHGRKKLSL 
ADRGVSLPLT MGHETVGEVV AAGPDAKDAK IGDVALVYPW IGCGQCEVCR AGDENMCLKP
RFLGVYCDGG YSDELIVPHP RYLLSLEGLD PVTAAPYACS GVTTYSALKK LEFAFAGPIV
MFGAGGLGLM ALSLLKAMGG KGAIMVDIDA SKREAAEKAG AMATVDGAAP DALEQIAAKA
GAPVRGALDL VGNAQTAQLG FDCLTKGGKL VIVGLFGGGA PWALPFIPMR AITIQGSYVG
NLRETQELLD LVRTKKIAPI PVTPLPLQKA NDALIDLQNG RLVGRAVLTP