Gene Information |
Locus tag | RPD_3158 |
Symbol | |
ID | 4023663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 3510848 |
End bp | 3511864 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637963359 |
Product | alcohol dehydrogenase GroES-like protein |
Protein accession | YP_570285 |
Protein GI | 91977626 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases |
TIGRFAM ID | [TIGR02817] zinc-binding alcohol dehydrogenase family protein |
|
|
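The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon.

```python
# Values copied from the Gene Information table.
start_bp = 3510848
end_bp = 3511864

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
STOP_CODON_NT = 3
protein_length = (gene_length - STOP_CODON_NT) // 3

print(gene_length)     # 1017, matching "Gene Length"
print(protein_length)  # 338, matching "Protein Length"
```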
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.187629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.496916 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCTA TCGGCTATAC GAAACCCCTT CCGATCGACG ACGCTGATGC CCTGATCGAG TTCGATACGC CGCGGCCCGA GCCCGGCCCG CGCGATCTGC GCGTCGCGGT CAAGGCGATC TCGGTCAACC CGGTCGACTT CAAGGTCCGC AACCGGGCCG CACCGCCGGC AGGCGAGACC AAGATCCTCG GCTACGACGC CGCCGGCGTG GTCGAAGCGA TCGGCAGCGA CGTCTCACTG TTCAAGCCGG GCGACGAAGT GTTCTACGCA GGCTCGATCC AGCGCCCCGG CACCAATGCC GAATTTCATC TGGTCGACGA GCGCATCGTC GGCCGCAAGC CGACCACCCT CTCGTTCGCG CAGGCCGCGG CGCTGCCGCT GACCTCGATC ACCGCGTGGG AATTGCTGTT CGACCGGCTC GGCGTGCGGC CGGGCAAGGC CTACGATCCG CGTACATTGC TGATCACCGG CGGGGCCGGC GGCGTCGGCT CGATCCTGAT CCAACTCGCG CGAAAACTCA CGTCGCTGAC CGTGATCGCG ACCGCATCGC GGCCCGAGAC CGAGACATGG TGCCGCGCGC TCGGCGCCAA TGCGGTGATC GATCATTCCA AGCCGATGAA GCCGCAGATC GACGCGCTGA AGCTGCCGCC GGTGGCGCTG ATCGCCAGCC TCATCGGCAC CGAGCAACAC TTTCCGGCGC TGGTGGAGAT TCTCGCGCCG CAGGGCAAGA TCGCATTGAT CGACGATCCG GCGTCGCTGA ATCCGATGCT GCTCAAGCCG AAATCCGCAT CGCTGCATTG GGAGGCGATG TTTGTGCGCT CGACCTTCAC GACCGCCGAC ATGATCGCGC AGCACGATCT CCTGAACGAA GTCGCCGATC TGATCGACGC CGGCGTGCTG CGCACCACGC TGGAGCAAAC CTTCGGCGCC ATCAACGCAG CGAATCTCAA GCGCGCCCAC GCGTTGCTGG AGAGCGGAAA ATCGGTCGGC AAGATCGTGC TGGAGGGGTG GGAGTAG
|
Protein sequence | MKAIGYTKPL PIDDADALIE FDTPRPEPGP RDLRVAVKAI SVNPVDFKVR NRAAPPAGET KILGYDAAGV VEAIGSDVSL FKPGDEVFYA GSIQRPGTNA EFHLVDERIV GRKPTTLSFA QAAALPLTSI TAWELLFDRL GVRPGKAYDP RTLLITGGAG GVGSILIQLA RKLTSLTVIA TASRPETETW CRALGANAVI DHSKPMKPQI DALKLPPVAL IASLIGTEQH FPALVEILAP QGKIALIDDP ASLNPMLLKP KSASLHWEAM FVRSTFTTAD MIAQHDLLNE VADLIDAGVL RTTLEQTFGA INAANLKRAH ALLESGKSVG KIVLEGWE
|
| |
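The gene and protein sequences above can be related by translating the nucleotide sequence codon by codon. The sketch below is illustrative only: the function names `translate` and `gc_fraction` are hypothetical, and it uses just the first 30 and last 18 nucleotides of the gene sequence (with the display spaces removed) rather than the full record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code; the tables differ only in which start codons are permitted.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpts copied from the "Gene sequence" field (first 30 and last 18 nt).
head = "ATGAAAGCTA TCGGCTATAC GAAACCCCTT"
tail = "CTGGAGGGGTGGGAGTAG"

print(translate(head))  # MKAIGYTKPL, the first ten residues of the protein
print(translate(tail))  # LEGWE, the last five residues before the TAG stop
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the "GC content" field; the result is not shown here because only excerpts are used.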