Gene Information |
Locus tag | Rpal_3639 |
Symbol | |
ID | 6411315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 3899457 |
End bp | 3900473 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642713519 |
Product | zinc-binding alcohol dehydrogenase family protein |
Protein accession | YP_001992614 |
Protein GI | 192292009 |
COG category | [C] Energy production and conversion [R] General function prediction only |
COG ID | [COG0604] NADPH:quinone reductase and related Zn-dependent oxidoreductases |
TIGRFAM ID | [TIGR02817] zinc-binding alcohol dehydrogenase family protein |
|
|
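The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3899457
END_BP = 3900473


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1017 338
```

Under those assumptions the computed values agree with the reported Gene Length (1017 bp) and Protein Length (338 aa).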
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.103532 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGCTG TCGGCTACTC CAAGAGCCTC CCGATCGACG ATCCGGAGGC ACTGCTCGAT CTTGAACTGC CCACGCCCGA ACCGGGTCCA CGCGACCTGC GGGTTTCGGT GAAGGCGATC TCGGTCAATC CGGTCGACTT CAAGGTGCGC AAGCGCGCCG CCCCGCCCGC CGGCGAACCC AAGATCCTCG GTTACGACGC GGCCGGCGTG GTCGAGGCGG TCGGCGCCGA GGTGACGCTG TTCAAACCGG GCGACGAGGT GTTTTACGCC GGCTCGATCC AGCGGCCGGG CACGAACGCC GAACAGCATC TGGTCGACGA GCGCATCGTC GGCCGCAAAC CAAAGACGCT GTCGTTCGCG CAGGCCGCGG CGCTGCCGCT GACCTCGATC ACCGCCTGGG AATTGCTGTT CGACCGGCTC GGCGTCGTGC CGAGCAAGGC GTTCGATCCG CGCACGCTGC TGATCGTCGG CGGCGCCGGC GGCGTCGGCT CGATCCTGAT CCAGCTCGCG CGCCGCCTCA CCGGGCTGAC CATCATCGCC ACGGCGTCGC GGCCGGAAAC GCAGGCATGG TGCCTCGACC TCGGCGCCCA TGCGGTGATC GATCACAGCC ATCCGATGAA GCCGCAGGTC GAAGCGCTGA AACTTCCGCC GGTTGCGCTG ATCGCTAGCC TCACCGGCAC CGAGGGGCAT TTCGCCGGCC TGGTCGACAT CCTGGCGCCG CAGGGCAAGA TCGGCCTGAT CGACGATCCG GCGACACTGA ACCCGATGCT GCTGAAGCCG AAGTCAGCGT CGCTGCACTG GGAGGCGATG TTCGCCCGCT CGTCGTATCA GACCGCCGAC ATGATCGCGC AGCACGACCT GCTCGACGAG ATCGCCGGCC TGATCGACAC CGGCGTGCTC CGCACCACGC TGGACAAGAC CTTCGGCACG ATCACCGCCG CCAACCTGAA ACGCGCCCAC GCCCTGCTGG AGAGCGGCAC ATCGATCGGG AAGATCGTGC TGGAGGGATG GGAGTAG
|
Protein sequence | MKAVGYSKSL PIDDPEALLD LELPTPEPGP RDLRVSVKAI SVNPVDFKVR KRAAPPAGEP KILGYDAAGV VEAVGAEVTL FKPGDEVFYA GSIQRPGTNA EQHLVDERIV GRKPKTLSFA QAAALPLTSI TAWELLFDRL GVVPSKAFDP RTLLIVGGAG GVGSILIQLA RRLTGLTIIA TASRPETQAW CLDLGAHAVI DHSHPMKPQV EALKLPPVAL IASLTGTEGH FAGLVDILAP QGKIGLIDDP ATLNPMLLKP KSASLHWEAM FARSSYQTAD MIAQHDLLDE IAGLIDTGVL RTTLDKTFGT ITAANLKRAH ALLESGTSIG KIVLEGWE
|
| |
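The Sequence fields can be related to the GC content and Translation table entries with a short script. This is a minimal sketch using only the Python standard library; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it uses only the first and last blocks of the gene sequence as sample input rather than the full 1017 bp.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocked: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field (a truncated sample).
head = clean("ATGAAAGCTG TCGGCTACTC CAAGAGCCTC")
print(translate(head))  # prints: MKAVGYSKSL

# Last 18 bases of the Gene sequence field, which are in frame.
tail = clean("C TGGAGGGATG GGAGTAG")
print(translate(tail))  # prints: LEGWE
```

Passing the complete Gene sequence field through `clean` and then `gc_fraction` or `translate` should allow comparison with the reported GC content and Protein sequence fields.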