Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4687 |
Symbol | |
ID | 6412373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5045873 |
End bp | 5046661 |
Gene Length | 789 bp |
Protein Length | 262 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 642714566 |
Product | 3-hydroxybutyrate dehydrogenase |
Protein accession | YP_001993653 |
Protein GI | 192293048 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | [TIGR01963] 3-hydroxybutyrate dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAATC TGACAGGCAA GACCGCGGTC GTGACCGGCT CGACCTCGGG CATCGGGCTC AGCTATGCCC GCGCCTTCGC CAAGGCCGGC GCCAACGTTG TGATCAACGG CATGGGCGAC GCGGACGCGA TCGAGAAGGA ACGCAAGGCG ATCGAGAGCG AATTTGCGGT CAAGGCAGTG TACTCGCCGG CCGACATGCT GAAGCCGGCC GAGATAGCCG AGATGATCAA GCTCGGCGAG ACCACCTTCG GTTCGGTCGA CATCCTGGTC AATAATGCCG GCATCCAGTT CGTGTCGCCG ATCGAGGATT TTCCGATCGA GAAGTGGGAT GCGATCATCG GCATCAACCT GTCGTCGGCG TTTCATGGCA TCCGCGCCGC GGTGCCGGGG ATGAAGAAGC GCGGCTGGGG CCGTATCATC AACACCGCCT CGGCGCACTC GCTGGTCGCA TCGCCCTTCA AGTCGGCCTA TGTGGCGGCC AAGCACGGTA TCGCGGGCCT GACCAAGACG GTGGCGCTGG AGCTCGCCAC CCACAAGATC ACCTGCAACT GCATCTCGCC CGGCTACGTC TGGACGCCGC TGGTCGAGAA GCAGATCCCC GACACCATGA AGGCGCGTAA CCTAACCAAG GAGCAGGTGA TCAACGACGT CCTGCTCGCC GCCCAGCCGA CCAAGCAGTT CGTCACCCCT GAGCAGGTCG CCGCGCTCGC GGTCTATCTC TGCGGCGACG ACGCCAGCCA GATCACCGGC GCCAACCTGT CGATGGACGG CGGCTGGACC GCGGCGTAA
|
Protein sequence | MTNLTGKTAV VTGSTSGIGL SYARAFAKAG ANVVINGMGD ADAIEKERKA IESEFAVKAV YSPADMLKPA EIAEMIKLGE TTFGSVDILV NNAGIQFVSP IEDFPIEKWD AIIGINLSSA FHGIRAAVPG MKKRGWGRII NTASAHSLVA SPFKSAYVAA KHGIAGLTKT VALELATHKI TCNCISPGYV WTPLVEKQIP DTMKARNLTK EQVINDVLLA AQPTKQFVTP EQVAALAVYL CGDDASQITG ANLSMDGGWT AA
|
| |