Gene Information |
Locus tag | RPB_2094 |
Symbol | |
ID | 3908508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 2379551 |
End bp | 2380285 |
Gene Length | 735 bp |
Protein Length | 244 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883987 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_485711 |
Protein GI | 86749215 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
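The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2379551
end_bp = 2380285

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: unspliced CDS, one codon per residue, stop codon not counted.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 735 244
```

Both values agree with the Gene Length and Protein Length fields listed above.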
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00509935 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCGACC GACTGAAAGG CAAGCGCGCA TTCGTCACGG CAGCGGCGGC GGGGATCGGC CGGGCGTCGG CGATCGCGTT CGTGCGCGAG GGCGCCGAGG TGTTCGCCAC CGACATCGAC GAGGCCGGGC TCGCCTCGCT CGCCAAGGAA GGCATCGGCG AAGCGGCGAA ACTCGACGTA CGCGACAGCG ACGCGGTGGC CGCGATCGCA AAGCAGGTCG GCGCCGTCGA CATCCTGCTC AACGCCGCGG GCTTCGTGCA TCACGGCACC GTGCTCGACT GTTCGGACGC CGATTGGGAC TTTTCGTTCG ACCTCAACGT CAAGTCGATG CACCGCACCA TCCGCGCGTT CCTGCCGGGG ATGCTGGAGA AAGGCGGCGG TTCGATCGTC AACATCTCGT CCGCCGCCGG CGTCTTCAAG GCGGCGCCGA ACCGCTACGT CTATGGCGCG ACCAAAGCCG CGGTCGCAGC ACTCACGCGC GCGATCGCCG CCGACTTCAT CACCCGCGGC ATCCGCTGCA ACGCGATCTG CCCGGGCACG ATCGAGACGC CGTCGATGCT CGGTCGCGCC GCCGCCGCGG GCCCTCAGGG CCGCGAGATG TTCGTGGCGC GCCAGCCGAT GGGTCGGCTC GGCACCGCCG AGGAAATCGC AGCACTCGCG GTGTATCTCG CCAGCGACGA AAGCGCCTTC ACCACCGGCG TCGCGCACAT CATCGACGGC GGCTGGACGT TGTAA
|
Protein sequence | MSDRLKGKRA FVTAAAAGIG RASAIAFVRE GAEVFATDID EAGLASLAKE GIGEAAKLDV RDSDAVAAIA KQVGAVDILL NAAGFVHHGT VLDCSDADWD FSFDLNVKSM HRTIRAFLPG MLEKGGGSIV NISSAAGVFK AAPNRYVYGA TKAAVAALTR AIAADFITRG IRCNAICPGT IETPSMLGRA AAAGPQGREM FVARQPMGRL GTAEEIAALA VYLASDESAF TTGVAHIIDG GWTL
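The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a sketch only: the helper names `translate_cds` and `gc_percent` are hypothetical, and it assumes the gene sequence is given on the coding strand (it starts with ATG even though the gene lies on the minus strand). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces that split the record's sequence into blocks of ten."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence in this record.
print(translate_cds("ATGTCCGACC GACTGAAAGG CAAGCGCGCA"))  # MSDRLKGKRA
# Last two blocks, ending in the TAA stop codon.
print(translate_cds("GGCTGGACGT TGTAA"))  # GWTL
```

The two outputs match the first ten and last four residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.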