Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0022 |
Symbol | |
ID | 3909705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 22420 |
End bp | 23301 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637881903 |
Product | short chain dehydrogenase |
Protein accession | YP_483645 |
Protein GI | 86747149 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCAC TCGCGGGCAA GACGCTGTTC ATCACCGGGG CCAGCCGCGG CATCGGGCTG GCGATCGCGC TGCGGGCGGC GCGCGACGGC GCCAATGTGG CGATCGCCGC CAAGACCGCC GAGCCGCAGC CCAAGCTGAA GGGCACGATC TACACTGCGG CCGACGAGAT CCGCGCCGCC GGCGGGCAGG CGCTGCCGCT GATCTGCGAC ATCCGCGACG AGGCGCAAGT GATCGCCGCG ATCGACAAGA CGGTGGCCGA ATTCGGCGGC ATCGACATCT GCGTCAACAA TGCCAGCGCG ATCAGCCTGA CCAATTCGCA GGCGACCGAC ATGAAGCGCT ACGACCTGAT GATGGGCATC AACAGCCGCG GCACCTTCAT GGTGTCGAAA TACTGCATCC CGCATCTGAA GAAGGCGGCC AACCCGCACA TCCTGATGCT GTCGCCGCCG CTCGACATGA AGGCGAAATG GTTCGCGGCC TCGACCGCCT ACACCATGGC CAAATTCGGC ATGAGCATGG TGGTGCTGGG ATTGTCGGGT GAATTGAAGG GCGCGGGAAT CGCCGTCAAC GCGCTGTGGC CGCGCACCAC CATCGCCACC GCCGCGGTCG GCAATCTCTT GGGCGGCGAC GCGATGATGC GCGCCAGCCG CACGCCGGAG ATCATGGGCG ACGCCGCGCA CGCGATCCTG ACCAAGCCGT CGCGCGACTT CACCGGGCAG TTCTGCATCG ACGACAAGGT GCTGTATGAG GCCGGCGTCA CCGATTTCGA GCGCTACCGC GTCGATCCGA GCGTGCCATT GATGTCGGAT TTCTTCGTGC CGGACGACGA CGTGCCCCCG CCCGGCGTCA CCGTCGCGTC GCTGCCGGGC GCGAAGGGGT AG
|
Protein sequence | MSSLAGKTLF ITGASRGIGL AIALRAARDG ANVAIAAKTA EPQPKLKGTI YTAADEIRAA GGQALPLICD IRDEAQVIAA IDKTVAEFGG IDICVNNASA ISLTNSQATD MKRYDLMMGI NSRGTFMVSK YCIPHLKKAA NPHILMLSPP LDMKAKWFAA STAYTMAKFG MSMVVLGLSG ELKGAGIAVN ALWPRTTIAT AAVGNLLGGD AMMRASRTPE IMGDAAHAIL TKPSRDFTGQ FCIDDKVLYE AGVTDFERYR VDPSVPLMSD FFVPDDDVPP PGVTVASLPG AKG
|
| |