Gene Information |
Locus tag | Rpal_4070 |
Symbol | |
ID | 6411754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 4372356 |
End bp | 4373183 |
Gene Length | 828 bp |
Protein Length | 275 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 642713952 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_001993041 |
Protein GI | 192292436 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
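
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon. The record does not state either convention, but both fit the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4372356
END_BP = 4373183


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp):
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 828
print(expected_protein_length(length_bp))  # 275
```

Both results agree with the Gene Length (828 bp) and Protein Length (275 aa) fields.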
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0214725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGAGC GCGCCGCCCT GCCCCTGCAC GGCCGCCGCG CGATGGTCAC CGGAGCCGCG CAAGGCATCG GCCTCGCGAT CGCCCGCACG CTGGCACTGC ACGGCGCGCA GGTCGTGCTG GTCAACCTCA AGCACGAAGC CGGCGAAGCC GCGGCACGCG CGATCACGGA CGCAGGTGGC GATGCGCGGT TCATCGTCGG CGACATGAGC GACGAGGATG CGATCGCCGC GACAGTGGCG GCGAGCGCCG CCGCAATCGG CGGCCTCGAT ATCCTGGTCA ACAACGCCGC GCCGACGCAG CGGGCGCGGC CGCCATTCGC CGAGCAGAGC GCCGCGCTGT GGGATGCGAC CGAAGCGGTG ATGCTGCGCG GTTACATGCT CACCGCGCAG GCCGCGCTGC CGCATCTGGC GCAAGCGCGC GGCGCGATCG TCAATCTGTC GTCGGTGCTG GCGCGCAGCG TGGCGCACGA AACCGCCGCC TATCACGTCG CCAAGGCCGG CGTCGAACAG CTCACCCGCT ATCTCGCCTG GCATCTCGGC CGCAGCGGCG TGCGGGTCAA CGCGGTGGCG CCCGGGGTGG TCGATCGCGA TATCGGCGCC AAGCTCAGCG ACGATCCGGT CAATCGGGCT GTGCTCGAAG CTGCGGTGCC GCTCGGCCGT GCCGCCACCG GCACCGAGAT CGCCGAGGTC GTGGCGTTCC TGTGTTCGCC GGCGGCCGGT TACATCACCG GTCAGACGCT GGTGATCGAC GGCGGATTGT CGCTCGGCGA GCCGTTCGGC GTCGCCCGCG CCACGCTGCG CGCGGCGGCC GGAGTCGGAG CCGCTTAA
|
Protein sequence | MSERAALPLH GRRAMVTGAA QGIGLAIART LALHGAQVVL VNLKHEAGEA AARAITDAGG DARFIVGDMS DEDAIAATVA ASAAAIGGLD ILVNNAAPTQ RARPPFAEQS AALWDATEAV MLRGYMLTAQ AALPHLAQAR GAIVNLSSVL ARSVAHETAA YHVAKAGVEQ LTRYLAWHLG RSGVRVNAVA PGVVDRDIGA KLSDDPVNRA VLEAAVPLGR AATGTEIAEV VAFLCSPAAG YITGQTLVID GGLSLGEPFG VARATLRAAA GVGAA
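
The relationship between the two sequences above can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record. It assumes the spaces only separate 10-letter display groups, and it relies on translation table 11 assigning sense codons the same amino acids as the standard code (table 11 differs in which start codons it permits). The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that splits the sequence into display groups."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three display groups of the gene sequence.
print(translate("ATGAGCGAGC GCGCCGCCCT GCCCCTGCAC"))  # MSERAALPLH
```

The printed residues match the first group of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the listed GC content.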