Gene Information |
Locus tag | RPC_2983 |
Symbol | |
ID | 3969905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 3264025 |
End bp | 3264900 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637926094 |
Product | short-chain dehydrogenase/reductase SDR |
Protein accession | YP_532847 |
Protein GI | 90424477 |
COG category | [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
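The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 3264025
end_bp = 3264900
gene_length_bp = 876
protein_length_aa = 291


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 876
print(expected_protein_length(gene_length_bp))  # 291
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.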
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.888321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.289512 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAA GCCCTGAGGG ACAACGAACC GCCTTGGTCA CTGGCGCCAA CTCGGGCCTT GGCAAGGCGA TCTCCACGGC GCTGGCGGCC GAGGGCATTC GCGTTGGCCT GGTGGTCCGC GACCGCGCGC GGGGCGAAAA GGCGTTACAA GACATCCGCG CCGCCACGGG CAACGAAGAT CTGCATCTGT TCGTCGCCGA TCTGGCCGAT CAATCTGCGA TCCGCGCGCT GGCGCAGGCG GTTCACCAGC GCCTCGGCCG CCTGGAGCTG CTGGTGAACA ATGCCGGCAC CGCCTTCCCG GAACGGCGGC TCAGCCCGGA CGGAATCGAG TGCGCGCTGG CGGTGAACCA TCTCGGCCCG TTTCTCCTCA CCACCCTGCT GCTGGATCTG CTCAAAGCCA GCGCGCCGGC GCGGATCGTC AACGTCGGCA CCCGCATCAC CACGGCGATG GAGTTCGAGG ATTTGAATTG GGAGCGGCGG CCCTACAGCA TGATGAAGGG CTACGGCCAA TCCAAGCTCG GCAATCTGCA CTTCACCTTC GAACTGGCGC GCCGGCTTGA GGGCTCCGGC GTGACGGTGA ATTGCGTATT CCCCGGCGTC TTCAAATCGA ATCTGGGCGG CACCGATGGC GCCCAAGGCG TGTTGTTGAA GCTGTTTGCC CGGCTGCTGG GCTGGGCGAT CCCTTCGCCC GAGAAGGCGG CGCGGCGCGT GCTCTATCTC GCCAACGCCC CCGAATTGGC GAACGTCAGT GGTCGGTATT TTGGCGATCG CAAGACCATC CCGGCGCCCG CACAGACGCT CGACCCGCAA GCCAACCGAC GGCTGTGGCA GATCAGCGAG GCGCTGACGG CCGAACCGGC GGTGGCGGGC AGCTGA
|
Protein sequence | MSTSPEGQRT ALVTGANSGL GKAISTALAA EGIRVGLVVR DRARGEKALQ DIRAATGNED LHLFVADLAD QSAIRALAQA VHQRLGRLEL LVNNAGTAFP ERRLSPDGIE CALAVNHLGP FLLTTLLLDL LKASAPARIV NVGTRITTAM EFEDLNWERR PYSMMKGYGQ SKLGNLHFTF ELARRLEGSG VTVNCVFPGV FKSNLGGTDG AQGVLLKLFA RLLGWAIPSP EKAARRVLYL ANAPELANVS GRYFGDRKTI PAPAQTLDPQ ANRRLWQISE ALTAEPAVAG S
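The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a string could be cleaned, its GC percentage computed, and the CDS translated is given below. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (table 11 differs mainly in its permitted start codons), and it uses only the first 30 bases of the gene sequence as input. The helper names clean, gc_percent and translate are hypothetical.

```python
# Codon table built from the standard compact encoding (base order T, C, A, G).
# Assumption: translation table 11 shares these amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence shown above.
prefix = "ATGAGCACAA GCCCTGAGGG ACAACGAACC"
print(translate(prefix))  # MSTSPEGQRT
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to gc_percent would give a value to compare against the GC content field.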