Gene RPD_1869 details

Gene Information

Locus tag: RPD_1869
Symbol:
ID: 4022351
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2092970
End bp: 2093842
Gene Length: 873 bp
Protein Length: 290 aa
Translation table: 11
GC content: 64%
IMG OID: 637962062
Product: short-chain dehydrogenase/reductase SDR
Protein accession: YP_569005
Protein GI: 91976346
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
              [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
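
The coordinates and lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2092970
end_bp = 2093842
gene_length_bp = 873
protein_length_aa = 290

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A CDS of N bp holds N / 3 codons; the final codon is assumed to be the stop.
codons = span_bp // 3
residues = codons - 1

print(span_bp, codons, residues)  # 873 291 290
```

Under those assumptions the computed span matches the reported gene length, and the codon count minus one matches the reported protein length.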


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.19411
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTACCG ATCAGCTTCT CGCAGGCCGC CGTATCCTCG TCACCGGCGG AGGCACAGGC 
CTCGGCAAAT CGATGGCCGC GCGCTTCCTG CAGCTCGGCG CCGAAGTCCA CATTTGCGGC
CGCCGCAAGG GCGTCTGTGA CGAGACCGCG ACCGAATTGA TGGACGCCTA TGGCGGCAAG
GTGATGACTT ACGGCGTCGA CATCCGCGAC GCCGGCGCGG TCGACCACAT GGTCGAGACC
ATTTTCAGCG GTGGTCCCCT CACCGACTTG ATCAACAACG CCGCCGGGAA TTTCATCTCG
CGGACGGAAG AGCTGTCGCC GCGCGGCTTC GACGCCGTCG CCAACATCGT GATGCACGGA
ACGTTTTACG TCACCCACGC GGTCGGCAAA CGCTGGATCG AAGGCGGCCA TCGCGGCAAC
GTGGTCTCGA TCACCACGAC ATGGGTGCGC AACGGAAGCC CCTATGTGGT CCCCTCGGCG
ATGAGCAAAT CGGCGATCCA CGCGATGACG ATGTCGCTCG CCACCGAATG GGGCCGTTAC
GGCATTCGCC TCAACACCAT CGCGCCGGGT GAAATCCCGA CCGAAGGCAT GAGCAAGCGG
ATCAAGCCCG GCGACGAGGC CGGCGCGCGC ACCGTCAAGA TGAATCCGAT GGGCCGGGTC
GGCACCATGG AGGAACTGCA AAACGTCGCA GTGTTCCTGA TCTCCGGCGG CTGCGACTGG
ATCAACGGCG AGACCATCGC GATGGACGGC GCCCAGGGCC TGGCGATGGG CGGCAACTTC
TACCAGCTCC GCGACTGGAG CAACGCCGAC TGGGACCAGG CCAAGGCCTC GATCAAGGCG
CAAAACGAAA AGGACCGCGC GCAACGGGGA TGA
 
Protein sequence
MFTDQLLAGR RILVTGGGTG LGKSMAARFL QLGAEVHICG RRKGVCDETA TELMDAYGGK 
VMTYGVDIRD AGAVDHMVET IFSGGPLTDL INNAAGNFIS RTEELSPRGF DAVANIVMHG
TFYVTHAVGK RWIEGGHRGN VVSITTTWVR NGSPYVVPSA MSKSAIHAMT MSLATEWGRY
GIRLNTIAPG EIPTEGMSKR IKPGDEAGAR TVKMNPMGRV GTMEELQNVA VFLISGGCDW
INGETIAMDG AQGLAMGGNF YQLRDWSNAD WDQAKASIKA QNEKDRAQRG
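
The sequences above are laid out in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch of how one might strip that layout, estimate GC content, and translate the gene. It is not the database's own method. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence, copied from the listing above.
first_row = "ATGTTTACCG ATCAGCTTCT CGCAGGCCGC CGTATCCTCG TCACCGGCGG AGGCACAGGC"
dna = clean_sequence(first_row)
print(len(dna))        # 60
print(translate(dna))  # MFTDQLLAGRRILVTGGGTG
```

The translated first row agrees with the first 20 residues of the protein sequence in this record. Passing the full gene sequence through `gc_fraction` would give a value to compare against the reported GC content of 64%.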